Ma Shijian

AI Researcher
📱 +44 7421468631
🔗 mashijian.com
💼 linkedin.com/in/mashijian
💻 github.com/IIIIQIIII
🤗 huggingface.co/FutureMa

Professional Summary

Agentic AI Researcher and founder of LocoreMind, specializing in local agent models and LLM benchmarks. Developed LocoOperator-4B (17K+ downloads, HF trending top 6) and Eva-4B-V2 (84.9% SOTA, surpassing Claude Opus 4.5 and Gemini 3 Flash). Created EvasionBench, selected as official HuggingFace benchmark case (trending top 4). Published at ICLR 2026 and ICRA 2026. Expertise in knowledge distillation, multi-model consensus evaluation, and zero-cost local agent deployment. Track record of high-impact open-source contributions with 400+ GitHub stars and 20K+ HuggingFace downloads across projects.

Professional Experience

Founder & Lead Researcher
LocoreMind - Local Agent Model Community
Feb 2026 - Present
  • Founded LocoreMind community for local agent model family, specializing in small models for private agentic workflows running on personal hardware
  • Developed LocoOperator-4B (288 likes, 17K+ downloads, HF trending top 6, 139 GitHub stars) and LocoTrainer-4B (164 likes, 1.2K+ downloads, HF trending top 8, 99 GitHub stars) via knowledge distillation from Qwen3-Coder-Next
  • Released qwen3.5-27b-cli-reasoning-3632x training dataset (52 likes, HF trending top 7) for agent research community
  • Enabled zero-cost local agent deployment via llama.cpp for multi-turn codebase exploration and tool-calling
Research Assistant
Hong Kong University of Science and Technology
Jan 2026 - Feb 2026
  • Conducted research under the supervision of Prof. Yi Yang on financial communication analysis
  • Developed Eva-4B (81.3% accuracy) and Eva-4B-V2 (84.9% SOTA, surpassing Gemini 3 Flash and Claude Opus 4.5) for detecting evasive answers in earnings call Q&A
  • Created EvasionBench with 30K training samples using multi-model consensus and LLM-as-Judge protocol; selected as official HuggingFace benchmark case (117 likes, trending top 4, 33 GitHub stars)
  • Published arXiv paper (2601.09142), featured on HF Daily Paper with 4K+ downloads and 95 likes
Research Assistant
University of Macau
Aug 2025 - Dec 2025
  • Conducted research under the supervision of Prof. Yan LIN and Prof. Hongchuan SHEN on LLM applications and agent systems
  • Created DramaBench benchmark for drama script continuation evaluation (110 likes, 490+ downloads on HuggingFace, published at ICLR 2026 Workshop)
  • Developed multiple agent systems: Financial Agent, Computer Agent, ShortStudio video platform, and multimodal emotion analysis for drama content
  • Fine-tuned Qwen3-8B for stance detection (0.77→0.93 accuracy) and open-sourced MSJ-Factory fine-tuning guide (150+ GitHub stars)
Visiting Student
UCL Robot Perception and Learning Lab
Jan 2025 - Jun 2025
  • Conducted research under the supervision of Prof. Dimitrios Kanoulas and Dr. Jianhao Jiao on embodied AI and robot navigation
  • Implemented real-time object tracking and position-based planning on Unitree Go2 quadruped robot (published at ICRA 2026)
  • Developed environmental mapping system integrating knowledge graphs, scene graphs, VLMs, and graph-based RAG for indoor navigation
Graduate Researcher - Robotics
University of Macau
Aug 2023 - Dec 2024
  • Developed embodied AI system for short video platform operations and RoboAgent hardware-software integration
  • Built unmanned surface vessels for water photography, 3D reconstruction, and obstacle avoidance with depth estimation
Graduate Researcher - Machine Learning
University of Macau
Aug 2023 - Dec 2024
  • Conducted research under the supervision of Prof. Zeng Jicheng and Prof. Lin Yan on LLM applications in content analysis
  • Collected 330K game reviews and built synthetic dataset for automated game issue detection using LLMs
  • Applied difference-in-differences methodology to study LLM impact on HuggingFace ecosystem
Undergraduate Researcher
University College Dublin
Dec 2021 - Aug 2023
  • Developed TCMFormer Traditional Chinese Medicine diagnostic model and business data visualization systems
  • Conducted machine learning research on public datasets, published multiple papers on prediction and classification tasks
Research Assistant
Green Bay Marine Technology Co., Ltd.
Jun 2020 - Oct 2022
  • Collaborated with Prof. Xiao Li's robotics team at Huazhong University of Science and Technology on USV systems
  • Developed autonomous navigation for rescue USV and implemented multi-agent reinforcement learning on vessel arrays

Technical Skills

Agent Frameworks:
LangChain, AutoGPT, BabyAGI, CrewAI, LangGraph, Semantic Kernel
LLM & APIs:
GPT, Claude, Gemini, Qwen, Mistral, OpenAI API, Anthropic API
Programming:
Python, TypeScript, JavaScript, C++
ML/DL Frameworks:
PyTorch, TensorFlow, Transformers, TRL
Vector Databases:
Pinecone, Weaviate, Chroma, Qdrant, FAISS
Tools & Platforms:
Docker, Kubernetes, AWS, GCP, FastAPI, Redis, PostgreSQL

Publications

  • Shijian Ma, Yunqi Huang, Lin Yan. "I Can't Believe LLMs Still Can't Write Drama: Multi-Dimensional Failures in Script Continuation." ICLR 2026 Workshop ICBINB, 2026.
  • Shijian Ma, Yan Lin, Yi Yang. "EvasionBench: Detecting Evasive Answers in Financial Q&A via Multi-Model Consensus and LLM-as-Judge." arXiv, 2026.
  • Qianyi Zhang, Shijian Ma, Boyi Liu, Jianhao Jiao, Dimitrios Kanoulas. "Follow Everything: A Leader-Following and Obstacle Avoidance Framework with Goal-Aware Adaptation." ICRA, 2026.
  • Shijian Ma, Shicong Ma, Jianhao Jiao. "RoboAgent: An Integrated Hardware-Software Agent for Short Video Interactions." TechRxiv, 2024.
  • Shijian Ma, Shicong Ma, Jianhao Jiao. "WaveDepth: Enabling Water Depth Estimation with Water Segmentation." TechRxiv, 2024.
  • Shijian Ma, Shicong Ma. "WaveShot: A Compact Portable Unmanned Surface Vessel for Dynamic Water Surface Videography and Media Production." arXiv, 2024. AIME accept.
  • Shijian Ma, Shicong Ma. "Exploring the Relationship between Vehicle Specifications and Fuel Efficiency using Random Forest Regression and Gradient Boosting Regression Models." ECCS, 2024.
  • Shijian Ma. "Churn Prediction in Business using Logistic Regression and Logit Boost." ICDSCA, 2023.
  • Shicong Ma, Xiaoguang Tang, Shijian Ma. "Integration of Multi-sensor Marine Environment Monitoring System for Hybrid-Power USV." ICGMRS, 2023.
  • Shijian Ma, Xiaoguang Tang, Weize Ma, Jitao Liu, Li Xiao. "An Unmanned Cleaning Robot for the Inner Wall of Large LNG Tanks." ICARM, 2022.

Key Projects

LocoreMind: Local Agent Model Family
2026
  • Founded open-source community for local agent models designed for private agentic workflows running on personal hardware
  • Developed LocoOperator-4B: 4B-parameter tool-calling agent trained via knowledge distillation from Qwen3-Coder-Next for multi-turn codebase exploration
  • Created LocoTrainer-4B and LocoTrainer framework for agent training workflows with zero API cost via llama.cpp
  • Released qwen3.5-27b-cli-reasoning-3632x dataset for agent training research
  • Achieved significant community impact: LocoOperator (288 likes, 17K+ downloads, HF trending top 6, 139 GitHub stars), LocoTrainer (164 likes, 1.2K+ downloads, HF trending top 8, 99 GitHub stars), Dataset (52 likes, HF trending top 7)
Eva-4B: Financial Evasion Detection Model
2026
  • Developed Eva-4B (81.3% accuracy) and Eva-4B-V2 (84.9% SOTA, surpassing Gemini 3 Flash and Claude Opus 4.5) for detecting evasive answers in earnings call Q&A
  • Created EvasionBench with 30K training samples via multi-model consensus; selected as official HuggingFace benchmark case
  • Community impact: Eva-4B-V2 (113 likes, 1.1K+ downloads, trending 21), EvasionBench (117 likes, trending top 4, 33 GitHub stars), Paper (4K+ downloads, 95 likes, HF Daily Paper featured)
Qwen3-4B-Evasion: Financial Communication Analysis Model
2026
  • Fine-tuned Qwen3-4B-Instruct-2507 for detecting evasion levels in earnings call Q&A responses
  • Implemented Rasiah taxonomy-based classification system with three evasion categories
  • Achieved community adoption with 36 likes and 145 downloads on HuggingFace
DramaBench: Drama Script Evaluation Benchmark
2025
  • Developed first large-scale benchmark for drama script continuation with six-dimensional evaluation framework
  • Evaluated 8 SOTA models on 1,103 scripts (8,824 evaluations); published at ICLR 2026 Workshop ICBINB
  • Community impact: 110 likes, 490+ downloads on HuggingFace, 78 GitHub stars, 16 Daily Paper upvotes
Qwen3-8B-Drama-Thinking: Creative Screenwriting AI Model
2025
  • Full parameter fine-tuned Qwen3-8B on 6,319 dramatic script samples using ms-swift framework
  • Achieved +262% output length and +80% thinking depth with explicit creative reasoning chains
  • Deployed on HuggingFace achieving trending rank #28 with 89 likes and over 2,000 downloads
Agent Systems Portfolio
2025
  • Financial Agent: Automated financial text analysis with continuous learning for evolving data
  • XAgent: X.com automation framework using Claude Agent SDK and Chrome DevTools MCP (26 GitHub stars)
  • Computer Agent: Natural language computer automation with screen capture analysis and multi-step task planning
  • ShortStudio: AI-powered video creation platform analyzing trending content and generating similar videos
  • Multimodal Emotion Analysis: Drama content analysis combining visual, audio, and textual features
MSJ-Factory: Qwen Model Fine-tuning Framework
2025
  • Created comprehensive guide for fine-tuning Qwen2.5-Coder for Chinese sentiment analysis
  • Fine-tuned Qwen3-8B model for stance detection using Zhihu data
  • Achieved significant accuracy improvement from 0.77 to 0.93 in stance detection tasks
  • Open-sourced project gaining 150+ GitHub stars for community contribution
FollowEverything: Object Following System for Quadruped Robots
2025
  • Implemented real-time object tracking and position-based planning on Go2 quadruped robot
  • Achieved open-set object following capabilities for diverse target objects
PKER: Prior Knowledge-assisted Environmental Representation
2025
  • Created maps by integrating predefined knowledge graphs with real-time observations
  • Integrated scene graphs, vision-language models, and graph-based RAG
  • Applied to indoor navigation and scene understanding tasks
Unmanned Surface Vessel Research
2024
  • RoboAgent: Autonomous social media interaction system with computer vision and LLM-generated content (published at TechRxiv)
  • WaveShot: Portable USV for water videography with monocular depth estimation (published at arXiv, AIME accepted)
  • WaveDepth: Water depth estimation combining semantic segmentation with specialized dataset (published at TechRxiv)
  • Underwater Perception: 3D reconstruction system integrating sonar, RTK-GPS, and point cloud visualization
  • Patrol Boat: Real-time streaming USV with 4G module, LLM integration, and generative AI capabilities

Education

Visiting Student, Computer Science
University College London (UCL) | London, United Kingdom
Jan 2025 - Jun 2025
Master of Science in Data Science
University of Macau (UM) | Macau, China
Aug 2023 - Jun 2025
Coursework in Master of Information Technology
University of Queensland (UQ) | Brisbane, Australia
Feb 2022 - Jun 2022
Bachelor of Marketing
University College Dublin (UCD) | Dublin, Ireland
Sep 2018 - Jan 2022

Honors & Achievements

Professional Service

  • Reviewer: ICLR 2026, IROS 2025, ICRA 2025, RA-L 2024, IROS 2024

Patents

  • ZL201420713314.1 - Portable remote control water cleaning boat (2022)
  • ZL202130395091.4 - Water surface intelligent cleaning robot (2022)

Awards

  • Silver Medal, 7th National College Students "Internet plus" Entrepreneurship Contest (2022, Hubei Province)
  • Winner's Prize, 16th "Chunhui Cup" Chinese Overseas Students Innovation Competition (2022)