{"schema_version":"onlylabs.public_analysis_evidence.v1","title":"Xiaomi (MiMo) analysis evidence pack","description":"Public onlylabs evidence pack for cited agent analysis: captured pages, ranked public signals, and stored web-search provenance used by the background analysis workflow.","url":"https://onlylabs.fyi/labs/xiaomi","json_url":"https://onlylabs.fyi/analysis/xiaomi/evidence.json","generated_at":"2026-06-11T13:52:25.036Z","org":{"slug":"xiaomi","name":"Xiaomi (MiMo)","category":"neolab","category_label":"Neolab","dossier_url":"https://onlylabs.fyi/labs/xiaomi"},"analysis":null,"workflow":{"version":"onlylabs-deepagents-analysis-v3","provider":null,"model":null,"agent":null,"public_pack_mode":"local-pages-and-events","live_web_fetches":false,"note":"Public evidence exports do not trigger live Exa calls; stored Exa provenance is included when analysis metadata contains it."},"stats":{"pages":28,"events":47,"web":0,"evidence":75,"signal_desks":{"hiring":12,"forks":2,"releases":21,"talking":0,"repos":12},"data_radar_lanes":null,"data_radar_matches":null,"stored_analysis_evidence":null,"stored_analysis_web":null,"stored_analysis_signal_desks":null,"stored_analysis_data_radar_lanes":null,"stored_analysis_data_radar_matches":null},"stored_web_provenance":null,"evidence":[{"ref":"P1","kind":"page","title":"XiaomiMiMo/MiMo-VL repository metadata","date":"2026-06-11T03:20:37.16331+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-VL","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-VL\n\nDescription: MiMo-VL\n\nLicense: Apache-2.0\n\nStars: 643\n\nForks: 30\n\nOpen issues: 6\n\nCreated: 2025-05-29T06:40:28Z\n\nPushed: 2025-08-21T08:00:22Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-VL Technical Report\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/collections/MiMo-VL-bb651017e02742\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2506.03569\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/lmms-eval\" target=\"_blank\">📊 Evaluation Framework</a>\n&nbsp;|\n<a href=\"./demo/README.md\" target=\"_blank\">🔥 Gradio Demo</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n### 🔥🔥🔥 MiMo-VL 2508 Updates\n\nWe're excited to announce improvements to our MiMo-VL ([MiMo-VL-7B-RL-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508) and [MiMo-VL-7B-SFT-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508)), featuring enhanced performance across multiple benchmarks, improved thinking control capabilities, and better user experience.\n\n#### 📈 Performance Improvements\n\nMiMo-VL-7B-RL-2508 demonstrates consistent improvements across both image and video benchmarks, achieving notable milestones of **70.6 on MMMU** and **70.8 on VideoMME**.\n\n<img src=\"./figures/mimo-2508.png\" alt=\"Benchmark Improvements\" width=\"768\">\n\nFull evaluation results can be found [below](#full-evaluation-results).\n\n#### 🤔 Thinking Control Feature\n\nA thinking control capability that allows users to turn off the model's reasoning mode using the no_think parameter:\n- Thinki"},{"ref":"P2","kind":"page","title":"XiaomiMiMo/MiMo repository metadata","date":"2026-06-11T03:20:37.087985+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo\n\nDescription: MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 2167\n\nForks: 94\n\nOpen issues: 38\n\nCreated: 2025-04-26T09:31:17Z\n\nPushed: 2025-06-05T16:01:49Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n---\n\n## Updates\n\n[2025.05.30] We scaled the SFT dataset from approximately 500K to 6M instances and continuously expanding the RL training window size from 32K to 48K, the performance of [MiMo-7B-RL-0530](https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530) on AIME24 can be continuously improved and eventually surpass that of DeepSeek R1 (79.8).\n\n<table>\n<thead>\n<tr>\n<th>Benchmark</th>\n<th>MiMo-7B-RL</th>\n<th>MiMo-7B-RL-0530</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td colspan=\"3\"><strong>Mathematics</strong></td>\n<p align=\"center\">\n<td rowspan=\"11\"><img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/length.jpg?raw=true\"></td>\n</p>\n</tr>\n<tr><td>MATH500<br/>(Pass@1)</td><td>95.8</td><td>97.2</td></tr>\n<tr><td>AIME 2024<br/>(Pass@1)</td><td>68.2</td><td>80.1</td></tr>\n<tr><td>AIME 2025<br/>(Pass@1)</td><td>55.4</td><td>70.2</td></tr>\n<tr><td colspan=\"3\"><strong>Code</strong></td></tr>\n<tr><td>LiveCodeBench v5<br/>(Pass@1)</td><td>57."},{"ref":"P3","kind":"page","title":"XiaomiMiMo/MiMo-Audio-Eval repository metadata","date":"2026-06-11T03:20:36.940977+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Eval","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Audio-Eval\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 87\n\nForks: 8\n\nOpen issues: 1\n\nCreated: 2025-09-18T16:01:04Z\n\nPushed: 2025-09-25T04:44:38Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-Audio-Eval Toolkit\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio\" target=\"_blank\">🤖 GitHub</a>\n&nbsp;|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-audio-68cc7202692c27dae881cce0\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nWelcome to the **MiMo-Audio-Eval** toolkit! This toolkit is designed to evaluate various audio language models as described in the **MiMo-Audio** paper. It provides a flexible and extensible framework, supporting a wide range of datasets, tasks, and models, specifically for evaluating pre-trained or supervised fine-tuned (SFT) audio language models. The toolkit is ideal for researchers and developers who need to assess the performance of these models across different tasks and datasets.\n\n## Supported Datasets, Tasks, and Models\n\nThe MiMo-Audio-Eval toolkit supports a comprehensive set of datasets, tasks, and models. Some of the key features include:\n\n* **Datasets**:\n\n* AISHELL1\n* LibriSpeech\n* SeedTTS\n* Expresso\n* InstructTTSEval\n* SpeechMMLU\n* MMAR\n* MMAU\n* MMAU-Pro\n* MMSU\n* ESD\n* Big Bench Audio\n* MultiChallenge "},{"ref":"P4","kind":"page","title":"XiaomiMiMo/MiMo-Audio-Tokenizer repository metadata","date":"2026-06-11T03:20:36.926389+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Tokenizer","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Audio-Tokenizer\n\nDescription: A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 144\n\nForks: 13\n\nOpen issues: 2\n\nCreated: 2025-09-18T16:02:05Z\n\nPushed: 2025-09-19T08:11:24Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n\n<img src=\"https://raw.githubusercontent.com/XiaomiMiMo/MiMo-VL/main/figures/Xiaomi_MiMo.png\" alt=\"Description\" width=\"25%\" />\n\n# MiMo-Audio-Tokenizer\n\n<img src=\"mimo_audio_tokenizer/assets/Tokenizer_01.png\" alt=\"Description\" width=\"90%\" />\n\n<p><em>A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.</em></p>\n\n</div>\n\n## Key Features\n\n- Scaled parameters and training data bootstrap the frontier of audio tokenization\n- 1.2B pure transformer-based architecture to keep both efficiency and effectiveness\n- trained from scratch over 11 million hours covering both audio reconstruction task and the audio-to-text (A2T) task\n\n- Unified representation enhance both cross-modal alignment and speech reconstruction quality\n- jointly capture both semantic and acoustic information while further alleviates the semantic-acoustic representation conflict\n\n## Installation\n\n```sh\ngit clone https://github.com/XiaomiMiMo/MiMo-Audio-Tokenizer\ncd MiMo-Audio-Tokenizer\n# Install base dependencies\npip install -e .\n# Install flash-attn\npip install -e \".[flash]\"\n```\n\n## Model Download\n\n```sh\n# you might need `sudo apt-get install git-lfs` before download this model\ngit clone https://huggingface.co/XiaomiMiMo/MiMo-Audio-Tokenizer\n```\n\n## Example Usage\n\n### 0. Quick start\n\n```py\nimport torchaudio\nimport mimo_audio_tokenizer\n\n# one-line model init\ntokenizer = mimo_audio_tokenizer.load_model(\"path to your model\").bfloat16().cuda() # FlashAttention only support fp16 and bf16 data type\n\n# preprocess\nmels = []\nwav_paths = [\"mimo_audio_tokenizer/assets/BAC009S0764W0121.wav\", \"mimo_audio_tokenizer/assets/BAC009S0764W0122.wav\", \"mimo_audio_tokenizer/assets/猪八戒_gt.wav\"]\nfor wav_path in wav_paths:\nwav = mimo_audio_tokenizer.load_audio(wav_path, tokenizer"},{"ref":"P5","kind":"page","title":"XiaomiMiMo/MiMo-Audio repository metadata","date":"2026-06-11T03:20:36.753432+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Audio","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Audio\n\nDescription: MiMo-Audio: Audio Language Models are Few-Shot Learners\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 1046\n\nForks: 102\n\nOpen issues: 39\n\nCreated: 2025-09-19T00:46:49Z\n\nPushed: 2026-03-03T02:34:35Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo Audio: Audio Language Models are Few-Shot Learners\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-audio-68cc7202692c27dae881cce0\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio-Eval\" target=\"_blank\">📊 MiMo-Audio-Eval</a>\n&nbsp;|\n\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nExisting audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both"},{"ref":"P6","kind":"page","title":"XiaomiMiMo/MiMo-Audio-Training repository metadata","date":"2026-06-11T03:20:36.405358+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Training","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Audio-Training\n\nLanguage: Python\n\nStars: 109\n\nForks: 13\n\nOpen issues: 5\n\nCreated: 2025-10-16T13:52:54Z\n\nPushed: 2025-10-16T13:55:07Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-Audio-Training Toolkit\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio\" target=\"_blank\">🤖 GitHub</a>\n&nbsp;|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-audio-68cc7202692c27dae881cce0\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nWelcome to the **MiMo-Audio-Training** toolkit! This toolkit is designed to fine-tune the [XiaomiMiMo/MiMo-Audio-7B-Instruct](https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Instruct). This toolkit serves as a reference implementation for researchers and developers interested in MiMo-Audio and looking to adapt it to their own custom tasks.\n\n## Supported Tasks\n\nThe MiMo-Audio-Eval toolkit supports a comprehensive set of tasks. Some of the key features include:\n\n* **Tasks**:\n\n* **SFT**:\n\n* ASR\n* TTS / InstructTTS\n* Audio Understanding and Reasoning\n* Spoken Dialogue\n\n## Getting Started\n\nTo get started with the MiMo-Audio-Training toolkit, follow the instructions below to set up the environment and install the required dependencies.\n\n### Prerequisites (Linux)\n\n* Python 3.12\n* CUDA >= 12.0\n\n### Installation:\n\n```bash\ngit clone --"},{"ref":"P7","kind":"page","title":"XiaomiMiMo/MiMo-Audio-Demo repository metadata","date":"2026-06-11T03:20:36.254511+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Demo","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Audio-Demo\n\nLanguage: HTML\n\nStars: 11\n\nForks: 3\n\nOpen issues: 1\n\nCreated: 2025-09-19T01:01:53Z\n\nPushed: 2025-11-17T13:34:24Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# MiMo-Audio Blog\n\n- github: https://github.com/XiaomiMiMo/MiMo-Audio\n- gitpage: https://xiaomimimo.github.io/MiMo-Audio-Demo/"},{"ref":"P8","kind":"page","title":"XiaomiMiMo/MiMo-Embodied repository metadata","date":"2026-06-11T03:20:36.205897+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Embodied","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Embodied\n\nDescription: MiMo-Embodied\n\nLanguage: Python\n\nLicense: NOASSERTION\n\nStars: 386\n\nForks: 15\n\nOpen issues: 0\n\nCreated: 2025-11-19T08:54:41Z\n\nPushed: 2026-04-15T12:28:08Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<img src=\"./assets/xfmlogo.svg\" width=\"600\">\n</div>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo/MiMo-Embodied-7B\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2511.16518\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Embodied\" target=\"_blank\">🏠 Model Repository</a>\n&nbsp;|\n<br/>\n</div>\n\n## I. Introduction\n\nThis repository provides the **official evaluation suite of MiMo-Embodied**, designed to support **rigorous** and **reproducible** evaluation for **embodied AI** and **autonomous driving** tasks.\n\nBuilt on top of the excellent [lmms-eval](https://github.com/EvolvingLMMs-Lab/lmms-eval) framework, this repository extends the evaluation pipeline with MiMo-specific model integration, benchmark support, and evaluation workflows for embodied and driving scenarios.\n\n**MiMo-Embodied** is a powerful cross-embodied vision-language model that demonstrates state-of-the-art performance in both **autonomous driving** and **embodied AI tasks**, representing the first open-source VLM that integrates these two critical areas.\n\n> This repository is for **evaluation only**. It does **not** contain model training code.\n\n<div align=\"center\">\n<img src=\"./assets/fig1.svg\" width=\"800\">\n</div>\n\n---\n\n## II. Key Features\n\n### 1. `MiVLLM`: A MiMo-tailored vLLM-based Model Wrapper\n\nWe use a custom `mivllm` model class built on top of the original `VLLM` implementation in `lmms-eval`, tailored for MiMo models. Compared with the default implementation, it:\n\n- improves **data loading efficiency**\n- enables finer control over **image and video preprocessing**\n- supports MiMo-specific inference settings such as:\n- `max_model_len`\n- `gpu_memory_utilization`\n- `max_num_seqs`\n\n### 2. Evaluation for Embodied AI\n\nThis evaluation suite supports embodied AI benchmarks covering key capabilities such as:\n\n- *"},{"ref":"P9","kind":"page","title":"XiaomiMiMo/MiMo-V2-Flash repository metadata","date":"2026-06-11T03:20:36.044718+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-V2-Flash","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-V2-Flash\n\nDescription: MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model\n\nLicense: Apache-2.0\n\nStars: 1333\n\nForks: 59\n\nOpen issues: 18\n\nCreated: 2025-12-15T16:28:22Z\n\nPushed: 2026-01-08T04:53:37Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<br/><br/>\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo/MiMo-V2-Flash\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/blob/main/paper.pdf\" target=\"_blank\">📔 Technical Report </a>\n&nbsp;|\n<a href=\"https://mimo.xiaomi.com/blog/mimo-v2-flash\" target=\"_blank\">📰 Blog </a>\n&nbsp;|\n<br/><br/>\n<strong>Play around!</strong> &nbsp;\n<a href=\"https://aistudio.xiaomimimo.com\" target=\"_blank\">🗨️ Xiaomi MiMo Studio </a>\n&nbsp;\n<a href=\"https://platform.xiaomimimo.com/\" target=\"_blank\">🎨 Xiaomi MiMo API Platform </a>\n</div>\n<br/>\n\n# MiMo-V2-Flash\n\n**MiMo-V2-Flash** is a Mixture-of-Experts (MoE) language model with **309B total parameters** and **15B active parameters**. Designed for high-speed reasoning and agentic workflows, it utilizes a novel hybrid attention architecture and Multi-Token Prediction (MTP) to achieve state-of-the-art performance while significantly reducing inference costs.\n\n<p align=\"center\">\n<img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/MiMo-v2-flash-performance.jpg?raw=true\">\n</p>\n\n-----\n\n## 1. Introduction\n\nMiMo-V2-Flash creates a new balance between long-context modeling capability and inference efficiency. Key features include:\n\n* **Hybrid Attention Architecture**: Interleaves Sliding Window Attention (SWA) and Global Attention (GA) with a 5:1 ratio and an aggressive 128-token window. This reduces KV-cache storage by nearly 6x while maintaining long-context performance via learnable **attention sink bias"},{"ref":"P10","kind":"page","title":"XiaomiMiMo/vllm repository metadata","date":"2026-06-11T02:54:14.938034+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/vllm","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/vllm\n\nDescription: A high-throughput and memory-efficient inference and serving engine for LLMs\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 31\n\nForks: 5\n\nOpen issues: 0\n\nCreated: 2025-04-29T06:43:27Z\n\nPushed: 2025-05-12T09:06:32Z\n\nDefault branch: feat_mimo_mtp_stable_073\n\nFork: yes\n\nParent repository: vllm-project/vllm\n\nArchived: no\n\nREADME:\n<p align=\"center\">\n<picture>\n<source media=\"(prefers-color-scheme: dark)\" srcset=\"https://raw.githubusercontent.com/vllm-project/vllm/main/docs/source/assets/logos/vllm-logo-text-dark.png\">\n<img alt=\"vLLM\" src=\"https://raw.githubusercontent.com/vllm-project/vllm/main/docs/source/assets/logos/vllm-logo-text-light.png\" width=55%>\n</picture>\n</p>\n\n<h3 align=\"center\">\nEasy, fast, and cheap LLM serving for everyone\n</h3>\n\n<p align=\"center\">\n| <a href=\"https://docs.vllm.ai\"><b>Documentation</b></a> | <a href=\"https://vllm.ai\"><b>Blog</b></a> | <a href=\"https://arxiv.org/abs/2309.06180\"><b>Paper</b></a> | <a href=\"https://x.com/vllm_project\"><b>Twitter/X</b></a> | <a href=\"https://slack.vllm.ai\"><b>Developer Slack</b></a> |\n</p>\n\n---\n\nWe are excited to invite you to our Menlo Park meetup with Meta, evening of Thursday, February 27! Meta engineers will discuss the improvements on top of vLLM, and vLLM contributors will share updates from the v0.7.x series of releases. [Register Now](https://lu.ma/h7g3kuj9)\n\n---\n\n*Latest News* 🔥\n\n- [2025/01] We are excited to announce the alpha release of vLLM V1: A major architectural upgrade with 1.7x speedup! Clean code, optimized execution loop, zero-overhead prefix caching, enhanced multimodal support, and more. Please check out our blog post [here](https://blog.vllm.ai/2025/01/27/v1-alpha-release.html).\n- [2025/01] We hosted [the eighth vLLM meetup](https://lu.ma/zep56hui) with Google Cloud! Please find the meetup slides from vLLM team [here](https://docs.google.com/presentation/d/1epVkt4Zu8Jz_S5OhEHPc798emsYh2BwYfRuDDVEF7u4/edit?usp=sharing), and Google Cloud team [here](https://drive.google.com/file/d/1h24pHewANyRL11xy5dXUbvRC9F9Kkjix/view?usp=sharing).\n- [2024/12] vLLM joins [pytorch ecosystem](https://pytorch.org/blog/vllm-joins-pytorch)! Easy, Fast, and Cheap LLM Serving f"},{"ref":"P11","kind":"page","title":"XiaomiMiMo/lmms-eval repository metadata","date":"2026-06-11T02:54:11.272032+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/lmms-eval","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/lmms-eval\n\nDescription: Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.\n\nLanguage: Python\n\nLicense: NOASSERTION\n\nStars: 71\n\nForks: 5\n\nOpen issues: 0\n\nCreated: 2025-05-29T22:25:08Z\n\nPushed: 2025-08-08T12:15:22Z\n\nDefault branch: mimo_vl_eval\n\nFork: yes\n\nParent repository: EvolvingLMMs-Lab/lmms-eval\n\nArchived: no\n\nREADME:\n# The Evaluation Suite of Xiaomi MiMo-VL\n\nTo promote **rigorous**, **reproducible**, and **thinking-oriented** evaluation of Vision-Language Models (VLMs), we open-source our evaluation suite for [**MiMo-VL**](https://github.com/XiaomiMiMo/MiMo-VL) and beyond.\n\nBuilt on top of the excellent [lmms-eval](https://github.com/EvolvingLMMs-Lab/lmms-eval) framework, we introduce several improvements in model integration, evaluation protocol, and task coverage to better support the next generation of reasoning-capable VLMs.\n\n## 📰 News\n\n**[25/08/08]** We update our evaluation framework along with the release of [MiMo-VL-7B-SFT-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508) and [MiMo-VL-7B-RL-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508). New features include: \n- Additional GUI action benchmarks AndroidControl and CAGUI (evaluated using `--model mimo_agent`)\n- Additional evaluation benchmarks on video spatial reasoning (VSI-Bench), physics reasoning (PhysReason), multi-modal long context understanding (MMLongBench), multi-modal instruction following (MM-IFEval); \n- Enables no_think evaluation by adding model argument `disable_thinking_user=True`\n\n## 🔧 Key Features\n\n### 1. ⚙️ `MiVLLM`: A vLLM-based Model Wrapper for MiMo-VL\nWe introduce a new `MiVLLM` model class based on the original `VLLM` class in `lmms-eval`, which is tailored for [**MiMo-VL**](https://github.com/XiaomiMiMo/MiMo-VL). Compared to the original implementation, it:\n* Greatly improves **data loading efficiency**\n* Enables fine-grained control over **image and video preprocessing**\n\n### 2. 🧠 Adaptation to Thinking VLMs\n\nThe original lmms-eval tasks were designed for **non-thinking** VLMs: they prompt directly for short answers and compare outputs without post-processing. We redesign this pr"},{"ref":"P12","kind":"page","title":"XiaomiMiMo/MiMo-Skills repository metadata","date":"2026-06-11T02:51:35.366594+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-Skills","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-Skills\n\nDescription: Agent skills for Xiaomi MiMo series.\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 70\n\nForks: 6\n\nOpen issues: 7\n\nCreated: 2026-04-23T12:10:40Z\n\nPushed: 2026-04-24T03:16:07Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n> Agent skills for Xiaomi MiMo series. Get your API key at <a href=\"https://platform.xiaomimimo.com/\" target=\"_blank\">🎨 Xiaomi MiMo Open Platform</a>.\n\n[中文版](./README_zh.md)\n\n## Skills\n\n| Skill | Description | License |\n|-------|-------------|---------|\n| `mimo-v2-5-tts` | MiMo V2.5 TTS voice synthesis. Supports preset voices, voice design, voice cloning, emotion styles, dialects, and singing. Supports natural language control, Director Mode, and audio tag control. | MIT |\n\n## Installation\n\n### npx (Recommended)\n\n```bash\nnpx skills add XiaomiMiMo/MiMo-Skills\n```\n\nOptions:\n\n```bash\n# List available skills without installing\nnpx skills add XiaomiMiMo/MiMo-Skills --list\n\n# Install to specific agents\nnpx skills add XiaomiMiMo/MiMo-Skills -a claude-code -a opencode\n\n# Install globally (user-level)\nnpx skills add XiaomiMiMo/MiMo-Skills -g\n\n# Install specific skills by name\nnpx skills add XiaomiMiMo/MiMo-Skills --skill mimo-v2-5-tts\n\n# Non-interactive (CI/CD friendly)\nnpx skills add XiaomiMiMo/MiMo-Skills -g -a claude-code -y\n```\n\n### Manual\n\n```bash\ngit clone https://github.com/XiaomiMiMo/MiMo-Skills.git ~/.mimo-skills\nln -s ~/.mimo-skills/skills/* ~/.hermes/skills/\n# ln -s ~/.mimo-skills/skills/* ~/.openclaw/skills/\n# ln -s ~/.mimo-skills/skills/* ~/.config/opencode/skills/\n# ln -s ~/.mimo-skills/skills/* ~/.claude/skills/\n# ...\n```\n\n## Environment Variables\n\n| Variable | Description | Required |\n|----------|-------------|----------|\n| `MIMO_API_KEY` | MiMo API key | Yes |\n| `FEISHU_APP_ID` | Feishu App ID (for voice message sending) | Optional |\n| `FEISHU_APP_SECRET` | Feishu App Secret (for voice messa"},{"ref":"P13","kind":"page","title":"XiaomiMiMo/MiMo-V2.5-ASR repository metadata","date":"2026-06-11T02:51:35.167304+00:00","date_source":null,"source_url":"https://github.com/XiaomiMiMo/MiMo-V2.5-ASR","signal_url":null,"signal_json_url":null,"text":"# XiaomiMiMo/MiMo-V2.5-ASR\n\nDescription: Robust Speech Recognition Across Languages, Dialects, and Complex Acoustic Scenarios\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 264\n\nForks: 24\n\nOpen issues: 7\n\nCreated: 2026-04-23T12:59:55Z\n\nPushed: 2026-04-23T17:15:38Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n<img src=\"assets/XiaomiMIMO.png\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</div>\n\n<div align=\"center\">\n<h3>\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span><br/>\nMiMo-V2.5-ASR: Robust Speech Recognition Across<br/>\nLanguages, Dialects, and Complex Acoustic Scenarios<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n</b>\n</h3>\n</div>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo/MiMo-V2.5-ASR\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/MiMo-V2.5-ASR\" target=\"_blank\">🚀 Online Demo</a>\n&nbsp;|\n<a href=\"https://mimo.xiaomi.com/mimo-v2-5-asr\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\n**MiMo-V2.5-ASR** is a state-of-the-art end-to-end automatic speech recognition (ASR) model developed by the Xiaomi MiMo team. It is built to deliver accurate and robust transcription across Mandarin Chinese and English, multiple Chinese dialects, code-switched speech, song lyrics, knowledge-intensive content, noisy acoustic environments, and multi-speaker conversations. MiMo-V2.5-ASR achieves state-of-the-art results on a wide range of public benchmarks.\n\n## Abstract\n\nAutomatic speech recognition systems are expected to faithfully transcribe speech signals that originate from diverse languages, dialects, accents, and domains, and that are captured under a wide variety of acoustic conditions. While conventional end-to-end models perform well on in-domain data, they still fall short of real-world requirements in challenging scenarios such as dialect mixing, code-switching, knowledge-intensive content, noisy environments, and multi-speaker conversations. Therefore, we present **MiMo-V2.5-ASR**, an end-to-end speech recognition model developed by the Xiaomi MiMo team. Through large-scale mid-training, high-quality supervised fine-"},{"ref":"P14","kind":"page","title":"XiaomiMiMo/MiMo-7B-SFT model card","date":"2026-06-11T02:50:43.687643+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-SFT/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n---\n\n## Updates\n\n[2025.05.30] We scaled the SFT dataset from approximately 500K to 6M instances and continuously expanding the RL training window size from 32K to 48K, the performance of [MiMo-7B-RL-0530](https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530) on AIME24 can be continuously improved and eventually surpass that of DeepSeek R1 (79.8).\n\n<table>\n<thead>\n<tr>\n<th>Benchmark</th>\n<th>MiMo-7B-RL</th>\n<th>MiMo-7B-RL-0530</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td colspan=\"3\"><strong>Mathematics</strong></td>\n<p align=\"center\">\n<td rowspan=\"11\"><img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/length.jpg?raw=true\"></td>\n</p>\n</tr>\n<tr><td>MATH500<br/>(Pass@1)</td><td>95.8</td><td>97.2</td></tr>\n<tr><td>AIME 2024<br/>(Pass@1)</td><td>68.2</td><td>80.1</td></tr>\n<tr><td>AIME 2025<br/>(Pass@1)</td><td>55.4</td><td>70.2</td></tr>\n<tr><td colspan=\"3\"><strong>Code</strong></td></tr>\n<tr><td>LiveCodeBench v5<br/>(Pass@1)</td><td>57.8</td><td>60.9</td></tr>\n<tr><td>LiveCodeBench v6<br/>(Pass@1)</td><td>49.3</td><td>52.2</td></tr>\n<tr><td colspan=\"3\"><strong>STEM</strong></td></tr>\n<tr><td>GPQA-Diamond<br/>(Pass@1)</td><td>54.4</td><td>60.6</td></tr>\n<tr><td colspan=\"3\"><strong>General</strong></td></t"},{"ref":"P15","kind":"page","title":"XiaomiMiMo/MiMo-7B-Base model card","date":"2026-06-11T02:50:43.657954+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-Base/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n---\n\n## Updates\n\n[2025.05.30] We scaled the SFT dataset from approximately 500K to 6M instances and continuously expanding the RL training window size from 32K to 48K, the performance of [MiMo-7B-RL-0530](https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530) on AIME24 can be continuously improved and eventually surpass that of DeepSeek R1 (79.8).\n\n<table>\n<thead>\n<tr>\n<th>Benchmark</th>\n<th>MiMo-7B-RL</th>\n<th>MiMo-7B-RL-0530</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td colspan=\"3\"><strong>Mathematics</strong></td>\n<p align=\"center\">\n<td rowspan=\"11\"><img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/length.jpg?raw=true\"></td>\n</p>\n</tr>\n<tr><td>MATH500<br/>(Pass@1)</td><td>95.8</td><td>97.2</td></tr>\n<tr><td>AIME 2024<br/>(Pass@1)</td><td>68.2</td><td>80.1</td></tr>\n<tr><td>AIME 2025<br/>(Pass@1)</td><td>55.4</td><td>70.2</td></tr>\n<tr><td colspan=\"3\"><strong>Code</strong></td></tr>\n<tr><td>LiveCodeBench v5<br/>(Pass@1)</td><td>57.8</td><td>60.9</td></tr>\n<tr><td>LiveCodeBench v6<br/>(Pass@1)</td><td>49.3</td><td>52.2</td></tr>\n<tr><td colspan=\"3\"><strong>STEM</strong></td></tr>\n<tr><td>GPQA-Diamond<br/>(Pass@1)</td><td>54.4</td><td>60.6</td></tr>\n<tr><td colspan=\"3\"><strong>General</strong></td></t"},{"ref":"P16","kind":"page","title":"XiaomiMiMo/MiMo-7B-RL model card","date":"2026-06-11T02:50:43.605265+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n---\n\n## Updates\n\n[2025.05.30] We scaled the SFT dataset from approximately 500K to 6M instances and continuously expanding the RL training window size from 32K to 48K, the performance of [MiMo-7B-RL-0530](https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530) on AIME24 can be continuously improved and eventually surpass that of DeepSeek R1 (79.8).\n\n<table>\n<thead>\n<tr>\n<th>Benchmark</th>\n<th>MiMo-7B-RL</th>\n<th>MiMo-7B-RL-0530</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td colspan=\"3\"><strong>Mathematics</strong></td>\n<p align=\"center\">\n<td rowspan=\"11\"><img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/length.jpg?raw=true\"></td>\n</p>\n</tr>\n<tr><td>MATH500<br/>(Pass@1)</td><td>95.8</td><td>97.2</td></tr>\n<tr><td>AIME 2024<br/>(Pass@1)</td><td>68.2</td><td>80.1</td></tr>\n<tr><td>AIME 2025<br/>(Pass@1)</td><td>55.4</td><td>70.2</td></tr>\n<tr><td colspan=\"3\"><strong>Code</strong></td></tr>\n<tr><td>LiveCodeBench v5<br/>(Pass@1)</td><td>57.8</td><td>60.9</td></tr>\n<tr><td>LiveCodeBench v6<br/>(Pass@1)</td><td>49.3</td><td>52.2</td></tr>\n<tr><td colspan=\"3\"><strong>STEM</strong></td></tr>\n<tr><td>GPQA-Diamond<br/>(Pass@1)</td><td>54.4</td><td>60.6</td></tr>\n<tr><td colspan=\"3\"><strong>General</strong></td></t"},{"ref":"P17","kind":"page","title":"XiaomiMiMo/MiMo-7B-RL-Zero model card","date":"2026-06-11T02:50:43.507722+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-Zero/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n> This model repository is licensed under the MIT License.\n\n## I. Introduction\n\nCurrently, most successful RL works, including open-source research, rely on relatively large base models, e.g., 32B models, particularly for enhancing code reasoning capabilities. Moreover, it was widely considered that achieving uniform and simultaneous improvements in both mathematical and code capabilities within a small model is challenging. Nonetheless, we believe that the effectiveness of the RL trained reasoning model relies on the inherent reasoning potential of the base model. To fully unlock the reasoning potential of language models, efforts must focus not only on post-training but also on pre-training strategies tailored to reasoning.\n\nIn this work, we present MiMo-7B, a series of models trained from scratch and born for reasoning tasks. Our RL experiments from MiMo-7B-Base show that our model possesses extraordinary reasoning potential, even surpassing much larger 32B models. Additionally, we perform RL training on a cold-started SFT model, resulting in MiMo-7B-RL, which demonstrates superior performance on both mathematics and code reasoning tasks, matching the performance of OpenAI o1-mini.\n\n<p align=\"center\">\n<img width=\"80%\" src=\"https://github.co"},{"ref":"P18","kind":"page","title":"XiaomiMiMo/MiMo-VL-7B-RL model card","date":"2026-06-11T02:50:42.460412+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nbase_model:\n- XiaomiMiMo/MiMo-VL-7B-RL\nlibrary_name: transformers\nlicense: mit\npipeline_tag: image-text-to-text\n---\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-VL Technical Report\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/collections/MiMo-VL-bb651017e02742\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-VL/blob/main/MiMo-VL-Technical-Report.pdf\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://huggingface.co/papers/2506.03569\" target=\"_blank\">📃 Paper</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n## I. Introduction\n\nIn this report, we share our efforts to build a compact yet powerful VLM, MiMo-VL-7B. MiMo-VL-7B comprises (1) a native resolution ViT encoder that preserves fine-grained visual details, (2) an MLP projector for efficient cross-modal alignment, and (3) our [MiMo-7B language model](https://github.com/XiaomiMiMo/MiMo), specifically optimized for complex reasoning tasks. \n\nThe development of MiMo-VL-7B involves two sequential training processes: (1) A four-stage pre-training phase, which includes projector warmup, vision-language alignment, general multi-modal pre-training, and long-context Supervised Fine-Tuning (SFT). This phase yields the MiMo-VL-7B-SFT model. (2) A subsequent post-training phase, where we introduce Mixed On-policy Reinforcement Learning (MORL), a novel framework that seamlessly integrates diverse reward signals spanning perception accuracy, visual grounding precision, logical reasoning capabilities, and human/AI preferences. This phase yields the MiMo-VL-7B-RL model.\n\n<p alig"},{"ref":"P19","kind":"page","title":"XiaomiMiMo/MiMo-7B-RL-0530 model card","date":"2026-06-11T02:50:42.240545+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n---\n\n## Updates\n\n[2025.05.30] We scaled the SFT dataset from approximately 500K to 6M instances and continuously expanding the RL training window size from 32K to 48K, the performance of [MiMo-7B-RL-0530](https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530) on AIME24 can be continuously improved and eventually surpass that of DeepSeek R1 (79.8).\n\n<table>\n<thead>\n<tr>\n<th>Benchmark</th>\n<th>MiMo-7B-RL</th>\n<th>MiMo-7B-RL-0530</th>\n</tr>\n</thead>\n<tbody>\n<tr>\n<td colspan=\"3\"><strong>Mathematics</strong></td>\n<p align=\"center\">\n<td rowspan=\"11\"><img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/length.jpg?raw=true\"></td>\n</p>\n</tr>\n<tr><td>MATH500<br/>(Pass@1)</td><td>95.8</td><td>97.2</td></tr>\n<tr><td>AIME 2024<br/>(Pass@1)</td><td>68.2</td><td>80.1</td></tr>\n<tr><td>AIME 2025<br/>(Pass@1)</td><td>55.4</td><td>70.2</td></tr>\n<tr><td colspan=\"3\"><strong>Code</strong></td></tr>\n<tr><td>LiveCodeBench v5<br/>(Pass@1)</td><td>57.8</td><td>60.9</td></tr>\n<tr><td>LiveCodeBench v6<br/>(Pass@1)</td><td>49.3</td><td>52.2</td></tr>\n<tr><td colspan=\"3\"><strong>STEM</strong></td></tr>\n<tr><td>GPQA-Diamond<br/>(Pass@1)</td><td>54.4</td><td>60.6</td></tr>\n<tr><td colspan=\"3\"><strong>General</strong></td></t"},{"ref":"P20","kind":"page","title":"XiaomiMiMo/MiMo-VL-7B-SFT model card","date":"2026-06-11T02:50:42.226193+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nbase_model:\n- XiaomiMiMo/MiMo-VL-7B-SFT\nlibrary_name: transformers\nlicense: mit\npipeline_tag: image-text-to-text\n---\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-VL Technical Report\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/collections/MiMo-VL-bb651017e02742\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-VL/blob/main/MiMo-VL-Technical-Report.pdf\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-VL\" target=\"_blank\">💻 Github Repo</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n## I. Introduction\n\nIn this report, we share our efforts to build a compact yet powerful VLM, MiMo-VL-7B. MiMo-VL-7B comprises (1) a native resolution ViT encoder that preserves fine-grained visual details, (2) an MLP projector for efficient cross-modal alignment, and (3) our [MiMo-7B language model](https://github.com/XiaomiMiMo/MiMo), specifically optimized for complex reasoning tasks. \n\nThe development of MiMo-VL-7B involves two sequential training processes: (1) A four-stage pre-training phase, which includes projector warmup, vision-language alignment, general multi-modal pre-training, and long-context Supervised Fine-Tuning (SFT). This phase yields the MiMo-VL-7B-SFT model. (2) A subsequent post-training phase, where we introduce Mixed On-policy Reinforcement Learning (MORL), a novel framework that seamlessly integrates diverse reward signals spanning perception accuracy, visual grounding precision, logical reasoning capabilities, and human/AI preferences. This phase yields the MiMo-VL-7B-RL model.\n\n<p "},{"ref":"P21","kind":"page","title":"XiaomiMiMo/MiMo-VL-7B-SFT-2508 model card","date":"2026-06-11T02:50:37.951559+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nbase_model:\n- XiaomiMiMo/MiMo-VL-7B-SFT-2508\nlibrary_name: transformers\nlicense: mit\npipeline_tag: image-text-to-text\n---\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-VL Technical Report\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/collections/MiMo-VL-bb651017e02742\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-VL/blob/main/MiMo-VL-Technical-Report.pdf\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://huggingface.co/papers/2506.03569\" target=\"_blank\">📃 Paper</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n### 🔥🔥🔥MiMo-VL 2508 Updates\n\nWe're excited to announce improvements to our MiMo-VL ([MiMo-VL-7B-RL-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508) and [MiMo-VL-7B-SFT-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508)), featuring enhanced performance across multiple benchmarks, improved thinking control capabilities, and better user experience.\n\n#### 📈 Performance Improvements\n\nMiMo-VL-7B-RL-2508 demonstrates consistent improvements across both image and video benchmarks, achieving notable milestones of **70.6 on MMMU** and **70.8 on VideoMME**.\n\n<img src=\"./mimo-2508.png\" alt=\"Benchmark Improvements\" width=\"768\">\n\nFull evaluation results can be found [below](#full-evaluation-results).\n\n#### 🤔 Thinking Control Feature\n\nA thinking control capability that allows users to turn off the model's reasoning mode using the no_think parameter:\n- Thinking mode (default behavior): Full reasoning process visible with 100% control success rate;\n- Non-thinking mode: Direct responses without reas"},{"ref":"P22","kind":"page","title":"XiaomiMiMo/MiMo-VL-7B-RL-2508 model card","date":"2026-06-11T02:50:37.928399+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nbase_model:\n- XiaomiMiMo/MiMo-VL-7B-RL-2508\nlibrary_name: transformers\nlicense: mit\npipeline_tag: image-text-to-text\n---\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo-VL Technical Report\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/collections/MiMo-VL-bb651017e02742\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-VL/blob/main/MiMo-VL-Technical-Report.pdf\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<a href=\"https://huggingface.co/papers/2506.03569\" target=\"_blank\">📃 Paper</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n### 🔥🔥🔥MiMo-VL 2508 Updates\n\nWe're excited to announce improvements to our MiMo-VL ([MiMo-VL-7B-RL-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508) and [MiMo-VL-7B-SFT-2508](https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508)), featuring enhanced performance across multiple benchmarks, improved thinking control capabilities, and better user experience.\n\n#### 📈 Performance Improvements\n\nMiMo-VL-7B-RL-2508 demonstrates consistent improvements across both image and video benchmarks, achieving notable milestones of **70.6 on MMMU** and **70.8 on VideoMME**.\n\n<img src=\"./mimo-2508.png\" alt=\"Benchmark Improvements\" width=\"768\">\n\nFull evaluation results can be found [below](#full-evaluation-results).\n\n#### 🤔 Thinking Control Feature\n\nA thinking control capability that allows users to turn off the model's reasoning mode using the no_think parameter:\n- Thinking mode (default behavior): Full reasoning process visible with 100% control success rate;\n- Non-thinking mode: Direct responses without reaso"},{"ref":"P23","kind":"page","title":"XiaomiMiMo/MiMo-Audio-7B-Base model card","date":"2026-06-11T02:50:34.916209+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Base/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\npipeline_tag: any-to-any\ntags:\n- Audio-to-Text\n- Text-to-Audio\n- Audio-to-Audio\n- Text-to-Text\n- Audio-Text-to-Text\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo Audio: Audio Language Models are Few-Shot Learners\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio\" target=\"_blank\">🤖 GitHub</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio-Eval\" target=\"_blank\">📊 MiMo-Audio-Eval</a>\n&nbsp;|\n\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nExisting audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both speech intelligence and audio understanding benchmarks among open-source models. Beyond standard metrics, MiMo-Audio-7B-Base generalizes to tasks absent from its training data, such as voice conversi"},{"ref":"P24","kind":"page","title":"XiaomiMiMo/MiMo-Audio-Tokenizer model card","date":"2026-06-11T02:50:34.779809+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-Tokenizer/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo Audio: Audio Language Models are Few-Shot Learners\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/collections/XiaomiMiMo/mimo-audio-68cc7202692c27dae881cce0\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio-Eval\" target=\"_blank\">📊 MiMo-Audio-Eval</a>\n&nbsp;|\n\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nExisting audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both speech intelligence and audio understanding benchmarks among open-source models. Beyond standard metrics, MiMo-Audio-7B-Base generalizes to tasks absent from its training data, such as voice conversion, style transfer, and speech editing. MiMo-Audio-7B-Base also demons"},{"ref":"P25","kind":"page","title":"XiaomiMiMo/MiMo-Audio-7B-Instruct model card","date":"2026-06-11T02:50:34.568088+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Instruct/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\npipeline_tag: any-to-any\ntags:\n- Audio-to-Text\n- Text-to-Audio\n- Audio-to-Audio\n- Text-to-Text\n- Audio-Text-to-Text\n---\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-VL/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nMiMo Audio: Audio Language Models are Few-Shot Learners\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio\" target=\"_blank\">🤖 GitHub</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio/blob/main/MiMo-Audio-Technical-Report.pdf\" target=\"_blank\">📄 Paper</a>\n&nbsp;|\n<a href=\"https://xiaomimimo.github.io/MiMo-Audio-Demo\" target=\"_blank\">📰 Blog</a>\n&nbsp;|\n<a href=\"https://huggingface.co/spaces/XiaomiMiMo/mimo_audio_chat\" target=\"_blank\">🔥 Online Demo</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-Audio-Eval\" target=\"_blank\">📊 MiMo-Audio-Eval</a>\n&nbsp;|\n\n<br/>\n</div>\n\n<br/>\n\n## Introduction\n\nExisting audio language models typically rely on task-specific fine-tuning to accomplish particular audio tasks. In contrast, humans are able to generalize to new audio tasks with only a few examples or simple instructions. GPT-3 has shown that scaling next-token prediction pretraining enables strong generalization capabilities in text, and we believe this paradigm is equally applicable to the audio domain. By scaling MiMo-Audio's pretraining data to over one hundred million of hours, we observe the emergence of few-shot learning capabilities across a diverse set of audio tasks. We develop a systematic evaluation of these capabilities and find that MiMo-Audio-7B-Base achieves SOTA performance on both speech intelligence and audio understanding benchmarks among open-source models. Beyond standard metrics, MiMo-Audio-7B-Base generalizes to tasks absent from its training data, such as voice conversi"},{"ref":"P26","kind":"page","title":"XiaomiMiMo/MiMo-7B-MTPs model card","date":"2026-06-11T02:49:53.014243+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-MTPs/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<h3 align=\"center\">\n<b>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\nUnlocking the Reasoning Potential of Language Model<br/>From Pretraining to Posttraining\n<br/>\n<span>━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━</span>\n<br/>\n</b>\n</h3>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://www.modelscope.cn/organization/XiaomiMiMo\" target=\"_blank\">🤖️ ModelScope</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2505.07608\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n<br/>\n\n> This model repository is licensed under the MIT License.\n\n## I. Pretrained MTPs of MiMo-7B\n\nThis model repository contains the pretrained MTP weights of MiMo-7B (`model.mtp_layers.1` and `model.mtp_layers.2`)\n\nCurrently, MiMo-7B model each has 1 MTP layer (`model.mtp_layers.0`). Users may load the weights of pretrained MTPs for potential rollout speedup (please refer to *[Power Up Speculative Decoding In Reinforcement Learning](https://www.notion.so/jiajunli-guapisolo/Power-Up-Speculative-Decoding-In-Reinforcement-Learning-2a92d24a293b802d9c73dbae429e581e)*).\n\n> [!IMPORTANT]\n> We tuned 1 MTP layer in SFT and freeze it in RL, and we **HAVE NOT** test the performance of posttrained models with 2 more pretrained MTP layers.\n\n## II. Contact\n\nPlease contact us at [mimo@xiaomi.com](mailto:mimo@xiaomi.com) or open an issue if you have any questions."},{"ref":"P27","kind":"page","title":"XiaomiMiMo/MiMo-Embodied-7B model card","date":"2026-06-11T02:49:53.008774+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Embodied-7B/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nbase_model:\n- XiaomiMiMo/MiMo-Embodied\nlibrary_name: transformers\nlicense: mit\n---\n\n<div align=\"center\">\n<img src=\"./assets/xfmlogo.svg\" width=600>\n</div>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo/MiMo-Embodied-7B\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://arxiv.org/abs/2511.16518\" target=\"_blank\">📔 Technical Report</a>\n&nbsp;|\n<br/>\n</div>\n\n## I. Introduction\n\n**MiMo-Embodied**, a powerful cross-embodied vision-language model that shows state-of-the-art performance in both **autonomous driving** and **embodied AI tasks**, the first open-source VLM that integrates these two critical areas, significantly enhancing understanding and reasoning in dynamic physical environments.\n\n<div align=\"center\">\n<img src=\"./assets/fig1.svg\" width=800>\n</div>\n\n## II. Model Capabilities\n\n<div align=\"center\">\n<img src=\"./assets/fig2.svg\" width=800>\n</div>\n\n## III. Model Details\n\n<div align=\"center\">\n<img src=\"./assets/fig3_img.png\" width=800>\n</div>\n\n## IV. Evaluation Results\n\nMiMo-Embodied demonstrates superior performance across **17 benchmarks in three key embodied AI capabilities: Task Planning, Affordance Prediction, and Spatial Understanding**, significantly surpassing existing open-source embodied VLM models and rivaling closed-source models.\n\nAdditionally, MiMo-Embodied excels in **12 autonomous driving benchmarks across three key capabilities: Environmental Perception, Status Prediction, and Driving Planning**—significantly outperforming both existing open-source and closed-source VLM models, as well as proprietary VLM models.\n\nMoreover, evaluation on **8 general visual understanding benchmarks** confirms that MiMo-Embodied retains and even strengthens its general capabilities, showing that domain-specialized training enhances rather than diminishes overall model proficiency.\n\n### Embodied AI Benchmarks\n\n#### Affordance & Planning\n\n<div align=\"center\">\n<img src=\"./assets/table2.png\" width=800>\n</div>\n\n#### Spatial Understanding\n\n<div align=\"center\">\n<img src=\"./assets/table3.png\" width=800>\n</div>\n\n### Autonomous Driving Benchmarks\n\n#### Single-View Image & Multi-View Video\n\n<div align=\"center\">"},{"ref":"P28","kind":"page","title":"XiaomiMiMo/MiMo-V2-Flash model card","date":"2026-06-11T02:49:48.056976+00:00","date_source":null,"source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2-Flash/raw/main/README.md","signal_url":null,"signal_json_url":null,"text":"---\nlicense: mit\nlibrary_name: transformers\n---\n\n<br/><br/>\n\n<div align=\"center\">\n<picture>\n<source srcset=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/Xiaomi_MiMo_darkmode.png?raw=true\" media=\"(prefers-color-scheme: dark)\">\n<img src=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/Xiaomi_MiMo.png?raw=true\" width=\"60%\" alt=\"Xiaomi-MiMo\" />\n</picture>\n</div>\n\n<br/>\n\n<div align=\"center\" style=\"line-height: 1;\">\n|\n<a href=\"https://huggingface.co/XiaomiMiMo/MiMo-V2-Flash\" target=\"_blank\">🤗 HuggingFace</a>\n&nbsp;|\n<a href=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/blob/main/paper.pdf\" target=\"_blank\">📔 Technical Report </a>\n&nbsp;|\n<a href=\"https://mimo.xiaomi.com/blog/mimo-v2-flash\" target=\"_blank\">📰 Blog </a>\n&nbsp;|\n<br/><br/>\n<strong>Play around!</strong> &nbsp;\n<a href=\"https://aistudio.xiaomimimo.com\" target=\"_blank\">🗨️ Xiaomi MiMo Studio </a>\n&nbsp;\n<a href=\"https://platform.xiaomimimo.com/\" target=\"_blank\">🎨 Xiaomi MiMo API Platform </a>\n</div>\n<br/>\n\n# MiMo-V2-Flash\n\n**MiMo-V2-Flash** is a Mixture-of-Experts (MoE) language model with **309B total parameters** and **15B active parameters**. Designed for high-speed reasoning and agentic workflows, it utilizes a novel hybrid attention architecture and Multi-Token Prediction (MTP) to achieve state-of-the-art performance while significantly reducing inference costs.\n\n<p align=\"center\">\n<img width=\"80%\" src=\"https://github.com/XiaomiMiMo/MiMo-V2-Flash/raw/main/figures/MiMo-v2-flash-performance.jpg?raw=true\">\n</p>\n\n-----\n\n## 1. Introduction\n\nMiMo-V2-Flash creates a new balance between long-context modeling capability and inference efficiency. Key features include:\n\n* **Hybrid Attention Architecture**: Interleaves Sliding Window Attention (SWA) and Global Attention (GA) with a 5:1 ratio and an aggressive 128-token window. This reduces KV-cache storage by nearly 6x while maintaining long-context performance via learnable **attention sink bias**.\n* **Multi-Token Prediction (MTP)**: Equipped with a lightweight MTP module (0.33B params/block) using dense FFNs. This triples output speed during inference and will be good to accelerates rollout in RL training.\n* **Efficient Pre-Training"},{"ref":"E1","kind":"event","title":"XiaomiMiMo/MiMo-V2-Flash","date":"2025-12-16T08:47:02+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2-Flash","signal_url":"https://onlylabs.fyi/signals/089da2cd-b0f9-4a5b-9f17-d81af8080c20","signal_json_url":"https://onlylabs.fyi/signals/089da2cd-b0f9-4a5b-9f17-d81af8080c20/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2-Flash · signal_desk=releases · occurred_at=2025-12-16T08:47:02+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2-Flash · hf_downloads=70617 · hf_likes=735 · hf_params=309785318400 · pipeline=text-generation · license=mit"},{"ref":"E2","kind":"event","title":"XiaomiMiMo/MiMo-V2.5-Pro","date":"2026-04-27T12:52:53+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro","signal_url":"https://onlylabs.fyi/signals/b8d6c42b-9278-46e3-bac8-47d85cbca081","signal_json_url":"https://onlylabs.fyi/signals/b8d6c42b-9278-46e3-bac8-47d85cbca081/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2.5-Pro · signal_desk=releases · occurred_at=2026-04-27T12:52:53+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro · hf_downloads=72223 · hf_likes=616 · hf_params=1023244718976 · pipeline=text-generation · license=mit"},{"ref":"E3","kind":"event","title":"XiaomiMiMo/MiMo-V2.5","date":"2026-04-27T13:37:38+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2.5","signal_url":"https://onlylabs.fyi/signals/996568a1-38e2-42d9-b2c8-b81ee1fbe615","signal_json_url":"https://onlylabs.fyi/signals/996568a1-38e2-42d9-b2c8-b81ee1fbe615/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2.5 · signal_desk=releases · occurred_at=2026-04-27T13:37:38+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2.5 · hf_downloads=214671 · hf_likes=293 · hf_params=310775040000 · license=mit"},{"ref":"E4","kind":"event","title":"XiaomiMiMo/MiMo-7B-RL","date":"2025-04-29T23:48:55+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL","signal_url":"https://onlylabs.fyi/signals/8d0da666-ad80-4c54-9f4b-ffea0aa195eb","signal_json_url":"https://onlylabs.fyi/signals/8d0da666-ad80-4c54-9f4b-ffea0aa195eb/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-RL · signal_desk=releases · occurred_at=2025-04-29T23:48:55+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-RL · hf_downloads=91706 · hf_likes=276 · hf_params=7833409536 · pipeline=text-generation · license=mit"},{"ref":"E5","kind":"event","title":"XiaomiMiMo/MiMo-VL-7B-RL","date":"2025-05-30T00:37:21+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL","signal_url":"https://onlylabs.fyi/signals/4d93d9cc-e2c6-46e6-831d-bb9be86209f3","signal_json_url":"https://onlylabs.fyi/signals/4d93d9cc-e2c6-46e6-831d-bb9be86209f3/signal.json","text":"model_released · XiaomiMiMo/MiMo-VL-7B-RL · signal_desk=releases · occurred_at=2025-05-30T00:37:21+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL · hf_downloads=3934 · hf_likes=169 · hf_params=8306217216 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E6","kind":"event","title":"XiaomiMiMo/MiMo-Audio-7B-Instruct","date":"2025-09-18T21:28:00+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Instruct","signal_url":"https://onlylabs.fyi/signals/96af2a2a-6bd3-49a6-9c38-34e61096c6dc","signal_json_url":"https://onlylabs.fyi/signals/96af2a2a-6bd3-49a6-9c38-34e61096c6dc/signal.json","text":"model_released · XiaomiMiMo/MiMo-Audio-7B-Instruct · signal_desk=releases · occurred_at=2025-09-18T21:28:00+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Instruct · hf_downloads=53327 · hf_likes=156 · hf_params=8018587648 · pipeline=any-to-any · license=mit"},{"ref":"E7","kind":"event","title":"XiaomiMiMo/MiMo-7B-Base","date":"2025-04-29T23:41:27+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-Base","signal_url":"https://onlylabs.fyi/signals/9d620f56-5336-445f-83b3-1b79da84cce3","signal_json_url":"https://onlylabs.fyi/signals/9d620f56-5336-445f-83b3-1b79da84cce3/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-Base · signal_desk=releases · occurred_at=2025-04-29T23:41:27+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-Base · hf_downloads=145070 · hf_likes=134 · hf_params=7833409536 · pipeline=text-generation · license=mit"},{"ref":"E8","kind":"event","title":"XiaomiMiMo/MiMo-V2.5-ASR","date":"2026-04-23T06:30:09+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2.5-ASR","signal_url":"https://onlylabs.fyi/signals/8db951bb-3b09-42fb-ac10-d71a51974a4c","signal_json_url":"https://onlylabs.fyi/signals/8db951bb-3b09-42fb-ac10-d71a51974a4c/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2.5-ASR · signal_desk=releases · occurred_at=2026-04-23T06:30:09+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2.5-ASR · hf_downloads=1274 · hf_likes=94 · hf_params=7622619136 · pipeline=automatic-speech-recognition · license=mit"},{"ref":"E9","kind":"event","title":"XiaomiMiMo/MiMo-VL-7B-RL-2508","date":"2025-08-07T09:59:03+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508","signal_url":"https://onlylabs.fyi/signals/3d6d2254-1059-49a2-aae2-b486ea7da6aa","signal_json_url":"https://onlylabs.fyi/signals/3d6d2254-1059-49a2-aae2-b486ea7da6aa/signal.json","text":"model_released · XiaomiMiMo/MiMo-VL-7B-RL-2508 · signal_desk=releases · occurred_at=2025-08-07T09:59:03+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL-2508 · hf_downloads=1322 · hf_likes=92 · hf_params=8306217216 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E10","kind":"event","title":"XiaomiMiMo/MiMo-Embodied-7B","date":"2025-11-19T09:34:27+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Embodied-7B","signal_url":"https://onlylabs.fyi/signals/42ee7dc3-481d-41f7-a8ef-90a4343b1882","signal_json_url":"https://onlylabs.fyi/signals/42ee7dc3-481d-41f7-a8ef-90a4343b1882/signal.json","text":"model_released · XiaomiMiMo/MiMo-Embodied-7B · signal_desk=releases · occurred_at=2025-11-19T09:34:27+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-Embodied-7B · hf_downloads=1123 · hf_likes=66 · hf_params=8306217216 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E11","kind":"event","title":"XiaomiMiMo/MiMo-V2.5-Pro-Base","date":"2026-04-27T11:32:54+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro-Base","signal_url":"https://onlylabs.fyi/signals/5bf6b214-2c68-424e-96d1-ba2c9f385e72","signal_json_url":"https://onlylabs.fyi/signals/5bf6b214-2c68-424e-96d1-ba2c9f385e72/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2.5-Pro-Base · signal_desk=releases · occurred_at=2026-04-27T11:32:54+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Pro-Base · hf_downloads=537 · hf_likes=40 · hf_params=1023244718976 · pipeline=text-generation · license=mit"},{"ref":"E12","kind":"event","title":"XiaomiMiMo/MiMo-VL-7B-SFT","date":"2025-05-30T00:44:26+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT","signal_url":"https://onlylabs.fyi/signals/46a3f313-9b91-4fbd-85a3-86e89387b524","signal_json_url":"https://onlylabs.fyi/signals/46a3f313-9b91-4fbd-85a3-86e89387b524/signal.json","text":"model_released · XiaomiMiMo/MiMo-VL-7B-SFT · signal_desk=releases · occurred_at=2025-05-30T00:44:26+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT · hf_downloads=438 · hf_likes=55 · hf_params=8306217216 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E13","kind":"event","title":"XiaomiMiMo/MiMo-Code","date":"2026-06-10T11:52:41+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Code","signal_url":"https://onlylabs.fyi/signals/d063adef-deb0-44b6-84d5-e57dec60e0a6","signal_json_url":"https://onlylabs.fyi/signals/d063adef-deb0-44b6-84d5-e57dec60e0a6/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Code · signal_desk=repos · occurred_at=2026-06-10T11:52:41+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Code · stars=1786 · raw={\"repo\":\"XiaomiMiMo/MiMo-Code\",\"language\":\"TypeScript\"}"},{"ref":"E14","kind":"event","title":"XiaomiMiMo/MiMo-Audio-7B-Base","date":"2025-09-18T20:55:36+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Base","signal_url":"https://onlylabs.fyi/signals/b8cca5f3-da57-4e1b-99ea-66113daa2a8c","signal_json_url":"https://onlylabs.fyi/signals/b8cca5f3-da57-4e1b-99ea-66113daa2a8c/signal.json","text":"model_released · XiaomiMiMo/MiMo-Audio-7B-Base · signal_desk=releases · occurred_at=2025-09-18T20:55:36+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-Audio-7B-Base · hf_downloads=104 · hf_likes=51 · hf_params=8018587648 · pipeline=any-to-any · license=mit"},{"ref":"E15","kind":"event","title":"XiaomiMiMo/MiMo-V2.5-Base","date":"2026-04-27T12:49:45+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Base","signal_url":"https://onlylabs.fyi/signals/b41061f8-212c-40d3-884b-27c014a6081b","signal_json_url":"https://onlylabs.fyi/signals/b41061f8-212c-40d3-884b-27c014a6081b/signal.json","text":"model_released · XiaomiMiMo/MiMo-V2.5-Base · signal_desk=releases · occurred_at=2026-04-27T12:49:45+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-V2.5-Base · hf_downloads=262 · hf_likes=28 · hf_params=310775040000 · license=mit"},{"ref":"E16","kind":"event","title":"XiaomiMiMo/MiMo-Code v0.1.0","date":"2026-06-10T16:05:43+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Code/releases/tag/v0.1.0","signal_url":"https://onlylabs.fyi/signals/832bb2f2-641b-4a1d-90ff-b5305ed0926d","signal_json_url":"https://onlylabs.fyi/signals/832bb2f2-641b-4a1d-90ff-b5305ed0926d/signal.json","text":"release · XiaomiMiMo/MiMo-Code v0.1.0 · signal_desk=releases · occurred_at=2026-06-10T16:05:43+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Code/releases/tag/v0.1.0 · raw={\"repo\":\"XiaomiMiMo/MiMo-Code\"}"},{"ref":"E17","kind":"event","title":"XiaomiMiMo/MiMo-7B-RL-0530","date":"2025-05-30T01:19:37+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530","signal_url":"https://onlylabs.fyi/signals/a588ddc1-7a81-4173-a97c-068e86401461","signal_json_url":"https://onlylabs.fyi/signals/a588ddc1-7a81-4173-a97c-068e86401461/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-RL-0530 · signal_desk=releases · occurred_at=2025-05-30T01:19:37+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-0530 · hf_downloads=6124 · hf_likes=44 · hf_params=7833409536 · pipeline=text-generation · license=mit"},{"ref":"E18","kind":"event","title":"Research Scientist - Pre-training","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/bed723b6-3960-44e9-9978-05604fede1fb","signal_json_url":"https://onlylabs.fyi/signals/bed723b6-3960-44e9-9978-05604fede1fb/signal.json","text":"job_opened · Research Scientist - Pre-training · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E19","kind":"event","title":"AI Infrastructure Engineer","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/13d2ef96-6d50-4a90-b797-5c692212a9c3","signal_json_url":"https://onlylabs.fyi/signals/13d2ef96-6d50-4a90-b797-5c692212a9c3/signal.json","text":"job_opened · AI Infrastructure Engineer · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E20","kind":"event","title":"Research Scientist - Post-training","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/e4045b83-bed9-4d8c-86d8-d7361f6dbb7c","signal_json_url":"https://onlylabs.fyi/signals/e4045b83-bed9-4d8c-86d8-d7361f6dbb7c/signal.json","text":"job_opened · Research Scientist - Post-training · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E21","kind":"event","title":"Research Scientist - Audio Speech","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/da0a2e65-5577-4a5e-9833-d5add3d8e313","signal_json_url":"https://onlylabs.fyi/signals/da0a2e65-5577-4a5e-9833-d5add3d8e313/signal.json","text":"job_opened · Research Scientist - Audio Speech · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E22","kind":"event","title":"Knowledge Engineer","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/3aafff9b-1dac-4cca-9752-e29b06f87c29","signal_json_url":"https://onlylabs.fyi/signals/3aafff9b-1dac-4cca-9752-e29b06f87c29/signal.json","text":"job_opened · Knowledge Engineer · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E23","kind":"event","title":"Research Scientist - Multimodal","date":"2026-06-05T22:36:26.893+00:00","date_source":"source","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/ad4b6210-3bee-487f-aad2-7c63986aa6a7","signal_json_url":"https://onlylabs.fyi/signals/ad4b6210-3bee-487f-aad2-7c63986aa6a7/signal.json","text":"job_opened · Research Scientist - Multimodal · signal_desk=hiring · occurred_at=2026-06-05T22:36:26.893+00:00 · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E24","kind":"event","title":"XiaomiMiMo/MiMo-Audio-Tokenizer","date":"2025-09-18T20:56:05+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-Audio-Tokenizer","signal_url":"https://onlylabs.fyi/signals/0fdea92e-dc3d-4512-ab59-b1561612fad7","signal_json_url":"https://onlylabs.fyi/signals/0fdea92e-dc3d-4512-ab59-b1561612fad7/signal.json","text":"model_released · XiaomiMiMo/MiMo-Audio-Tokenizer · signal_desk=releases · occurred_at=2025-09-18T20:56:05+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-Audio-Tokenizer · hf_downloads=4710 · hf_likes=36 · hf_params=1295799574 · license=mit"},{"ref":"E25","kind":"event","title":"XiaomiMiMo/MiMo-VL-7B-SFT-2508","date":"2025-08-07T09:36:36+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508","signal_url":"https://onlylabs.fyi/signals/960e00ba-b355-4438-b0cf-7f7140441710","signal_json_url":"https://onlylabs.fyi/signals/960e00ba-b355-4438-b0cf-7f7140441710/signal.json","text":"model_released · XiaomiMiMo/MiMo-VL-7B-SFT-2508 · signal_desk=releases · occurred_at=2025-08-07T09:36:36+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-SFT-2508 · hf_downloads=1548 · hf_likes=36 · hf_params=8306217216 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E26","kind":"event","title":"XiaomiMiMo/MiMo-7B-SFT","date":"2025-04-29T23:30:47+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-SFT","signal_url":"https://onlylabs.fyi/signals/16c3487a-753d-4d9e-83c1-d27c3e3569a2","signal_json_url":"https://onlylabs.fyi/signals/16c3487a-753d-4d9e-83c1-d27c3e3569a2/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-SFT · signal_desk=releases · occurred_at=2025-04-29T23:30:47+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-SFT · hf_downloads=1185 · hf_likes=27 · hf_params=7833409536 · pipeline=text-generation · license=mit"},{"ref":"E27","kind":"event","title":"XiaomiMiMo/MiMo-V2.5-ASR","date":"2026-04-23T12:59:55+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-V2.5-ASR","signal_url":"https://onlylabs.fyi/signals/cd0746fc-007a-41ad-8322-f05f4511e585","signal_json_url":"https://onlylabs.fyi/signals/cd0746fc-007a-41ad-8322-f05f4511e585/signal.json","text":"repo_new · XiaomiMiMo/MiMo-V2.5-ASR · signal_desk=repos · occurred_at=2026-04-23T12:59:55+00:00 · url=https://github.com/XiaomiMiMo/MiMo-V2.5-ASR · stars=264 · raw={\"repo\":\"XiaomiMiMo/MiMo-V2.5-ASR\",\"description\":\"Robust Speech Recognition Across Languages, Dialects, and Complex Acoustic Scenarios\",\"language\":\"Python\"}"},{"ref":"E28","kind":"event","title":"XiaomiMiMo/MiMo-V2-Flash","date":"2025-12-15T16:28:22+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-V2-Flash","signal_url":"https://onlylabs.fyi/signals/42a36b31-f9d2-4c0d-8362-5c3457da58b1","signal_json_url":"https://onlylabs.fyi/signals/42a36b31-f9d2-4c0d-8362-5c3457da58b1/signal.json","text":"repo_new · XiaomiMiMo/MiMo-V2-Flash · signal_desk=repos · occurred_at=2025-12-15T16:28:22+00:00 · url=https://github.com/XiaomiMiMo/MiMo-V2-Flash · stars=1334 · hn=3 points/0 comments · raw={\"repo\":\"XiaomiMiMo/MiMo-V2-Flash\",\"description\":\"MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model\"}"},{"ref":"E29","kind":"event","title":"XiaomiMiMo/MiMo-Skills","date":"2026-04-23T12:10:40+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Skills","signal_url":"https://onlylabs.fyi/signals/e2e28c19-79d0-420c-9c88-c214aee7bffc","signal_json_url":"https://onlylabs.fyi/signals/e2e28c19-79d0-420c-9c88-c214aee7bffc/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Skills · signal_desk=repos · occurred_at=2026-04-23T12:10:40+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Skills · stars=70 · raw={\"repo\":\"XiaomiMiMo/MiMo-Skills\",\"description\":\"Agent skills for Xiaomi MiMo series.\",\"language\":\"Python\"}"},{"ref":"E30","kind":"event","title":"XiaomiMiMo/MiMo-7B-RL-Zero","date":"2025-04-29T23:56:19+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-Zero","signal_url":"https://onlylabs.fyi/signals/dc6d4ac0-43c0-4789-8405-e9771475cc5e","signal_json_url":"https://onlylabs.fyi/signals/dc6d4ac0-43c0-4789-8405-e9771475cc5e/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-RL-Zero · signal_desk=releases · occurred_at=2025-04-29T23:56:19+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-RL-Zero · hf_downloads=995 · hf_likes=18 · hf_params=7833409536 · pipeline=text-generation · license=mit"},{"ref":"E31","kind":"event","title":"XiaomiMiMo/MiMo","date":"2025-04-26T09:31:17+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo","signal_url":"https://onlylabs.fyi/signals/a420fc75-34f5-48ee-851c-b2aa59ec4c59","signal_json_url":"https://onlylabs.fyi/signals/a420fc75-34f5-48ee-851c-b2aa59ec4c59/signal.json","text":"repo_new · XiaomiMiMo/MiMo · signal_desk=repos · occurred_at=2025-04-26T09:31:17+00:00 · url=https://github.com/XiaomiMiMo/MiMo · stars=2173 · raw={\"repo\":\"XiaomiMiMo/MiMo\",\"description\":\"MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining\",\"language\":\"Python\"}"},{"ref":"E32","kind":"event","title":"XiaomiMiMo/MiMo-7B-MTPs","date":"2025-11-14T12:44:49+00:00","date_source":"source","source_url":"https://huggingface.co/XiaomiMiMo/MiMo-7B-MTPs","signal_url":"https://onlylabs.fyi/signals/966a0480-8f34-48c2-9c46-8b65e6dd89d9","signal_json_url":"https://onlylabs.fyi/signals/966a0480-8f34-48c2-9c46-8b65e6dd89d9/signal.json","text":"model_released · XiaomiMiMo/MiMo-7B-MTPs · signal_desk=releases · occurred_at=2025-11-14T12:44:49+00:00 · url=https://huggingface.co/XiaomiMiMo/MiMo-7B-MTPs · hf_downloads=43 · hf_likes=7 · hf_params=421580800 · pipeline=image-feature-extraction · license=mit"},{"ref":"E33","kind":"event","title":"XiaomiMiMo/MiMo-Audio","date":"2025-09-19T00:46:49+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Audio","signal_url":"https://onlylabs.fyi/signals/3ec5b187-b604-462b-9037-d69cdcb0ac13","signal_json_url":"https://onlylabs.fyi/signals/3ec5b187-b604-462b-9037-d69cdcb0ac13/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Audio · signal_desk=repos · occurred_at=2025-09-19T00:46:49+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Audio · stars=1046 · raw={\"repo\":\"XiaomiMiMo/MiMo-Audio\",\"description\":\"MiMo-Audio: Audio Language Models are Few-Shot Learners\",\"language\":\"Python\"}"},{"ref":"E34","kind":"event","title":"XiaomiMiMo/MiMo-VL","date":"2025-05-29T06:40:28+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-VL","signal_url":"https://onlylabs.fyi/signals/5b758cf9-56a9-4ee0-9ddf-ebd331db778c","signal_json_url":"https://onlylabs.fyi/signals/5b758cf9-56a9-4ee0-9ddf-ebd331db778c/signal.json","text":"repo_new · XiaomiMiMo/MiMo-VL · signal_desk=repos · occurred_at=2025-05-29T06:40:28+00:00 · url=https://github.com/XiaomiMiMo/MiMo-VL · stars=643 · raw={\"repo\":\"XiaomiMiMo/MiMo-VL\",\"description\":\"MiMo-VL\"}"},{"ref":"E35","kind":"event","title":"XiaomiMiMo/MiMo-Embodied","date":"2025-11-19T08:54:41+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Embodied","signal_url":"https://onlylabs.fyi/signals/174c5775-92dd-4fe2-96fe-7882a7b6b614","signal_json_url":"https://onlylabs.fyi/signals/174c5775-92dd-4fe2-96fe-7882a7b6b614/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Embodied · signal_desk=repos · occurred_at=2025-11-19T08:54:41+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Embodied · stars=386 · raw={\"repo\":\"XiaomiMiMo/MiMo-Embodied\",\"description\":\"MiMo-Embodied\",\"language\":\"Python\"}"},{"ref":"E36","kind":"event","title":"XiaomiMiMo/MiMo-Audio-Tokenizer","date":"2025-09-18T16:02:05+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Tokenizer","signal_url":"https://onlylabs.fyi/signals/cd7b9cc4-6e61-4584-8062-1c5206f6bda8","signal_json_url":"https://onlylabs.fyi/signals/cd7b9cc4-6e61-4584-8062-1c5206f6bda8/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Audio-Tokenizer · signal_desk=repos · occurred_at=2025-09-18T16:02:05+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Audio-Tokenizer · stars=144 · raw={\"repo\":\"XiaomiMiMo/MiMo-Audio-Tokenizer\",\"description\":\"A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.\",\"language\":\"Python\"}"},{"ref":"E37","kind":"event","title":"XiaomiMiMo/MiMo-Audio-Training","date":"2025-10-16T13:52:54+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Training","signal_url":"https://onlylabs.fyi/signals/a854dcf6-ce98-4e5a-bc73-009d41a19cf5","signal_json_url":"https://onlylabs.fyi/signals/a854dcf6-ce98-4e5a-bc73-009d41a19cf5/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Audio-Training · signal_desk=repos · occurred_at=2025-10-16T13:52:54+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Audio-Training · stars=109 · raw={\"repo\":\"XiaomiMiMo/MiMo-Audio-Training\",\"language\":\"Python\"}"},{"ref":"E38","kind":"event","title":"XiaomiMiMo/MiMo-Audio-Eval","date":"2025-09-18T16:01:04+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Eval","signal_url":"https://onlylabs.fyi/signals/c3c70f32-f1b3-4f97-b082-ef5875601ce0","signal_json_url":"https://onlylabs.fyi/signals/c3c70f32-f1b3-4f97-b082-ef5875601ce0/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Audio-Eval · signal_desk=repos · occurred_at=2025-09-18T16:01:04+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Audio-Eval · stars=87 · raw={\"repo\":\"XiaomiMiMo/MiMo-Audio-Eval\",\"language\":\"Python\"}"},{"ref":"E39","kind":"event","title":"XiaomiMiMo/lmms-eval","date":"2025-05-29T22:25:08+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/lmms-eval","signal_url":"https://onlylabs.fyi/signals/b1608d6d-6843-471c-8fdf-283941727e40","signal_json_url":"https://onlylabs.fyi/signals/b1608d6d-6843-471c-8fdf-283941727e40/signal.json","text":"repo_forked · XiaomiMiMo/lmms-eval · signal_desk=forks · occurred_at=2025-05-29T22:25:08+00:00 · url=https://github.com/XiaomiMiMo/lmms-eval · stars=71 · raw={\"repo\":\"XiaomiMiMo/lmms-eval\",\"parent\":\"EvolvingLMMs-Lab/lmms-eval\"}"},{"ref":"E40","kind":"event","title":"XiaomiMiMo/vllm","date":"2025-04-29T06:43:27+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/vllm","signal_url":"https://onlylabs.fyi/signals/22317a95-d4e6-423d-a7c0-222620f2c7eb","signal_json_url":"https://onlylabs.fyi/signals/22317a95-d4e6-423d-a7c0-222620f2c7eb/signal.json","text":"repo_forked · XiaomiMiMo/vllm · signal_desk=forks · occurred_at=2025-04-29T06:43:27+00:00 · url=https://github.com/XiaomiMiMo/vllm · stars=31 · raw={\"repo\":\"XiaomiMiMo/vllm\",\"parent\":\"vllm-project/vllm\"}"},{"ref":"E41","kind":"event","title":"XiaomiMiMo/MiMo-Audio-Demo","date":"2025-09-19T01:01:53+00:00","date_source":"source","source_url":"https://github.com/XiaomiMiMo/MiMo-Audio-Demo","signal_url":"https://onlylabs.fyi/signals/2263ff85-b698-4990-95e2-746b7c574f08","signal_json_url":"https://onlylabs.fyi/signals/2263ff85-b698-4990-95e2-746b7c574f08/signal.json","text":"repo_new · XiaomiMiMo/MiMo-Audio-Demo · signal_desk=repos · occurred_at=2025-09-19T01:01:53+00:00 · url=https://github.com/XiaomiMiMo/MiMo-Audio-Demo · stars=11 · raw={\"repo\":\"XiaomiMiMo/MiMo-Audio-Demo\",\"language\":\"HTML\"}"},{"ref":"E42","kind":"event","title":"AI Infrastructure Engineer","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/90e39025-4ca2-42bd-9c2e-698f4cfe7144","signal_json_url":"https://onlylabs.fyi/signals/90e39025-4ca2-42bd-9c2e-698f4cfe7144/signal.json","text":"job_opened · AI Infrastructure Engineer · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E43","kind":"event","title":"Research Scientist - Pre-training","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/2fd59d7d-7292-4c49-99c7-5205717cd9ae","signal_json_url":"https://onlylabs.fyi/signals/2fd59d7d-7292-4c49-99c7-5205717cd9ae/signal.json","text":"job_opened · Research Scientist - Pre-training · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E44","kind":"event","title":"Research Scientist - Multimodal","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/920f308e-d1fe-47b5-9239-bc9e4035c13a","signal_json_url":"https://onlylabs.fyi/signals/920f308e-d1fe-47b5-9239-bc9e4035c13a/signal.json","text":"job_opened · Research Scientist - Multimodal · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E45","kind":"event","title":"Research Scientist - Post-training","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/37ff766d-aab6-48b6-b420-b3f08d88e7ab","signal_json_url":"https://onlylabs.fyi/signals/37ff766d-aab6-48b6-b420-b3f08d88e7ab/signal.json","text":"job_opened · Research Scientist - Post-training · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E46","kind":"event","title":"Research Scientist - Audio Speech","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/b34a26ca-66cf-4751-948c-ff89aef3c44f","signal_json_url":"https://onlylabs.fyi/signals/b34a26ca-66cf-4751-948c-ff89aef3c44f/signal.json","text":"job_opened · Research Scientist - Audio Speech · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"},{"ref":"E47","kind":"event","title":"Knowledge Engineer","date":"unknown","date_source":"missing","source_url":"https://mimo.xiaomi.com/index#joinUs","signal_url":"https://onlylabs.fyi/signals/d8de6e7f-c220-4d50-98e8-ac8871120f22","signal_json_url":"https://onlylabs.fyi/signals/d8de6e7f-c220-4d50-98e8-ac8871120f22/signal.json","text":"job_opened · Knowledge Engineer · signal_desk=hiring · occurred_at=unknown · url=https://mimo.xiaomi.com/index#joinUs · raw={\"ats\":\"agent\"}"}]}