this post was submitted on 17 May 2025

Artificial Intelligence


MiMo-7B is a series of reasoning-focused language models trained from scratch, demonstrating that small models can achieve exceptional mathematical and code reasoning, even outperforming larger 32B models. Key innovations include:

  • Pre-training optimizations: Enhanced data pipelines, multi-dimensional filtering, and a three-stage data mixture (25T tokens) with Multiple-Token Prediction for improved reasoning (a minimal sketch of the MTP objective follows this list).
  • Post-training techniques: Curated 130K math/code problems with rule-based rewards, a difficulty-driven code reward for sparse tasks, and data re-sampling to stabilize RL training (see the reward sketch below).
  • RL infrastructure: A Seamless Rollout Engine accelerates training/validation by 2.29×/1.96×, paired with robust inference support. MiMo-7B-RL matches OpenAI’s o1-mini on reasoning tasks, with all models (base, SFT, RL) open-sourced to advance the community’s development of powerful reasoning LLMs.
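
The Multiple-Token Prediction objective mentioned in the first bullet can be illustrated with a small sketch: auxiliary heads predict tokens several positions ahead in addition to the standard next-token target. This is a minimal PyTorch illustration under assumed shapes and head layout, not MiMo's actual architecture.

```python
# Minimal sketch of a Multiple-Token Prediction (MTP) objective.
# The extra-head layout, shapes, and loss weighting here are assumptions
# for illustration, not MiMo's actual architecture.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MTPHeads(nn.Module):
    def __init__(self, hidden_size: int, vocab_size: int, n_future: int = 2):
        super().__init__()
        # One linear head per future offset (t+1, t+2, ...).
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_size, vocab_size) for _ in range(n_future)]
        )

    def forward(self, hidden: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
        # hidden:  (batch, seq_len, hidden_size) from the backbone transformer
        # targets: (batch, seq_len) ground-truth token ids
        loss = torch.zeros((), device=hidden.device)
        for offset, head in enumerate(self.heads, start=1):
            logits = head(hidden[:, :-offset, :])   # predict the token at position t + offset
            labels = targets[:, offset:]            # shift targets by the same offset
            loss = loss + F.cross_entropy(
                logits.reshape(-1, logits.size(-1)), labels.reshape(-1)
            )
        return loss / len(self.heads)               # average over the extra heads

# Toy usage: batch of 2 sequences, 16 tokens, 32-dim hidden states, 100-token vocab.
mtp = MTPHeads(hidden_size=32, vocab_size=100)
hidden = torch.randn(2, 16, 32)
targets = torch.randint(0, 100, (2, 16))
print(mtp(hidden, targets))
```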

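The rule-based, difficulty-driven code reward from the second bullet can likewise be sketched: rather than a sparse pass/fail signal, each test case contributes reward in proportion to an assumed difficulty weight. The weights and the helper below are hypothetical, not MiMo's published reward function.

```python
# Sketch of a rule-based, difficulty-driven code reward: each unit test
# contributes reward in proportion to an assumed difficulty weight, giving a
# dense signal in [0, 1] instead of a sparse all-or-nothing pass/fail.
# The weights and test results below are hypothetical.
from typing import Sequence

def difficulty_weighted_reward(passed: Sequence[bool],
                               difficulty: Sequence[float]) -> float:
    """Return partial credit weighted by per-test difficulty."""
    total = sum(difficulty)
    if total == 0:
        return 0.0
    earned = sum(w for ok, w in zip(passed, difficulty) if ok)
    return earned / total

# Example: three easy tests pass, the hardest one fails -> reward 0.5
print(difficulty_weighted_reward([True, True, True, False], [1.0, 1.0, 1.0, 3.0]))
```
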
An in-depth discussion of MiMo-7B: https://www.youtube.com/watch?v=y6mSdLgJYQY

1 comment
ikidd@lemmy.world 2 points 1 week ago

I'd love to see how it works on LM Studio. That's a small enough model to run locally.
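
For anyone wanting to try it outside LM Studio, a minimal local-inference sketch with Hugging Face transformers might look like the following; the repository id and generation settings are assumptions rather than the official usage instructions, so check the model card before running.

```python
# Minimal local-inference sketch with Hugging Face transformers.
# The repository id "XiaomiMiMo/MiMo-7B-RL" and generation settings are
# assumptions; consult the official model card for the recommended setup.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "XiaomiMiMo/MiMo-7B-RL"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto", trust_remote_code=True
)

prompt = "Prove that the sum of two even integers is even."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```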