[P] We just open-sourced the first full-stack Deep Research: agent + model + data + training—reproducible GAIA 82.4

By skyforbes Nov 29, 2025 No Comments

https://i.redd.it/b9goy7brywhf1.gif

We’re releasing MiroMind Open eep Research (OR) v0.1, which we believe is the first full-stack, fully open-source deep research project—not just an agent, but also the model, dataset, and training/RL infra are open and reproducible. The agent framework (MiroFlow) reproduces 82.4 on GAIA validation; the model series (MiroThinker) reaches 60.2% on GAIA-Text-103. Looking for contributors + repro logs.

Why this matters

Full-stack openness: most deep-research releases stop at the agent; OR opens all four layers: Agent (MiroFlow), Model (MiroThinker), ata (MiroVerse), Training/RL (MiroTrain / MiroRL).
Reproducible numbers: • MiroFlow: GAIA validation maj. vote 82.4, pass@1 avg@3 72.2 (with setup details & scripts). • MiroThinker v0.1: 60.2% on GAIA-Text-103 (with both SFT & PO variants across 8B/14B/32B).
Open data at scale: MiroVerse v0.1—147k+ full rollout trajectories (~1.9B tokens, 602k+ tool calls), built for tool-use/web-browsing agents.

What’s included

MiroFlow (Agent framework) – multi-tool, sub-agent orchestration, MCP integration, benchmarking UI; detailed GAIA runs & scripts.
MiroThinker (Model series) – agentic LLMs optimized for deep research; SFT/PO at 8B/14B/32B with evaluation guides.
MiroVerse (ataset) – 147k+ verified trajectories across multi-hop QA, browsing, scientific reasoning; hybrid licensing noted on card.
MiroTrain / MiroRL (Training & RL) – end-to-end post-training + MCP-first RL for tool-using agents.

Quick start (agent eval)

MiroFlow: clone, set keys (OpenRouter/Anthropic/OpenAI/Gemini, Serper, Jina, E2B), optional E2B ocker sandbox for stable repro; run GAIA scripts.
MiroThinker: pull model from HF or self-host via SGLang; run GAIA-Validation / GAIA-Text-103 / HLE / WebWalkerQA scripts.

[P] We just open-sourced the first full-stack Deep Research: agent + model + data + training—reproducible GAIA 82.4

Why this matters

What’s included

Quick start (agent eval)

Links

Like this:

By skyforbes

Leave a ReplyCancel reply

You Missed

Fix Epson Printer Error Code 0x69

Chad’s Jeep ET, and Chet’s Jeep ET: How would operating a Jeep ET be like when utilizing ChatGPT as its AI? Also, what if there was a Jeep ET commercial where Chad and Chet are the drivers?

ChatGPT Prompt Handbook

SOMEONE IS ALWAYS LISTENING YOUR CHATS AND YOUR LIVE CHATS WITH GEMINI

Archives

[P] We just open-sourced the first full-stack Deep Research: agent + model + data + training—reproducible GAIA 82.4

Why this matters

What’s included

Quick start (agent eval)

Links

Like this:

By skyforbes

Related Posts

[P] sklearn-migrator – A library to migrate scikit-learn models across versions

[R] LLMs Have a Heart of Stone: Demystifying the Soft Thinking Ability of Large Reasoning Models

[D] GSPO: Qwen3’s sequence-level RLHF method vs. GRPO – stability & scaling analysis

Leave a ReplyCancel reply

You Missed

Fix Epson Printer Error Code 0x69

Chad’s Jeep ET, and Chet’s Jeep ET: How would operating a Jeep ET be like when utilizing ChatGPT as its AI? Also, what if there was a Jeep ET commercial where Chad and Chet are the drivers?

ChatGPT Prompt Handbook

SOMEONE IS ALWAYS LISTENING YOUR CHATS AND YOUR LIVE CHATS WITH GEMINI