Mistral Pushes Vibe to the Cloud: Async Coding Agents and Medium 3.5 at 77.6% SWE-Bench — Europe Joins the Agentic Race
Mistral launched remote agents for Vibe in late April 2026, powered by Mistral Medium 3.5 (128B, 256k context). SWE-Bench Verified: 77.6%. Full technical breakdown.

While Google was rolling out Gemini Spark at I/O and Anthropic was chasing a $900 billion valuation, Mistral AI shipped the most strategic update in its history on April 29, 2026: Vibe, its vibecoding CLI launched in late 2025, gets remote agents that run in the cloud. And the new model Mistral Medium 3.5 scores 77.6% on SWE-Bench Verified — neck-and-neck with Claude Opus 4.7 (87%) and GPT-6 (88%). Europe has joined the agentic race. And it's playing its own hand.
Vibe Leaves Your Laptop — Without Breaking the Session
Before this update, Vibe was a local CLI. You'd start a session, the agent would write code in your terminal, refactor modules, run tests. When you closed the laptop, the session died with it.
With remote agents, Vibe teleports the local session into Mistral's cloud. Conversation history, task state, pending approvals — all transferred. The agent keeps going, in parallel, without blocking your terminal. When it's done, it opens a GitHub pull request and pings you. You come back to review the result, not to validate every keystroke.
This is the exact pattern Factory AI popularized with Droids and that Claude Code 2.1.7 brought with Agent Teams. But Mistral adds two crucial differences: the session is mobile (you can teleport it either way) and the model is open-weight — therefore self-hostable.
Mistral Medium 3.5: 128B Dense, 256k Context, Merged Model
The model powering Vibe in cloud mode is Mistral Medium 3.5 — announced the same day. It breaks Mistral's tradition of specialized models (Magistral for reasoning, Devstral for code, Magistral for chat). This time, Mistral merged everything into a single weight set, with configurable reasoning effort per request.
Technical specs:
- 128 billion parameters, dense architecture (no MoE).
- 256,000 tokens of context in general availability.
- Multimodal: text + image input, with vision encoder trained from scratch, supporting variable sizes and aspect ratios.
- Open weight under Apache 2.0 license — self-hostable, fine-tunable.
Published benchmark numbers:
| Benchmark | Mistral Medium 3.5 | Claude Opus 4.7 | GPT-6 |
|---|---|---|---|
| SWE-Bench Verified | 77.6% | 87.6% | 88.0% |
| τ³-Telecom | 91.4% | n/a | n/a |
| MMLU | 84.2% | 89.1% | 90.3% |
Mistral doesn't match Anthropic or OpenAI on closed models. But the gap is narrowing — and most importantly, Medium 3.5 is open. For teams that cannot send code to an external API (banks, defense, healthcare), this is the best trade-off on the market.
Pricing: $7.50 per Million Output Tokens
On API pricing, Mistral lands above its own Small series but well below Opus:
| Model | Input ($/M tokens) | Output ($/M tokens) |
|---|---|---|
| Mistral Medium 3.5 | $2.00 | $7.50 |
| Claude Opus 4.7 | $15.00 | $75.00 |
| GPT-6 (rumored) | $12.00 | $60.00 |
For agentic coding workloads — where you burn millions of output tokens per day — the gap is massive. An agent running 24/7 on Vibe + Medium 3.5 costs 10× less than a Claude Opus equivalent. That's Mistral's bet: agentic will commoditize, and price will pick the winner.
Vibe vs Claude Code vs Cursor vs Factory Droids
The vibecoding market in May 2026 now has four major players. Here's how Vibe stacks up.
| Tool | Model | Cloud agents | Open weight | Mobile session | Auto PR |
|---|---|---|---|---|---|
| Mistral Vibe | Medium 3.5 | ✅ | ✅ | ✅ | ✅ |
| Claude Code 2.1.7 | Opus 4.7 | ✅ Agent Teams | ❌ | ❌ | ✅ |
| Cursor v2.6 | Composer 2 (mix) | ❌ | ❌ | ❌ | Partial |
| Factory Droids | Multi-model | ✅ | ❌ | ❌ | ✅ |
Vibe is the only one combining open weight + cloud agents + mobile session. Claude Code dominates on multi-agent coordination (Agent Teams). Cursor keeps its edge on IDE integration — but the Kimi scandal damaged trust. Factory targets the heavy enterprise.
The European Angle: Tech Sovereignty
Mistral also leans on an argument its competitors cannot use: sovereignty. As France investigates Big Tech and the UK launches a $500M sovereign AI fund, having an open-weight European frontier model becomes a geopolitical asset.
Mistral Medium 3.5 is hostable on OVHcloud, Scaleway, and any EU sovereign cloud. Defense teams, European banks (BNP, Société Générale, Deutsche Bank), insurers and hospitals that can't ship code to US AWS now have a credible option. It's not a footnote — it's the ground Mistral will capture its top 100 European enterprise customers on.
What's Next? Vibe Studio Expected This Summer
Mistral slipped into the blog post that a Vibe Studio — visual UI to orchestrate multiple remote sessions, monitor costs and manage approvals — is in private beta. Public release expected summer 2026.
The pattern is clear: Mistral is building its own end-to-end agentic stack — model (Medium 3.5), runtime (Vibe), interface (Vibe Studio), cloud (OVH/Scaleway partnerships). Same verticalization that Google has with Antigravity or Anthropic with Claude Code. The difference: Mistral does it all open weight.
Key Takeaways
- Mistral Vibe gains remote agents on April 29, 2026: vibecoding sessions teleport to Mistral's cloud and run in parallel.
- Agents open GitHub pull requests when work is done — humans review, no longer write.
- Mistral Medium 3.5 is a 128B dense model with 256k context, multimodal, Apache 2.0 open weight.
- Scores 77.6% on SWE-Bench Verified — behind Claude Opus 4.7 and GPT-6, but at 10× less cost on output.
- Strong European angle: self-hostable on OVH, Scaleway and EU sovereign clouds — defense, banks, healthcare targeting.
Mistral doesn't need to match OpenAI or Anthropic head-on to win. It just needs to be good enough, open, and 10× cheaper on the segment that matters: 24/7 agentic coding workloads. Europe has its champion. Full details on Mistral Vibe's product page and the announcement blog.


