Skip to content
News

Mistral’s Codestral 25B Is the Open-Source Coding Model That Actually Runs on Your Hardware

Mistral AI’s Codestral 25B scores 81.1% on HumanEval, runs on consumer GPUs, and integrates with VS Code and JetBrains out of the box.

3 min read
Mistral's Codestral 25B Is the Open-Source Coding Model That Actually Runs on Your Hardware

Mistral AI released Codestral 25B in March 2024, and the coding world quietly had a moment. A 25-billion parameter model, fully open-source, commercially licensed, integrated into VS Code and JetBrains IDEs, scoring 81.1% on HumanEval — and you can run it on a gaming GPU you probably already own. That’s not a press release talking point. That’s the actual situation.

For developers who’ve been quietly annoyed by API rate limits, per-token pricing, and the general indignity of sending their proprietary code to someone else’s server, Codestral arrives as a genuinely useful alternative — not just a benchmark trophy.

What Codestral 25B Actually Is

The model was trained on over 500 billion tokens of code data spanning 80+ programming languages. Its context window sits at 32,000 tokens, which is large enough to handle real files, not just toy examples. On HumanEval — the standard pass-or-fail Python coding benchmark — it scores 81.1% (Pass@1). On MBPP, a practical Python task benchmark, it hits 78.6%. On RepoBench, which tests repository-level code understanding rather than isolated snippets, it reaches 60%.

Those numbers put it in the same conversation as GPT-4o Mini on coding tasks, which is exactly the comparison Mistral wants you to make. The honest version: for pure code generation, Codestral is competitive. For everything else GPT-4o Mini does — reasoning, multimodal tasks, general Q&A — it’s not in the race. Codestral was built to write code, not to be a Swiss Army knife.

Open-Source, But With Paperwork

The licensing situation is dual-track. Research use falls under the Mistral Research License. Commercial use requires the Mistral Commercial License. Neither is the AGPL restrictions you’d get from some other open models, which makes this more practical for teams who want to deploy Codestral inside a product without a legal headache. It’s not MIT-licensed freedom, but it’s workable.

The model is available on Hugging Face under the model ID mistral-community/Codestral-25B and through Mistral’s own API. If you want to run it locally, you can — 25 billion parameters in 4-bit quantization fits on a single GPU with 24GB of VRAM, which covers cards like the RTX 3090, RTX 4090, or equivalent professional hardware. Not a laptop setup, but not a data center either.

IDE Integrations That Don’t Require a PhD to Configure

Mistral shipped Codestral with day-one integrations for VS Code and JetBrains IDEs — IntelliJ IDEA, PyCharm, and WebStorm specifically. Both are available through their respective plugin marketplaces, meaning the setup path is install plugin, point it at your Mistral API key or local endpoint, start coding. Developers who’ve used GitHub Copilot will recognize the workflow immediately: inline completions, fill-in-the-middle suggestions, and the familiar experience of an AI that finishes your functions before you do.

Running it locally through the API means zero telemetry on your code reaching Mistral’s servers. For teams working on proprietary codebases where every keystroke sent to a third party is a compliance conversation waiting to happen, that matters significantly.

Why This Is Actually a Big Deal

The real story here isn’t that Codestral beats GPT-4o Mini — the benchmark gaps are close enough that “beats” is generous. The story is that a model of this quality now runs on hardware that individual developers own, under a license that allows commercial use, with integrations that make it immediately practical. That combination didn’t really exist before at this level of code quality.

Proprietary coding tools like GitHub Copilot and Cursor still have advantages: continuous updates, larger backing models under the hood, and refinement from millions of real-world sessions. But Codestral is fine-tunable. A team can take the base model, train it on their internal codebase, and end up with something that knows their architecture, their conventions, and their preferred patterns. OpenAI won’t give you that with GPT-4o Mini.

What’s Next

Mistral has a track record of iterating fast — Mistral 7B, Mixtral 8x7B, and now Codestral each raised the bar for what open models could do at their size. The question isn’t whether Codestral 25B is the permanent ceiling for open-source coding models. It isn’t. The question is how quickly the community builds on it, fine-tunes it, and integrates it into tools that make the proprietary alternatives feel expensive for what they offer. That process is already underway.

author avatar
promptyze

promptyze

ADMINISTRATOR