* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Saturday, February 28, 2026
Earth-News
  • Home
  • Business
  • Entertainment

    Golden Entertainment, Inc. (GDEN) director receives RSUs and common shares – Stock Titan

    What the Future Holds for UK Entertainment in 2025

    Discover the All-New ‘ABC Entertainment Update’ Podcast – Your Ultimate Source for the Latest in TV!

    Discover Thrilling Adventures Awaiting Every Movie Lover at the Film Festival

    Dorset Players offer themes of community in popular stage event – Bennington Banner

    GrayCo Grows Its Portfolio with Exciting New Multifamily Property in Charlotte’s Thriving Arts and Entertainment District

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    How NVIDIA’s Evolution into the “Berkshire of Technology” Could Unlock Huge Shareholder Gains

    Trump Media & Technology Group Under Fire for Algorithmic Misconduct: Impact on Its Valuation Explained

    Flexport Launches Groundbreaking Technology to Automate Tariff Refunds

    EU and Nigeria Kick Off Exciting New Partnership in Groundbreaking Science & Technology

    Hormel Creates, Fills New Chief Technology Officer Position – Meatingplace

    How Colt Technology is Driving the Future of AI Innovation

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment

    Golden Entertainment, Inc. (GDEN) director receives RSUs and common shares – Stock Titan

    What the Future Holds for UK Entertainment in 2025

    Discover the All-New ‘ABC Entertainment Update’ Podcast – Your Ultimate Source for the Latest in TV!

    Discover Thrilling Adventures Awaiting Every Movie Lover at the Film Festival

    Dorset Players offer themes of community in popular stage event – Bennington Banner

    GrayCo Grows Its Portfolio with Exciting New Multifamily Property in Charlotte’s Thriving Arts and Entertainment District

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    How NVIDIA’s Evolution into the “Berkshire of Technology” Could Unlock Huge Shareholder Gains

    Trump Media & Technology Group Under Fire for Algorithmic Misconduct: Impact on Its Valuation Explained

    Flexport Launches Groundbreaking Technology to Automate Tariff Refunds

    EU and Nigeria Kick Off Exciting New Partnership in Groundbreaking Science & Technology

    Hormel Creates, Fills New Chief Technology Officer Position – Meatingplace

    How Colt Technology is Driving the Future of AI Innovation

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

Large Enough – Mistral AI

July 24, 2024
in Technology
Large Enough – Mistral AI
Share on FacebookShare on Twitter

Detailed benchmarks

This latest generation continues to push the boundaries of cost efficiency, speed, and performance. Mistral Large 2 is exposed on la Plateforme and enriched with new features to facilitate building innovative AI applications.

Mistral Large 2

Mistral Large 2 has a 128k context window and supports dozens of languages including French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean, along with 80+ coding languages including Python, Java, C, C++, JavaScript, and Bash.

Mistral Large 2 is designed for single-node inference with long-context applications in mind – its size of 123 billion parameters allows it to run at large throughput on a single node.
We are releasing Mistral Large 2 under the Mistral Research License, that allows usage and modification for research and non-commercial usages. For commercial usage of Mistral Large 2 requiring self-deployment, a Mistral Commercial License must be acquired by contacting us.

General performance

Mistral Large 2 sets a new frontier in terms of performance / cost of serving on evaluation metrics. In particular, on MMLU, the pretrained version achieves an accuracy of 84.0%, and sets a new point on the performance/cost Pareto front of open models.

Code & Reasoning

Following our experience with Codestral 22B and Codestral Mamba, we trained Mistral Large 2 on a very large proportion of code. Mistral Large 2 vastly outperforms the previous Mistral Large, and performs on par with leading models such as GPT-4o, Claude 3 Opus, and Llama 3 405B.

Detailed benchmarks

A significant effort was also devoted to enhancing the model’s reasoning capabilities. One of the key focus areas during training was to minimize the model’s tendency to “hallucinate” or generate plausible-sounding but factually incorrect or irrelevant information. This was achieved by fine-tuning the model to be more cautious and discerning in its responses, ensuring that it provides reliable and accurate outputs.

Additionally, the new Mistral Large 2 is trained to acknowledge when it cannot find solutions or does not have sufficient information to provide a confident answer. This commitment to accuracy is reflected in the improved model performance on popular mathematical benchmarks, demonstrating its enhanced reasoning and problem-solving skills:

Detailed benchmarks

Performance accuracy on code generation benchmarks (all models were benchmarked through the same evaluation pipeline)

Detailed benchmarks

Performance accuracy on MultiPL-E (all models were benchmarked through the same evaluation pipeline, except for the “paper” row)

Detailed benchmarks

Performance accuracy on GSM8K (8-shot) and MATH (0-shot, no CoT) generation benchmarks (all models were benchmarked through the same evaluation pipeline)

Instruction following & Alignment

We drastically improved the instruction-following and conversational capabilities of Mistral Large 2. The new Mistral Large 2 is particularly better at following precise instructions and handling long multi-turn conversations. Below we report the performance on MT-Bench, Wild Bench, and Arena Hard benchmarks:

Detailed benchmarks

Performance on general alignment benchmarks (all models were benchmarked through the same evalutation pipeline)

On some benchmarks, generating lengthy responses tends to improve the scores. However, in many business applications, conciseness is paramount – short model generations facilitate quicker interactions and are more cost-effective for inference. This is why we spent a lot of effort to ensure that generations remain succinct and to the point whenever possible. The graph below reports the average length of generations of different models on questions from the MT Bench benchmark:

MT Bench benchmarks

Language diversity

A large fraction of business use cases today involve working with multilingual documents. While the majority of models are English-centric, the new Mistral Large 2 was trained on a large proportion of multilingual data. In particular, it excels in English, French, German, Spanish, Italian, Portuguese, Dutch, Russian, Chinese, Japanese, Korean, Arabic, and Hindi. Below are the performance results of Mistral Large 2 on the multilingual MMLU benchmark, compared to the previous Mistral Large, Llama 3.1 models, and to Cohere’s Command R+.

Detailed benchmarks

Detailed benchmarks

Performance on Multilingual MMLU (measured on the base pretrained model)

Tool Use & Function Calling

Mistral Large 2 is equipped with enhanced function calling and retrieval skills and has undergone training to proficiently execute both parallel and sequential function calls, enabling it to serve as the power engine of complex business applications.

Detailed benchmarks

Try Mistral Large 2 on la Plateforme

You can use Mistral Large 2 today via la Plateforme under the name mistral-large-2407, and test it on le Chat. It is available under the version 24.07 (a YY.MM versioning system that we are applying to all our models), and the API name mistral-large-2407. Weights for the instruct model are available and are also hosted on HuggingFace.

we are consolidating the offering on la Plateforme around two general purpose models, Mistral Nemo and Mistral Large, and two specialist models, Codestral and Embed. As we progressively deprecate older models on la Plateforme, all Apache models (Mistral 7B, Mixtral 8x7B and 8x22B, Codestral Mamba, Mathstral) remain available for deployment and fine-tuning using our SDK mistral-inference and mistral-finetune.

Starting today, we are extending fine-tuning capabilities on la Plateforme: those are now available for Mistral Large, Mistral Nemo and Codestral.

Access Mistral models through cloud service providers

We are proud to partner with leading cloud service providers to bring the new Mistral Large 2 to a global audience. In particular, today we are expanding our partnership with Google Cloud Platform to bring Mistral AI’s models on Vertex AI via a Managed API. Mistral AI’s best models are now available on Vertex AI, in addition to Azure AI Studio, Amazon Bedrock and IBM watsonx.ai.

Availability timeline of Mistral AI models

Detailed benchmarks

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Hacker News – https://mistral.ai/news/mistral-large-2407/

Tags: Enoughlargetechnology
Previous Post

Browns place RB Nick Chubb on PUP list as he recovers from knee surgery

Next Post

AI models collapse when trained on recursively generated data

Friday Harbor Port staff frustrated with 7-year-long delay caused by Ecology in cleaning up Jensen’s Boatyard – San Juan Islander

February 28, 2026

Why This Critic Is Stuck in Outdated Arguments on ID

February 28, 2026

Ignoring Climate Science Jeopardizes U.S. Military Readiness – Congress Must Take Immediate Action

February 28, 2026

Unraveling the Hidden Factors Behind Diabetes Risk in Indians: Genes, Diet, and Lifestyle Explained

February 28, 2026

How Sanae Takaichi’s Rise Is Transforming the Future of China Relations

February 28, 2026

WATCH: Trump Unveils Bold Economic Vision and Revives “Drill Baby Drill” Agenda

February 28, 2026

Golden Entertainment, Inc. (GDEN) director receives RSUs and common shares – Stock Titan

February 28, 2026

San Mateo County Warns Community of Potential Measles Exposure at Panda Express

February 28, 2026

U.S. Hockey Team Clinches Gold in a Triumph That Transcends Politics

February 28, 2026

How NVIDIA’s Evolution into the “Berkshire of Technology” Could Unlock Huge Shareholder Gains

February 28, 2026

Categories

Archives

February 2026
M T W T F S S
 1
2345678
9101112131415
16171819202122
232425262728  
« Jan    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (1,095)
  • Economy (1,112)
  • Entertainment (21,989)
  • General (20,140)
  • Health (10,152)
  • Lifestyle (1,127)
  • News (22,149)
  • People (1,117)
  • Politics (1,129)
  • Science (16,327)
  • Sports (21,614)
  • Technology (16,094)
  • World (1,104)

Recent News

Friday Harbor Port staff frustrated with 7-year-long delay caused by Ecology in cleaning up Jensen’s Boatyard – San Juan Islander

February 28, 2026

Why This Critic Is Stuck in Outdated Arguments on ID

February 28, 2026
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version