* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Monday, May 19, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Why Was Kanye West’s South Korea Tour Cancelled? – Yahoo

    Unraveling the Mystery: Why Kanye West’s South Korea Tour Was Canceled

    Entertainment Spotlight: ‘Georgie & Mandy’s First Marriage’ – Atlanta News First

    ‘Final Destination: Bloodlines’ tops box office while The Weeknd’s movie falters – Yakima Herald-Republic

    Final Destination: Bloodlines Dominates the Box Office as The Weeknd’s Film Struggles

    Country Music Legend Bids Heartfelt Farewell: ‘Y’all Gonna Make Me Tear Up!

    We won’t get a Game of Thrones show this year: A Knight of the Seven Kingdoms shifts to early 2026 – Entertainment Weekly

    Game of Thrones Fans Will Have to Wait: A Knight of the Seven Kingdoms Delayed Until 2026!

    Nile Entertainment Secures African Rights for Thrilling Action Film ‘Son of the Soil

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    JPMorgan Chase plans to spend $18B in technology in 2025 (JPM:NYSE) – Seeking Alpha

    JPMorgan Chase Unveils Ambitious $18 Billion Tech Investment Plan for 2025!

    Nvidia plans to sell tech to speed AI chip communication – The Indian Express

    Nvidia Unveils Game-Changing Technology to Accelerate AI Chip Communication!

    Murfreesboro LPR technology helps catch suspect in Henry County homicide case – WKRN News 2

    Murfreesboro LPR technology helps catch suspect in Henry County homicide case – WKRN News 2

    How will BCI technology change the lives of people with disabilities? – news.cgtn.com

    Transforming Lives: The Impact of BCI Technology on People with Disabilities

    Super Speeders are deadly. This technology can slow them down. – Popular Science

    Revolutionary Technology: Taming the Threat of Super Speeders!

    Celebrating Success: Highlights from the Collaborative College for Technology & Leadership Graduation Ceremony

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Why Was Kanye West’s South Korea Tour Cancelled? – Yahoo

    Unraveling the Mystery: Why Kanye West’s South Korea Tour Was Canceled

    Entertainment Spotlight: ‘Georgie & Mandy’s First Marriage’ – Atlanta News First

    ‘Final Destination: Bloodlines’ tops box office while The Weeknd’s movie falters – Yakima Herald-Republic

    Final Destination: Bloodlines Dominates the Box Office as The Weeknd’s Film Struggles

    Country Music Legend Bids Heartfelt Farewell: ‘Y’all Gonna Make Me Tear Up!

    We won’t get a Game of Thrones show this year: A Knight of the Seven Kingdoms shifts to early 2026 – Entertainment Weekly

    Game of Thrones Fans Will Have to Wait: A Knight of the Seven Kingdoms Delayed Until 2026!

    Nile Entertainment Secures African Rights for Thrilling Action Film ‘Son of the Soil

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    JPMorgan Chase plans to spend $18B in technology in 2025 (JPM:NYSE) – Seeking Alpha

    JPMorgan Chase Unveils Ambitious $18 Billion Tech Investment Plan for 2025!

    Nvidia plans to sell tech to speed AI chip communication – The Indian Express

    Nvidia Unveils Game-Changing Technology to Accelerate AI Chip Communication!

    Murfreesboro LPR technology helps catch suspect in Henry County homicide case – WKRN News 2

    Murfreesboro LPR technology helps catch suspect in Henry County homicide case – WKRN News 2

    How will BCI technology change the lives of people with disabilities? – news.cgtn.com

    Transforming Lives: The Impact of BCI Technology on People with Disabilities

    Super Speeders are deadly. This technology can slow them down. – Popular Science

    Revolutionary Technology: Taming the Threat of Super Speeders!

    Celebrating Success: Highlights from the Collaborative College for Technology & Leadership Graduation Ceremony

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

Large Enough – Mistral AI

July 24, 2024
in Technology
Large Enough – Mistral AI
Share on FacebookShare on Twitter

Detailed benchmarks

This latest generation continues to push the boundaries of cost efficiency, speed, and performance. Mistral Large 2 is exposed on la Plateforme and enriched with new features to facilitate building innovative AI applications.

Mistral Large 2

Mistral Large 2 has a 128k context window and supports dozens of languages including French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean, along with 80+ coding languages including Python, Java, C, C++, JavaScript, and Bash.

Mistral Large 2 is designed for single-node inference with long-context applications in mind – its size of 123 billion parameters allows it to run at large throughput on a single node.
We are releasing Mistral Large 2 under the Mistral Research License, that allows usage and modification for research and non-commercial usages. For commercial usage of Mistral Large 2 requiring self-deployment, a Mistral Commercial License must be acquired by contacting us.

General performance

Mistral Large 2 sets a new frontier in terms of performance / cost of serving on evaluation metrics. In particular, on MMLU, the pretrained version achieves an accuracy of 84.0%, and sets a new point on the performance/cost Pareto front of open models.

Code & Reasoning

Following our experience with Codestral 22B and Codestral Mamba, we trained Mistral Large 2 on a very large proportion of code. Mistral Large 2 vastly outperforms the previous Mistral Large, and performs on par with leading models such as GPT-4o, Claude 3 Opus, and Llama 3 405B.

Detailed benchmarks

A significant effort was also devoted to enhancing the model’s reasoning capabilities. One of the key focus areas during training was to minimize the model’s tendency to “hallucinate” or generate plausible-sounding but factually incorrect or irrelevant information. This was achieved by fine-tuning the model to be more cautious and discerning in its responses, ensuring that it provides reliable and accurate outputs.

Additionally, the new Mistral Large 2 is trained to acknowledge when it cannot find solutions or does not have sufficient information to provide a confident answer. This commitment to accuracy is reflected in the improved model performance on popular mathematical benchmarks, demonstrating its enhanced reasoning and problem-solving skills:

Detailed benchmarks

Performance accuracy on code generation benchmarks (all models were benchmarked through the same evaluation pipeline)

Detailed benchmarks

Performance accuracy on MultiPL-E (all models were benchmarked through the same evaluation pipeline, except for the “paper” row)

Detailed benchmarks

Performance accuracy on GSM8K (8-shot) and MATH (0-shot, no CoT) generation benchmarks (all models were benchmarked through the same evaluation pipeline)

Instruction following & Alignment

We drastically improved the instruction-following and conversational capabilities of Mistral Large 2. The new Mistral Large 2 is particularly better at following precise instructions and handling long multi-turn conversations. Below we report the performance on MT-Bench, Wild Bench, and Arena Hard benchmarks:

Detailed benchmarks

Performance on general alignment benchmarks (all models were benchmarked through the same evalutation pipeline)

On some benchmarks, generating lengthy responses tends to improve the scores. However, in many business applications, conciseness is paramount – short model generations facilitate quicker interactions and are more cost-effective for inference. This is why we spent a lot of effort to ensure that generations remain succinct and to the point whenever possible. The graph below reports the average length of generations of different models on questions from the MT Bench benchmark:

MT Bench benchmarks

Language diversity

A large fraction of business use cases today involve working with multilingual documents. While the majority of models are English-centric, the new Mistral Large 2 was trained on a large proportion of multilingual data. In particular, it excels in English, French, German, Spanish, Italian, Portuguese, Dutch, Russian, Chinese, Japanese, Korean, Arabic, and Hindi. Below are the performance results of Mistral Large 2 on the multilingual MMLU benchmark, compared to the previous Mistral Large, Llama 3.1 models, and to Cohere’s Command R+.

Detailed benchmarks

Detailed benchmarks

Performance on Multilingual MMLU (measured on the base pretrained model)

Tool Use & Function Calling

Mistral Large 2 is equipped with enhanced function calling and retrieval skills and has undergone training to proficiently execute both parallel and sequential function calls, enabling it to serve as the power engine of complex business applications.

Detailed benchmarks

Try Mistral Large 2 on la Plateforme

You can use Mistral Large 2 today via la Plateforme under the name mistral-large-2407, and test it on le Chat. It is available under the version 24.07 (a YY.MM versioning system that we are applying to all our models), and the API name mistral-large-2407. Weights for the instruct model are available and are also hosted on HuggingFace.

we are consolidating the offering on la Plateforme around two general purpose models, Mistral Nemo and Mistral Large, and two specialist models, Codestral and Embed. As we progressively deprecate older models on la Plateforme, all Apache models (Mistral 7B, Mixtral 8x7B and 8x22B, Codestral Mamba, Mathstral) remain available for deployment and fine-tuning using our SDK mistral-inference and mistral-finetune.

Starting today, we are extending fine-tuning capabilities on la Plateforme: those are now available for Mistral Large, Mistral Nemo and Codestral.

Access Mistral models through cloud service providers

We are proud to partner with leading cloud service providers to bring the new Mistral Large 2 to a global audience. In particular, today we are expanding our partnership with Google Cloud Platform to bring Mistral AI’s models on Vertex AI via a Managed API. Mistral AI’s best models are now available on Vertex AI, in addition to Azure AI Studio, Amazon Bedrock and IBM watsonx.ai.

Availability timeline of Mistral AI models

Detailed benchmarks

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Hacker News – https://mistral.ai/news/mistral-large-2407/

Tags: Enoughlargetechnology
Previous Post

Browns place RB Nick Chubb on PUP list as he recovers from knee surgery

Next Post

AI models collapse when trained on recursively generated data

SK Life Science Launches Its Second National Ad, Continuing to Lead with the Only Direct-to-Consumer Commercial for an Anti-Seizure Drug in the U.S. – WV News

SK Life Science Unveils Groundbreaking Second National Ad for Pioneering Anti-Seizure Drug!

May 19, 2025
Craving A Great Wine On A Budget? Here’s What To Look For On The Label – Yahoo

Unlocking Affordable Elegance: How to Choose the Perfect Budget Wine!

May 19, 2025
MANTRA and WIN Investments Join Forces to Bring Real-World Sports Assets Onchain – GlobeNewswire

Revolutionizing Sports: MANTRA and WIN Investments Team Up to Bring Real-World Assets Onchain!

May 19, 2025
UK economy grows more than expected: How optimistic should you be? – BBC

Surprising Growth in the UK Economy: What It Means for Your Future!

May 19, 2025
Why Was Kanye West’s South Korea Tour Cancelled? – Yahoo

Unraveling the Mystery: Why Kanye West’s South Korea Tour Was Canceled

May 19, 2025
LCHD shares 2025 data from County Health Rankings and Roadmaps – Tomahawk Leader

Discover the Latest Insights: 2025 County Health Rankings and Roadmaps Unveiled!

May 19, 2025
GOP lawmaker explains how he would communicate a disagreement with Trump – CNN

How a GOP Lawmaker Plans to Navigate Disagreements with Trump

May 19, 2025
JPMorgan Chase plans to spend $18B in technology in 2025 (JPM:NYSE) – Seeking Alpha

JPMorgan Chase Unveils Ambitious $18 Billion Tech Investment Plan for 2025!

May 19, 2025
A courtside lounge? Dynamic ticket pricing? UCLA hopes new sports ventures will pay off – Los Angeles Times

UCLA’s Bold New Sports Ventures: Will Courtside Lounges and Dynamic Ticket Pricing Transform the Game

May 19, 2025

Spatiotemporal assessment of ecological quality and driving mechanisms in the Beijing metropolitan area – Nature

May 19, 2025

Categories

Archives

May 2025
MTWTFSS
 1234
567891011
12131415161718
19202122232425
262728293031 
« Apr    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (620)
  • Economy (634)
  • Entertainment (21,548)
  • General (15,223)
  • Health (9,676)
  • Lifestyle (639)
  • News (22,149)
  • People (636)
  • Politics (642)
  • Science (15,858)
  • Sports (21,144)
  • Technology (15,625)
  • World (624)

Recent News

SK Life Science Launches Its Second National Ad, Continuing to Lead with the Only Direct-to-Consumer Commercial for an Anti-Seizure Drug in the U.S. – WV News

SK Life Science Unveils Groundbreaking Second National Ad for Pioneering Anti-Seizure Drug!

May 19, 2025
Craving A Great Wine On A Budget? Here’s What To Look For On The Label – Yahoo

Unlocking Affordable Elegance: How to Choose the Perfect Budget Wine!

May 19, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version