* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Monday, July 14, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

    Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

    Review: At the Huntington, the New Hollywood String Quartet recalls legendary studio musicians – Los Angeles Times

    Review: At the Huntington, the New Hollywood String Quartet recalls legendary studio musicians – Los Angeles Times

    Kehoe repeals paid sick leave, allows several counties in the Ozarks to have entertainment districts in bill signings – KY3

    Kehoe repeals paid sick leave, allows several counties in the Ozarks to have entertainment districts in bill signings – KY3

    Emily Deschanel was scolded during “Bones” season 1 for being ‘late and unprepared’: ‘I was just beside myself’ – Yahoo

    Emily Deschanel was scolded during “Bones” season 1 for being ‘late and unprepared’: ‘I was just beside myself’ – Yahoo

    How you can see new movies early – Yahoo

    Unlock the Secret to Watching New Movies Before Everyone Else!

    Immersive sports and entertainment venue Cosm set to build its 5th location in Cleveland – WKYC

    Cosm Reveals Exciting Vision for Its 5th Immersive Sports and Entertainment Venue in Cleveland

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Sentrycs’ Cyber Over RF technology integrated into Rafael’s combat-proven Drone Dome system – Defence Industry Europe

    Sentrycs’ Cyber Over RF Technology Boosts Rafael’s Battle-Tested Drone Dome System

    Nordic Air Defence raises $3 million to expand operations and advance drone defence technology – Defence Industry Europe

    Nordic Air Defence Lands $3 Million to Transform Drone Defense and Supercharge Operations

    China’s energy dominance in three charts – MIT Technology Review

    How China Is Powering Its Energy Dominance: A Visual Breakdown

    Meta Acquires AI Startup PlayAI to Enhance Voice Technology Capa – GuruFocus

    Meta Acquires AI Startup PlayAI to Revolutionize Voice Technology Capabilities

    Stallion Uranium Provides Update on Technology Data Acquisition Agreement – GlobeNewswire

    Stallion Uranium Announces Exciting Progress in Technology Data Acquisition Agreement

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

    Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

    Review: At the Huntington, the New Hollywood String Quartet recalls legendary studio musicians – Los Angeles Times

    Review: At the Huntington, the New Hollywood String Quartet recalls legendary studio musicians – Los Angeles Times

    Kehoe repeals paid sick leave, allows several counties in the Ozarks to have entertainment districts in bill signings – KY3

    Kehoe repeals paid sick leave, allows several counties in the Ozarks to have entertainment districts in bill signings – KY3

    Emily Deschanel was scolded during “Bones” season 1 for being ‘late and unprepared’: ‘I was just beside myself’ – Yahoo

    Emily Deschanel was scolded during “Bones” season 1 for being ‘late and unprepared’: ‘I was just beside myself’ – Yahoo

    How you can see new movies early – Yahoo

    Unlock the Secret to Watching New Movies Before Everyone Else!

    Immersive sports and entertainment venue Cosm set to build its 5th location in Cleveland – WKYC

    Cosm Reveals Exciting Vision for Its 5th Immersive Sports and Entertainment Venue in Cleveland

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Sentrycs’ Cyber Over RF technology integrated into Rafael’s combat-proven Drone Dome system – Defence Industry Europe

    Sentrycs’ Cyber Over RF Technology Boosts Rafael’s Battle-Tested Drone Dome System

    Nordic Air Defence raises $3 million to expand operations and advance drone defence technology – Defence Industry Europe

    Nordic Air Defence Lands $3 Million to Transform Drone Defense and Supercharge Operations

    China’s energy dominance in three charts – MIT Technology Review

    How China Is Powering Its Energy Dominance: A Visual Breakdown

    Meta Acquires AI Startup PlayAI to Enhance Voice Technology Capa – GuruFocus

    Meta Acquires AI Startup PlayAI to Revolutionize Voice Technology Capabilities

    Stallion Uranium Provides Update on Technology Data Acquisition Agreement – GlobeNewswire

    Stallion Uranium Announces Exciting Progress in Technology Data Acquisition Agreement

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    2025 WE Local Prague Recap: Inspiring Women in Engineering and Technology – Society of Women Engineers

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

China’s DeepSeek Coder becomes first open-source coding model to beat GPT-4 Turbo

June 18, 2024
in Technology
China’s DeepSeek Coder becomes first open-source coding model to beat GPT-4 Turbo
Share on FacebookShare on Twitter

It’s time to celebrate the incredible women leading the way in AI! Nominate your inspiring leaders for VentureBeat’s Women in AI Awards today before June 18. Learn More

Chinese AI startup DeepSeek, which previously made headlines with a ChatGPT competitor trained on 2 trillion English and Chinese tokens, has announced the release of DeepSeek Coder V2, an open-source mixture of experts (MoE) code language model.

Built upon DeepSeek-V2, an MoE model that debuted last month, DeepSeek Coder V2 excels at both coding and math tasks. It supports more than 300 programming languages and outperforms state-of-the-art closed-source models, including GPT-4 Turbo, Claude 3 Opus and Gemini 1.5 Pro. The company claims this is the first time an open model has achieved this feat, sitting way ahead of Llama 3-70B and other models in the category.

It also notes that DeepSeek Coder V2 maintains comparable performance in terms of general reasoning and language capabilities. 

What does DeepSeek Coder V2 bring to the table?

Founded last year with a mission to “unravel the mystery of AGI with curiosity,” DeepSeek has been a notable Chinese player in the AI race, joining the likes of Qwen, 01.AI and Baidu. In fact, within a year of its launch, the company has already open-sourced a bunch of models, including the DeepSeek Coder family.

VB Transform 2024 Registration is Open

Join enterprise leaders in San Francisco from July 9 to 11 for our flagship AI event. Connect with peers, explore the opportunities and challenges of Generative AI, and learn how to integrate AI applications into your industry. Register Now

The original DeepSeek Coder, with up to 33 billion parameters, did decently on benchmarks with capabilities like project-level code completion and infilling, but only supported 86 programming languages and a context window of 16K. The new V2 offering builds on that work, expanding language support to 338 and context window to 128K – enabling it to handle more complex and extensive coding tasks.

When tested on MBPP+, HumanEval, and Aider benchmarks, designed to evaluate code generation, editing and problem-solving capabilities of LLMs, DeepSeek Coder V2 scored 76.2, 90.2, and 73.7, respectively — sitting ahead of most closed and open-source models, including GPT-4 Turbo, Claude 3 Opus, Gemini 1.5 Pro, Codestral and Llama-3 70B. Similar performance was seen across benchmarks designed to assess the model’s mathematical capabilities (MATH and GSM8K). 

The only model that managed to outperform DeepSeek’s offering across multiple benchmarks was GPT-4o, which obtained marginally higher scores in HumanEval, LiveCode Bench, MATH and GSM8K.

DeepSeek says it achieved these technical and performance advances by using DeepSeek V2, which is based on its Mixture of Experts framework, as a foundation. Essentially, the company pre-trained the base V2 model on an additional dataset of 6 trillion tokens – largely comprising code and math-related data sourced from GitHub and CommonCrawl.

This enables the model, which comes with 16B and 236B parameter options, to activate only 2.4B and 21B “expert” parameters to address the tasks at hand while also optimizing for diverse computing and application needs. 

Strong performance in general language, reasoning

In addition to excelling at coding and math-related tasks, DeepSeek Coder V2 also delivers decent performance in general reasoning and language understanding tasks. 

For instance, in the MMLU benchmark designed to evaluate language understanding across multiple tasks, it scored 79.2. This is way better than other code-specific models and nearly similar to the score of Llama-3 70B. GPT-4o and Claude 3 Opus, on their part, continue to lead the MMLU category with scores of 88.7 and 88.6, respectively. Meanwhile, GPT-4 Turbo follows closely behind.

The development shows open coding-specific models are finally excelling across the spectrum (not just their core use cases) and closing in on state-of-the-art closed-source models.

One of the most impressive teams in generative AI and open source killing it again!

The technical papers are amongst the best out there and performance has been exceptional from the final models with permissive licenses.

Great to see, everyone should try the 16b version ? https://t.co/lmggkEgj2n

— Emad (@EMostaque) June 17, 2024

As of now, DeepSeek Coder V2 is being offered under a MIT license, which allows for both research and unrestricted commercial use. Users can download both 16B and 236B sizes in instruct and base avatars via Hugging Face. Alternatively, the company is also providing access to the models via API through its platform under a pay-as-you-go model. 

For those who want to test out the capabilities of the models first, the company is offering the option to interact. with Deepseek Coder V2 via chatbot. 

VB Daily

Stay in the know! Get the latest news in your inbox daily

By subscribing, you agree to VentureBeat’s Terms of Service.

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : VentureBeat – https://venturebeat.com/ai/chinas-deepseek-coder-becomes-first-open-source-coding-model-to-beat-gpt-4-turbo/

Tags: China’sDeepSeektechnology
Previous Post

Runway’s co-founder and CTO says Gen-3 Alpha coming in ‘days’ starting with paid subscribers

Next Post

Anthropic’s red team methods are a needed step to close AI security gaps

Canids as pollinators? Nectar foraging by Ethiopian wolves may contribute to the pollination of Kniphofia foliosa – ESA Journals

Could Ethiopian Wolves Be Unexpected Pollinators of Kniphofia foliosa?

July 14, 2025
Guest Opinion: Science is stronger with robust federal funding – Palo Alto Online

Why Strong Federal Funding is Essential for Advancing Science

July 14, 2025
Weight loss may ‘rejuvenate’ fat tissues, clearing away aged cells – Live Science

Weight Loss Could ‘Rejuvenate’ Fat Tissue by Clearing Out Old Cells

July 14, 2025
If your goal is to glow up, say goodbye to these 10 daily decisions – VegOut

10 Daily Habits to Ditch Now for a Stunning Glow-Up

July 14, 2025
‘We’ve never seen a team do this to PSG’ – how Chelsea won Club World Cup – BBC

Unbelievable Comeback: How Chelsea Shocked PSG to Clinch the Club World Cup!

July 14, 2025
India will become $10 trillion economy over next decade, GCCs to contribute $0.5 trillion – The Economic Times

India Poised to Become a $10 Trillion Economy Within a Decade, Powered by GCCs Driving $0.5 Trillion Growth

July 14, 2025
Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

Entertainment Business Master’s Grad Launched Nonprofit to Nurture Emerging Artists – Full Sail University

July 14, 2025
11 lessons for health tech startups from one of UpToDate’s creators – STAT

11 Essential Lessons for Health Tech Startups from a Leading Industry Innovator

July 14, 2025
Trump Threatens to Strip Rosie O’Donnell of U.S. Citizenship – The New York Times

Trump Threatens to Revoke Rosie O’Donnell’s U.S. Citizenship in Shocking Move

July 14, 2025
Sentrycs’ Cyber Over RF technology integrated into Rafael’s combat-proven Drone Dome system – Defence Industry Europe

Sentrycs’ Cyber Over RF Technology Boosts Rafael’s Battle-Tested Drone Dome System

July 14, 2025

Categories

Archives

July 2025
MTWTFSS
 123456
78910111213
14151617181920
21222324252627
28293031 
« Jun    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (721)
  • Economy (743)
  • Entertainment (21,631)
  • General (15,891)
  • Health (9,781)
  • Lifestyle (751)
  • News (22,149)
  • People (745)
  • Politics (754)
  • Science (15,962)
  • Sports (21,241)
  • Technology (15,728)
  • World (727)

Recent News

Canids as pollinators? Nectar foraging by Ethiopian wolves may contribute to the pollination of Kniphofia foliosa – ESA Journals

Could Ethiopian Wolves Be Unexpected Pollinators of Kniphofia foliosa?

July 14, 2025
Guest Opinion: Science is stronger with robust federal funding – Palo Alto Online

Why Strong Federal Funding is Essential for Advancing Science

July 14, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version