* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Saturday, November 22, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

    The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

    Star Panel to Consider Moviegoing in an Evolving Marketplace – Noozhawk

    Star Panel to Explore the Future of Moviegoing in a Changing Marketplace

    Mattel makes another bold family-entertainment move beyond toys – TheStreet

    Mattel makes another bold family-entertainment move beyond toys – TheStreet

    Themed Entertainment Association announces 32nd annual Thea Award recipients – InPark Magazine

    Themed Entertainment Association announces 32nd annual Thea Award recipients – InPark Magazine

    American Legion Hall celebrates Veterans with night of entertainment – Bethany Republican-Clipper

    American Legion Hall celebrates Veterans with night of entertainment – Bethany Republican-Clipper

    Liev Schreiber ‘cleared to return to work’ after weekend hospitalization, rep confirms – Los Angeles Times

    Liev Schreiber ‘cleared to return to work’ after weekend hospitalization, rep confirms – Los Angeles Times

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    First Partner Jennifer Siebel Newsom leads Gender Equity Summit on technology and well-being – California State Portal | CA.gov

    First Partner Jennifer Siebel Newsom Champions Gender Equity at Technology and Well-Being Summit

    MACo’s Inaugural Information Technology Conference – Maryland Association of Counties

    Maryland’s Inaugural Information Technology Conference Sparks a New Era of County Innovation

    F&I Sentinel Recognized on the 2025 Deloitte Technology Fast 500™ for the Second Consecutive Year – PR Newswire

    F&I Sentinel Achieves Back-to-Back Honors on the 2025 Deloitte Technology Fast 500™

    Keeping up with new technology – The Clinton Chronicle

    Stay Ahead of the Curve: Master the Hottest Technology Trends Today

    How hybrid technology supports sustainable driving – AZ Big Media

    How Hybrid Technology is Powering the Future of Sustainable Transportation

    Mid-Atlantic Technology Summit 2025 showcases next-gen tools for first responders – FireRescue1

    Mid-Atlantic Technology Summit 2025 Reveals Game-Changing Tools Empowering First Responders

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

    The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

    Star Panel to Consider Moviegoing in an Evolving Marketplace – Noozhawk

    Star Panel to Explore the Future of Moviegoing in a Changing Marketplace

    Mattel makes another bold family-entertainment move beyond toys – TheStreet

    Mattel makes another bold family-entertainment move beyond toys – TheStreet

    Themed Entertainment Association announces 32nd annual Thea Award recipients – InPark Magazine

    Themed Entertainment Association announces 32nd annual Thea Award recipients – InPark Magazine

    American Legion Hall celebrates Veterans with night of entertainment – Bethany Republican-Clipper

    American Legion Hall celebrates Veterans with night of entertainment – Bethany Republican-Clipper

    Liev Schreiber ‘cleared to return to work’ after weekend hospitalization, rep confirms – Los Angeles Times

    Liev Schreiber ‘cleared to return to work’ after weekend hospitalization, rep confirms – Los Angeles Times

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    First Partner Jennifer Siebel Newsom leads Gender Equity Summit on technology and well-being – California State Portal | CA.gov

    First Partner Jennifer Siebel Newsom Champions Gender Equity at Technology and Well-Being Summit

    MACo’s Inaugural Information Technology Conference – Maryland Association of Counties

    Maryland’s Inaugural Information Technology Conference Sparks a New Era of County Innovation

    F&I Sentinel Recognized on the 2025 Deloitte Technology Fast 500™ for the Second Consecutive Year – PR Newswire

    F&I Sentinel Achieves Back-to-Back Honors on the 2025 Deloitte Technology Fast 500™

    Keeping up with new technology – The Clinton Chronicle

    Stay Ahead of the Curve: Master the Hottest Technology Trends Today

    How hybrid technology supports sustainable driving – AZ Big Media

    How Hybrid Technology is Powering the Future of Sustainable Transportation

    Mid-Atlantic Technology Summit 2025 showcases next-gen tools for first responders – FireRescue1

    Mid-Atlantic Technology Summit 2025 Reveals Game-Changing Tools Empowering First Responders

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Technology

Study claims ChatGPT is losing capability, but some experts aren’t convinced

July 20, 2023
in Technology
Study claims ChatGPT is losing capability, but some experts aren’t convinced
Share on FacebookShare on Twitter

A shaky toy robot on a multicolor background.

Benj Edwards / Getty Images

On Tuesday, researchers from Stanford University and University of California, Berkeley published a research paper that purports to show changes in GPT-4’s outputs over time. The paper fuels a common-but-unproven belief that the AI language model has grown worse at coding and compositional tasks over the past few months. Some experts aren’t convinced by the results, but they say that the lack of certainty points to a larger problem with how OpenAI handles its model releases.

In a study titled “How Is ChatGPT’s Behavior Changing over Time?” published on arXiv, Lingjiao Chen, Matei Zaharia, and James Zou, cast doubt on the consistent performance of OpenAI’s large language models (LLMs), specifically GPT-3.5 and GPT-4. Using API access, they tested the March and June 2023 versions of these models on tasks like math problem-solving, answering sensitive questions, code generation, and visual reasoning. Most notably, GPT-4’s ability to identify prime numbers reportedly plunged dramatically from an accuracy of 97.6 percent in March to just 2.4 percent in June. Strangely, GPT-3.5 showed improved performance in the same period.

Performance of the March 2023 and June 2023 versions of GPT-4 and GPT-3.5 on four tasks, taken from

Enlarge / Performance of the March 2023 and June 2023 versions of GPT-4 and GPT-3.5 on four tasks, taken from “How Is ChatGPT’s Behavior Changing over Time?”

Chen/Zaharia/Zou

This study comes on the heels of people frequently complaining that GPT-4 has subjectively declined in performance over the past few months. Popular theories about why include OpenAI “distilling” models to reduce their computational overhead in a quest to speed up the output and save GPU resources, fine-tuning (additional training) to reduce harmful outputs that may have unintended effects, and a smattering of unsupported conspiracy theories such as OpenAI reducing GPT-4’s coding capabilities so more people will pay for GitHub Copilot.

Meanwhile, OpenAI has consistently denied any claims that GPT-4 has decreased in capability. As recently as last Thursday, OpenAI VP of Product Peter Welinder tweeted, “No, we haven’t made GPT-4 dumber. Quite the opposite: we make each new version smarter than the previous one. Current hypothesis: When you use it more heavily, you start noticing issues you didn’t see before.”

While this new study may appear like a smoking gun to prove the hunches of the GPT-4 critics, others say not so fast. Princeton computer science professor Arvind Narayanan thinks that its findings don’t conclusively prove a decline in GPT-4’s performance and are potentially consistent with fine-tuning adjustments made by OpenAI. For example, in terms of measuring code generation capabilities, he criticized the study for evaluating the immediacy of the code’s ability to be executed rather than its correctness.

“The change they report is that the newer GPT-4 adds non-code text to its output. They don’t evaluate the correctness of the code (strange),” he tweeted. “They merely check if the code is directly executable. So the newer model’s attempt to be more helpful counted against it.”

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Ars Technica – https://arstechnica.com/?p=1954989

Tags: claimsstudytechnology
Previous Post

Google’s new security pilot program will ban employee Internet access

Next Post

Unity’s visionOS support has started to roll out—here’s how it works

WATCH: Lawmakers ask WA to ‘show your work’ in lawsuit over failure to release climate data – The Center Square

WATCH: Lawmakers Demand WA ‘Show Your Work’ in Lawsuit Over Withheld Climate Data

November 22, 2025
SCIENCE AT THE CROSSROADS | The Contradictory Future of NASA Amid a Transfer of Power – The Hoya

Science at the Crossroads: Navigating NASA’s Uncertain Future Amid Leadership Change

November 22, 2025
Parakeets teach a lesson in friendship – University of Cincinnati

Parakeets teach a lesson in friendship – University of Cincinnati

November 22, 2025
After Total Hip Replacement, Utah Patient Gets Back to Active Lifestyle in No Time – University of Utah Health

After Total Hip Replacement, Utah Patient Gets Back to Active Lifestyle in No Time – University of Utah Health

November 22, 2025
First Partner Jennifer Siebel Newsom leads Gender Equity Summit on technology and well-being – California State Portal | CA.gov

First Partner Jennifer Siebel Newsom Champions Gender Equity at Technology and Well-Being Summit

November 22, 2025
2025 Big Ten Championship Game scenarios: Tiebreakers, paths for Ohio State, Oregon, USC, Michigan, Indiana – CBS Sports

2025 Big Ten Championship Showdown: How Ohio State, Oregon, USC, Michigan, and Indiana Can Secure Victory

November 22, 2025
A New Era: The World Fencing League Makes Global Debut in April 2026 – Sports Video Group

A New Era: The World Fencing League Makes Global Debut in April 2026 – Sports Video Group

November 21, 2025
Economy added 119K jobs as unemployment ticked up in September; BLS cancels October jobs report – McKnight’s Senior Living

Economy added 119K jobs as unemployment ticked up in September; BLS cancels October jobs report – McKnight’s Senior Living

November 21, 2025
The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

The Surprising Studio Ghibli Film That Influenced Netflix’s Train Dreams [Exclusive] – Yahoo

November 21, 2025
Partisanship Is Poisoning Public Health – Scientific American

How Partisan Politics Are Putting Public Health at Risk

November 21, 2025

Categories

Archives

November 2025
M T W T F S S
 12
3456789
10111213141516
17181920212223
24252627282930
« Oct    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (931)
  • Economy (950)
  • Entertainment (21,825)
  • General (18,317)
  • Health (9,990)
  • Lifestyle (961)
  • News (22,149)
  • People (955)
  • Politics (963)
  • Science (16,164)
  • Sports (21,451)
  • Technology (15,931)
  • World (937)

Recent News

WATCH: Lawmakers ask WA to ‘show your work’ in lawsuit over failure to release climate data – The Center Square

WATCH: Lawmakers Demand WA ‘Show Your Work’ in Lawsuit Over Withheld Climate Data

November 22, 2025
SCIENCE AT THE CROSSROADS | The Contradictory Future of NASA Amid a Transfer of Power – The Hoya

Science at the Crossroads: Navigating NASA’s Uncertain Future Amid Leadership Change

November 22, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version