Friday, June 19, 2026
Earth-News

How Quickly Do Large Language Models Learn Unexpected Skills?

February 14, 2024

Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up: the larger the model, the better it got. But on other tasks, the improvement wasn’t smooth. Performance remained near zero for a while, then jumped. Other studies found similar leaps in ability.

The authors described this as “breakthrough” behavior; other researchers have likened it to a phase transition in physics, like when liquid water freezes into ice. In a paper published in August 2022, researchers noted that these behaviors are not only surprising but unpredictable, and that they should inform the evolving conversations around AI safety, potential and risk. They called the abilities “emergent,” a word that describes collective behaviors that only appear once a system reaches a high level of complexity.

But things may not be so simple. A new paper by a trio of researchers at Stanford University posits that the sudden appearance of these abilities is just a consequence of the way researchers measure the LLM’s performance. The abilities, they argue, are neither unpredictable nor sudden. “The transition is much more predictable than people give it credit for,” said Sanmi Koyejo, a computer scientist at Stanford and the paper’s senior author. “Strong claims of emergence have as much to do with the way we choose to measure as they do with what the models are doing.”

We’re only now seeing and studying this behavior because of how large these models have become. Large language models train by analyzing enormous datasets of text (words from online sources including books, web searches and Wikipedia) and finding links between words that often appear together. A model’s size is measured in parameters, roughly analogous to all the ways that words can be connected. The more parameters, the more connections an LLM can find. GPT-2 had 1.5 billion parameters, while GPT-3.5, the LLM that powers ChatGPT, uses 350 billion. GPT-4, which debuted in March 2023 and now underlies Microsoft Copilot, reportedly uses 1.75 trillion.

That rapid growth has brought an astonishing surge in performance and efficacy, and no one is disputing that large enough LLMs can complete tasks that smaller models can’t, including ones for which they weren’t trained. The trio at Stanford who cast emergence as a “mirage” recognize that LLMs become more effective as they scale up; in fact, the added complexity of larger models should make it possible to get better at more difficult and diverse problems. But they argue that whether this improvement looks smooth and predictable or jagged and sharp results from the choice of metric — or even a paucity of test examples — rather than the model’s inner workings.

Three-digit addition offers an example. In the 2022 BIG-bench study, researchers reported that with fewer parameters, both GPT-3 and another LLM named LaMDA failed to accurately complete addition problems. However, once GPT-3 reached 13 billion parameters, its ability changed as if with the flip of a switch. Suddenly, it could add, and LaMDA could, too, at 68 billion parameters. This suggests that the ability to add emerges at a certain threshold.

But the Stanford researchers point out that the LLMs were judged only on accuracy: Either they could do it perfectly, or they couldn’t. So even if an LLM predicted most of the digits correctly, it failed. That didn’t seem right. If you’re calculating 100 plus 278, then 376 seems like a much more accurate answer than, say, −9.34.

So instead, Koyejo and his collaborators tested the same task using a metric that awards partial credit. “We can ask: How well does it predict the first digit? Then the second? Then the third?” he said.
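
The gap between the two scoring rules can be made concrete with a small sketch. The function names `exact_match` and `digit_accuracy`, and the choice to right-align the two answers before comparing them, are assumptions made for illustration; this is not the metric code used by the Stanford researchers.

```python
def exact_match(predicted: str, target: str) -> float:
    """All-or-nothing score: 1.0 only if the whole answer matches."""
    return 1.0 if predicted == target else 0.0


def digit_accuracy(predicted: str, target: str) -> float:
    """Partial credit: fraction of character positions that match.

    Both strings are padded on the left to the same width so that
    the ones place lines up with the ones place, and so on.
    (This alignment rule is an illustrative assumption.)
    """
    width = max(len(predicted), len(target))
    padded_prediction = predicted.rjust(width)
    padded_target = target.rjust(width)
    matches = sum(p == t for p, t in zip(padded_prediction, padded_target))
    return matches / width


# The article's example: the correct answer to 100 + 278 is 378.
target = "378"
for answer in ["378", "376", "-9.34"]:
    print(answer, exact_match(answer, target), digit_accuracy(answer, target))
```

Under exact match, 376 and -9.34 both score zero. Under the partial-credit rule, 376 gets two of three digits right, while -9.34 matches nothing.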

Koyejo credits the idea for the new work to his graduate student Rylan Schaeffer, who he said noticed that an LLM’s performance seems to change with how its ability is measured. Together with Brando Miranda, another Stanford graduate student, they chose new metrics showing that as parameters increased, the LLMs predicted an increasingly correct sequence of digits in addition problems. This suggests that the ability to add isn’t emergent — meaning that it undergoes a sudden, unpredictable jump — but gradual and predictable. They find that with a different measuring stick, emergence vanishes.
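
One way a smooth underlying trend can look like a sudden jump is sketched below. It assumes each digit of a three-digit answer is right or wrong independently with the same probability, which is a simplification, and the per-digit accuracies are hypothetical values chosen for illustration, not measurements from any model.

```python
def exact_match_rate(per_digit_accuracy: float, n_digits: int = 3) -> float:
    """Chance that every digit is right, assuming digits are independent."""
    return per_digit_accuracy ** n_digits


# Hypothetical per-digit accuracies rising in even steps; not measured data.
hypothetical_accuracies = [0.2, 0.4, 0.6, 0.8, 1.0]

for p in hypothetical_accuracies:
    print(f"per-digit {p:.1f} -> exact-match {exact_match_rate(p):.3f}")
```

In this toy setting the per-digit score climbs steadily, while the all-or-nothing score stays below 7 percent until per-digit accuracy passes 0.4 and then rises steeply. The same underlying improvement looks gradual under one measuring stick and abrupt under the other.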

Copyright for syndicated content belongs to the linked source: Quanta Magazine – https://www.quantamagazine.org/how-quickly-do-large-language-models-learn-unexpected-skills-20240213/

© 2023 earth-news.info
