* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Saturday, April 25, 2026
Earth-News
  • Home
  • Business
  • Entertainment

    How The Cars That Made Us Perfectly Blends Education and Entertainment

    What the controversial Michael Jackson movie leaves out – The Washington Post

    Mini golf, 24/7 golf simulator bring new entertainment to Temple – The Killeen Daily Herald

    Nashoba Symphonic Band Marks 10 Years with Two Exciting Free Concerts

    Los Lorcas and Pat Byrne at Stage 33 Live – Brattleboro Reformer

    Atlanta City Council Greenlights Exciting New World Cup Entertainment District

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    KLP Kapitalforvaltning AS Boosts Investment in Credo Technology Group Holding Ltd. $CRDO

    NSWC Crane Scientist Pioneers Breakthrough in Electromagnetic Spectrum Technology

    Foreign car companies bet on technology to hang onto once-lucrative China auto market – CNBC

    Kalispell Parking Advisory Board Proposes New Technology, Increased Fines, and Block Ordinance Changes

    The Surprising Ways Your Daily Habits Are Destroying Your Charging Cables

    Redwire Becomes Proud Drone Technology Partner of the Washington Commanders to Showcase Military Appreciation – Washington Commanders

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment

    How The Cars That Made Us Perfectly Blends Education and Entertainment

    What the controversial Michael Jackson movie leaves out – The Washington Post

    Mini golf, 24/7 golf simulator bring new entertainment to Temple – The Killeen Daily Herald

    Nashoba Symphonic Band Marks 10 Years with Two Exciting Free Concerts

    Los Lorcas and Pat Byrne at Stage 33 Live – Brattleboro Reformer

    Atlanta City Council Greenlights Exciting New World Cup Entertainment District

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    KLP Kapitalforvaltning AS Boosts Investment in Credo Technology Group Holding Ltd. $CRDO

    NSWC Crane Scientist Pioneers Breakthrough in Electromagnetic Spectrum Technology

    Foreign car companies bet on technology to hang onto once-lucrative China auto market – CNBC

    Kalispell Parking Advisory Board Proposes New Technology, Increased Fines, and Block Ordinance Changes

    The Surprising Ways Your Daily Habits Are Destroying Your Charging Cables

    Redwire Becomes Proud Drone Technology Partner of the Washington Commanders to Showcase Military Appreciation – Washington Commanders

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Science

How Quickly Do Large Language Models Learn Unexpected Skills?

February 14, 2024
in Science
How Quickly Do Large Language Models Learn Unexpected Skills?
Share on FacebookShare on Twitter

Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up — the larger the model, the better it got. But with other tasks, the jump in ability wasn’t smooth. The performance remained near zero for a while, then performance jumped. Other studies found similar leaps in ability.

The authors described this as “breakthrough” behavior; other researchers have likened it to a phase transition in physics, like when liquid water freezes into ice. In a paper published in August 2022, researchers noted that these behaviors are not only surprising but unpredictable, and that they should inform the evolving conversations around AI safety, potential and risk. They called the abilities “emergent,” a word that describes collective behaviors that only appear once a system reaches a high level of complexity.

But things may not be so simple. A new paper by a trio of researchers at Stanford University posits that the sudden appearance of these abilities is just a consequence of the way researchers measure the LLM’s performance. The abilities, they argue, are neither unpredictable nor sudden. “The transition is much more predictable than people give it credit for,” said Sanmi Koyejo, a computer scientist at Stanford and the paper’s senior author. “Strong claims of emergence have as much to do with the way we choose to measure as they do with what the models are doing.”

We’re only now seeing and studying this behavior because of how large these models have become. Large language models train by analyzing enormous datasets of text — words from online sources including books, web searches and Wikipedia — and finding links between words that often appear together. The size is measured in terms of parameters, roughly analogous to all the ways that words can be connected. The more parameters, the more connections an LLM can find. GPT-2 had 1.5 billion parameters, while GPT-3.5, the LLM that powers ChatGPT, uses 350 billion. GPT-4, which debuted in March 2023 and now underlies Microsoft Copilot, reportedly uses 1.75 trillion.

That rapid growth has brought an astonishing surge in performance and efficacy, and no one is disputing that large enough LLMs can complete tasks that smaller models can’t, including ones for which they weren’t trained. The trio at Stanford who cast emergence as a “mirage” recognize that LLMs become more effective as they scale up; in fact, the added complexity of larger models should make it possible to get better at more difficult and diverse problems. But they argue that whether this improvement looks smooth and predictable or jagged and sharp results from the choice of metric — or even a paucity of test examples — rather than the model’s inner workings.

Three-digit addition offers an example. In the 2022 BIG-bench study, researchers reported that with fewer parameters, both GPT-3 and another LLM named LAMDA failed to accurately complete addition problems. However, when GPT-3 trained using 13 billion parameters, its ability changed as if with the flip of a switch. Suddenly, it could add — and LAMDA could, too, at 68 billion parameters. This suggests that the ability to add emerges at a certain threshold.

But the Stanford researchers point out that the LLMs were judged only on accuracy: Either they could do it perfectly, or they couldn’t. So even if an LLM predicted most of the digits correctly, it failed. That didn’t seem right. If you’re calculating 100 plus 278, then 376 seems like a much more accurate answer than, say, −9.34.

So instead, Koyejo and his collaborators tested the same task using a metric that awards partial credit. “We can ask: How well does it predict the first digit? Then the second? Then the third?” he said.

Koyejo credits the idea for the new work to his graduate student Rylan Schaeffer, who he said noticed that an LLM’s performance seems to change with how its ability is measured. Together with Brando Miranda, another Stanford graduate student, they chose new metrics showing that as parameters increased, the LLMs predicted an increasingly correct sequence of digits in addition problems. This suggests that the ability to add isn’t emergent — meaning that it undergoes a sudden, unpredictable jump — but gradual and predictable. They find that with a different measuring stick, emergence vanishes.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Quanta Magazine – https://www.quantamagazine.org/how-quickly-do-large-language-models-learn-unexpected-skills-20240213/

Tags: largequickly'science
Previous Post

Mpumalanga teacher fired for sexually harassing five pupils and colleague

Next Post

“Goldilocks” Element – Scientists Make Key Advance for Capturing Carbon From the Air

Sen. Jack Whitver Delivers Emotional Farewell Speech in Iowa Politics

April 25, 2026

KLP Kapitalforvaltning AS Boosts Investment in Credo Technology Group Holding Ltd. $CRDO

April 25, 2026

Jermod McCoy’s Dramatic NFL Draft Slide: The Hidden Knee Injury That Changed Everything

April 25, 2026

Hey Kids! Dive into Fun and Help Create a Clean Water Coloring Book!

April 25, 2026

Who’s the Bigger Gold Digger: Men or Women? Science Finally Reveals the Truth

April 25, 2026

Delving into the Ethics of Longevity Science: A Thought-Provoking Exploration

April 25, 2026

How Phones Secretly Impact Our Mental Health-Even Without Social Media

April 25, 2026

Bigfoot: Unveiling the Ultimate Master of Hide-and-Seek

April 25, 2026

Six of Seven Former World Teamers Advance to Men’s Freestyle Finals as Davino Defeats Forrest in NCAA Finals Rematch

April 25, 2026

Can Trump Navigate the Iran Crisis While Battling a Slumping Economy?

April 25, 2026

Categories

Archives

April 2026
M T W T F S S
 12345
6789101112
13141516171819
20212223242526
27282930  
« Mar    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (1,185)
  • Economy (1,205)
  • Entertainment (22,080)
  • General (21,161)
  • Health (10,237)
  • Lifestyle (1,215)
  • News (22,149)
  • People (1,205)
  • Politics (1,225)
  • Science (16,420)
  • Sports (21,704)
  • Technology (16,190)
  • World (1,195)

Recent News

Sen. Jack Whitver Delivers Emotional Farewell Speech in Iowa Politics

April 25, 2026

KLP Kapitalforvaltning AS Boosts Investment in Credo Technology Group Holding Ltd. $CRDO

April 25, 2026
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version