* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Tuesday, September 23, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Caesars Entertainment (CZR): Assessing Valuation After Times Square Casino Setback and Mounting Investor Concerns – simplywall.st

    Caesars Entertainment Faces Times Square Casino Hurdles as Investor Concerns Mount

    Why Hilaria Baldwin Has Found the ‘DWTS’ Process ‘Embarrassing’ At Times – WFXG

    Hilaria Baldwin Opens Up About the Embarrassing Moments on Her ‘DWTS’ Journey

    Harvest Fest 2025 – yadkinripple.com

    Celebrate the Bounty: Harvest Fest 2025 is Coming!

    Fox News Entertainment Newsletter: Kate Middleton stuns during Trump state visit, Brett James dead at 57 – Fox News

    Kate Middleton Stuns During Trump State Visit; Remembering Brett James at 57

    Lara Beitz to headline Oshkosh show with top comedians at Time Community Theater Sept. 27 – Yahoo

    Lara Beitz to Headline Star-Studded Oshkosh Comedy Night on September 27

    Shakespeare (with a twist) in Grand Junction – Yahoo

    Experience Shakespeare Like Never Before in Grand Junction

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Agentic AI and the future of work: navigating technological promise and the risk of increased automation – Equal Times

    Agentic AI and the Future of Work: Embracing Innovation While Navigating Automation Challenges

    Technology alliance introduces system for stable recycling quality – RECYCLING magazine

    Innovative Technology Alliance Unveils Breakthrough System for Consistent Recycling Quality

    Pepper Pike council considers upgrading technology for streaming meetings, remote meeting participation – Cleveland.com

    Pepper Pike Council Explores Upgrading Technology for Enhanced Streaming and Remote Participation

    How Michelin Uses Technology to Rethink Tire Manufacturing: Interview – Motor1.com

    How Michelin’s Tech-Driven Revolution Is Transforming Tire Manufacturing

    Analysts Offer Insights on Technology Companies: Avnet (AVT), Nvidia (NVDA) and Atlassian (TEAM) – The Globe and Mail

    Experts Share Key Insights on Avnet, Nvidia, and Atlassian’s Future Prospects

    Top Technology Executives Recognized at the 2025 Carolina CIO ORBIE Awards – Yahoo Finance

    Celebrating Excellence: Top Technology Executives Honored at the 2025 Carolina CIO ORBIE Awards

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Caesars Entertainment (CZR): Assessing Valuation After Times Square Casino Setback and Mounting Investor Concerns – simplywall.st

    Caesars Entertainment Faces Times Square Casino Hurdles as Investor Concerns Mount

    Why Hilaria Baldwin Has Found the ‘DWTS’ Process ‘Embarrassing’ At Times – WFXG

    Hilaria Baldwin Opens Up About the Embarrassing Moments on Her ‘DWTS’ Journey

    Harvest Fest 2025 – yadkinripple.com

    Celebrate the Bounty: Harvest Fest 2025 is Coming!

    Fox News Entertainment Newsletter: Kate Middleton stuns during Trump state visit, Brett James dead at 57 – Fox News

    Kate Middleton Stuns During Trump State Visit; Remembering Brett James at 57

    Lara Beitz to headline Oshkosh show with top comedians at Time Community Theater Sept. 27 – Yahoo

    Lara Beitz to Headline Star-Studded Oshkosh Comedy Night on September 27

    Shakespeare (with a twist) in Grand Junction – Yahoo

    Experience Shakespeare Like Never Before in Grand Junction

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Agentic AI and the future of work: navigating technological promise and the risk of increased automation – Equal Times

    Agentic AI and the Future of Work: Embracing Innovation While Navigating Automation Challenges

    Technology alliance introduces system for stable recycling quality – RECYCLING magazine

    Innovative Technology Alliance Unveils Breakthrough System for Consistent Recycling Quality

    Pepper Pike council considers upgrading technology for streaming meetings, remote meeting participation – Cleveland.com

    Pepper Pike Council Explores Upgrading Technology for Enhanced Streaming and Remote Participation

    How Michelin Uses Technology to Rethink Tire Manufacturing: Interview – Motor1.com

    How Michelin’s Tech-Driven Revolution Is Transforming Tire Manufacturing

    Analysts Offer Insights on Technology Companies: Avnet (AVT), Nvidia (NVDA) and Atlassian (TEAM) – The Globe and Mail

    Experts Share Key Insights on Avnet, Nvidia, and Atlassian’s Future Prospects

    Top Technology Executives Recognized at the 2025 Carolina CIO ORBIE Awards – Yahoo Finance

    Celebrating Excellence: Top Technology Executives Honored at the 2025 Carolina CIO ORBIE Awards

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Science

How Quickly Do Large Language Models Learn Unexpected Skills?

February 14, 2024
in Science
How Quickly Do Large Language Models Learn Unexpected Skills?
Share on FacebookShare on Twitter

Two years ago, in a project called the Beyond the Imitation Game benchmark, or BIG-bench, 450 researchers compiled a list of 204 tasks designed to test the capabilities of large language models, which power chatbots like ChatGPT. On most tasks, performance improved predictably and smoothly as the models scaled up — the larger the model, the better it got. But with other tasks, the jump in ability wasn’t smooth. The performance remained near zero for a while, then performance jumped. Other studies found similar leaps in ability.

The authors described this as “breakthrough” behavior; other researchers have likened it to a phase transition in physics, like when liquid water freezes into ice. In a paper published in August 2022, researchers noted that these behaviors are not only surprising but unpredictable, and that they should inform the evolving conversations around AI safety, potential and risk. They called the abilities “emergent,” a word that describes collective behaviors that only appear once a system reaches a high level of complexity.

But things may not be so simple. A new paper by a trio of researchers at Stanford University posits that the sudden appearance of these abilities is just a consequence of the way researchers measure the LLM’s performance. The abilities, they argue, are neither unpredictable nor sudden. “The transition is much more predictable than people give it credit for,” said Sanmi Koyejo, a computer scientist at Stanford and the paper’s senior author. “Strong claims of emergence have as much to do with the way we choose to measure as they do with what the models are doing.”

We’re only now seeing and studying this behavior because of how large these models have become. Large language models train by analyzing enormous datasets of text — words from online sources including books, web searches and Wikipedia — and finding links between words that often appear together. The size is measured in terms of parameters, roughly analogous to all the ways that words can be connected. The more parameters, the more connections an LLM can find. GPT-2 had 1.5 billion parameters, while GPT-3.5, the LLM that powers ChatGPT, uses 350 billion. GPT-4, which debuted in March 2023 and now underlies Microsoft Copilot, reportedly uses 1.75 trillion.

That rapid growth has brought an astonishing surge in performance and efficacy, and no one is disputing that large enough LLMs can complete tasks that smaller models can’t, including ones for which they weren’t trained. The trio at Stanford who cast emergence as a “mirage” recognize that LLMs become more effective as they scale up; in fact, the added complexity of larger models should make it possible to get better at more difficult and diverse problems. But they argue that whether this improvement looks smooth and predictable or jagged and sharp results from the choice of metric — or even a paucity of test examples — rather than the model’s inner workings.

Three-digit addition offers an example. In the 2022 BIG-bench study, researchers reported that with fewer parameters, both GPT-3 and another LLM named LAMDA failed to accurately complete addition problems. However, when GPT-3 trained using 13 billion parameters, its ability changed as if with the flip of a switch. Suddenly, it could add — and LAMDA could, too, at 68 billion parameters. This suggests that the ability to add emerges at a certain threshold.

But the Stanford researchers point out that the LLMs were judged only on accuracy: Either they could do it perfectly, or they couldn’t. So even if an LLM predicted most of the digits correctly, it failed. That didn’t seem right. If you’re calculating 100 plus 278, then 376 seems like a much more accurate answer than, say, −9.34.

So instead, Koyejo and his collaborators tested the same task using a metric that awards partial credit. “We can ask: How well does it predict the first digit? Then the second? Then the third?” he said.

Koyejo credits the idea for the new work to his graduate student Rylan Schaeffer, who he said noticed that an LLM’s performance seems to change with how its ability is measured. Together with Brando Miranda, another Stanford graduate student, they chose new metrics showing that as parameters increased, the LLMs predicted an increasingly correct sequence of digits in addition problems. This suggests that the ability to add isn’t emergent — meaning that it undergoes a sudden, unpredictable jump — but gradual and predictable. They find that with a different measuring stick, emergence vanishes.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : Quanta Magazine – https://www.quantamagazine.org/how-quickly-do-large-language-models-learn-unexpected-skills-20240213/

Tags: largequickly'science
Previous Post

Mpumalanga teacher fired for sexually harassing five pupils and colleague

Next Post

“Goldilocks” Element – Scientists Make Key Advance for Capturing Carbon From the Air

U.S. and Israel against the world as Palestine dominates UN week – Axios

U.S. and Israel against the world as Palestine dominates UN week – Axios

September 23, 2025
Global economic outlook weakens as policy uncertainty weighs on demand – OECD

Global economic outlook weakens as policy uncertainty weighs on demand – OECD

September 23, 2025
Caesars Entertainment (CZR): Assessing Valuation After Times Square Casino Setback and Mounting Investor Concerns – simplywall.st

Caesars Entertainment Faces Times Square Casino Hurdles as Investor Concerns Mount

September 23, 2025
AI Tool Predicts Health Problems in Patients 20 Years Before They Emerge – eWeek

AI Tool Detects Health Issues Two Decades Before Symptoms Appear

September 23, 2025
Germany’s €80B Rearmament Plan Sidelines US Weapons – politicstoday.org

Germany’s €80B Rearmament Plan Sidelines US Weapons – politicstoday.org

September 23, 2025
FOCUS | SCIO holds press conference on promoting high-quality development through high-level ecological & environmental protection – Xinhua

FOCUS | SCIO holds press conference on promoting high-quality development through high-level ecological & environmental protection – Xinhua

September 23, 2025
Researcher on Tylenol-Autism Connection: Not the Best Science – Managed Healthcare Executive

Researcher Questions the Science Behind Tylenol-Autism Link

September 23, 2025
Da Vinci’s Genetic Secrets May Soon Be Revealed by Ambitious DNA Project – ScienceAlert

Unlocking Da Vinci’s Genetic Mysteries: The Ambitious DNA Project Set to Reveal All

September 23, 2025
Eco-Chic Home & Lifestyle Design Market Is Booming Worldwide | Major Giants The Joinery, Emeco, Greenington – openPR.com

Eco-Chic Home & Lifestyle Design Market Is Booming Worldwide | Major Giants The Joinery, Emeco, Greenington – openPR.com

September 23, 2025
Agentic AI and the future of work: navigating technological promise and the risk of increased automation – Equal Times

Agentic AI and the Future of Work: Embracing Innovation While Navigating Automation Challenges

September 23, 2025

Categories

Archives

September 2025
MTWTFSS
1234567
891011121314
15161718192021
22232425262728
2930 
« Aug    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (832)
  • Economy (853)
  • Entertainment (21,731)
  • General (17,194)
  • Health (9,896)
  • Lifestyle (865)
  • News (22,149)
  • People (855)
  • Politics (863)
  • Science (16,063)
  • Sports (21,352)
  • Technology (15,835)
  • World (837)

Recent News

U.S. and Israel against the world as Palestine dominates UN week – Axios

U.S. and Israel against the world as Palestine dominates UN week – Axios

September 23, 2025
Global economic outlook weakens as policy uncertainty weighs on demand – OECD

Global economic outlook weakens as policy uncertainty weighs on demand – OECD

September 23, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version