Monday, June 9, 2025
Earth-News

GPT, Other AI Models Can’t Decode SEC Filings, New Research Finds

December 20, 2023
in Business

New research conducted by a startup called Patronus AI shows that large language models (LLMs), similar to the one that powers ChatGPT, usually fail to decode Securities and Exchange Commission (SEC) filings.

Even the best-performing configuration the researchers tested, OpenAI’s GPT-4-Turbo with the ability to read nearly an entire filing alongside the question, got only 79 per cent of answers right on Patronus AI’s new test, the company’s founders told CNBC.

The models often refused to answer, or would “hallucinate” figures and facts that did not appear in the SEC filings.

“That type of performance rate is just absolutely unacceptable. It has to be much much higher for it to really work in an automated and production-ready way,” Patronus AI co-founder Anand Kannappan said.

Are LLMs really reliable?

Kannappan reposted an X (formerly Twitter) post by DoorDash’s Gokul Rajaram, which noted that “LLMs are nondeterministic”. In other words, they are not guaranteed to produce the same output each time they are given the same input.

LLMs are nondeterministic — they’re not guaranteed to produce the same output every time for the same input. That means that companies will need to do more rigorous testing to make sure they’re operating correctly, not going off-topic, and providing reliable results. This is what…

— Gokul Rajaram (@gokulr) December 19, 2023

Companies deploying these models will therefore need more rigorous testing to make sure the results they provide are reliable.
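One way to picture what testing a nondeterministic model involves is to ask the same question many times and measure how often the answers agree. The sketch below is illustrative only: `stub_model` is a hypothetical stand-in that picks between two made-up answers at random, not a real LLM call, and `agreement_rate` is a name introduced here for the example.

```python
import random
from collections import Counter


def stub_model(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for an LLM call.

    Returns "answer A" about three times out of four and "answer B"
    otherwise, to mimic a model that does not always repeat itself.
    """
    return rng.choice(["answer A", "answer A", "answer A", "answer B"])


def agreement_rate(prompt: str, runs: int = 20, seed: int = 0) -> float:
    """Send the same prompt `runs` times and return the share of runs
    that produced the most common answer."""
    rng = random.Random(seed)  # seeded so the experiment is repeatable
    answers = Counter(stub_model(prompt, rng) for _ in range(runs))
    most_common_count = answers.most_common(1)[0][1]
    return most_common_count / runs


rate = agreement_rate("What was total revenue in the latest filing?")
print(f"agreement across runs: {rate:.0%}")
```

An agreement rate well below 100 per cent would signal that a single run cannot be trusted on its own, which is the kind of check the quoted post is calling for.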

The latest findings further highlight some of the AI model-related challenges big companies, especially in regulated industries such as finance, face while trying to integrate this cutting-edge technology into their operations.

One of the most promising applications for chatbots has been their ability to extract crucial numbers and perform analysis on financial narratives.

Notably, SEC filings are teeming with important data, and if a ChatGPT-like bot could flawlessly summarise them or answer queries about what is in them, it could give the user a major advantage in the competitive financial industry.

Earlier this year, Bloomberg LP used the same underlying technology as OpenAI’s GPT to develop an AI model for financial data. Likewise, finance professor Alejandro Lopez-Lira showed that ChatGPT might come in handy for predicting stock movements.

Google is also working on a Gemini AI-powered program codenamed “Project Ellmann,” which will give users a “bird’s-eye” view of their lives. Moreover, McKinsey & Company suggests generative AI will radically overhaul how wealth management firms do business.

Despite the hype surrounding the newfangled technology, GPT’s entry into the industry has been pretty rough. When Microsoft launched its Bing Chat using OpenAI’s GPT, one of its primary uses was to quickly summarise an earnings press release.

However, some hawk-eyed observers realised that the numbers in Microsoft’s example were off. In fact, some of these numbers were entirely made up. In other words, Bing AI, which was recently rebranded to Copilot, made multiple factual errors.

How did AI models perform in the tests?

Patronus AI tested four language models: OpenAI’s GPT-4 and GPT-4-Turbo, Anthropic’s Claude 2 and Meta’s Llama 2. The company used a subset of 150 questions it had produced for the test.

The company also tested a slew of configurations and prompts, including a setting where the OpenAI models were provided the exact relevant source text in the question, which is known as “Oracle” mode.

The other tests involved instructing the models where the underlying SEC documents would be stored. Alternatively, the models were given “long context,” which is equivalent to providing an entire SEC filing alongside the question in the prompt.
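The difference between these settings comes down to what is placed in the prompt next to the question. The sketch below is a rough illustration, not Patronus AI’s actual test harness: the function name `build_prompt`, the mode labels and the prompt wording are all assumptions made for the example. It covers the “Oracle” and “long context” settings, plus the “closed book” setting the article describes for GPT-4-Turbo, and leaves out the setup in which models are pointed to stored documents.

```python
def build_prompt(question: str, mode: str,
                 filing_text: str = "", relevant_passage: str = "") -> str:
    """Assemble a prompt for one of three hypothetical test settings."""
    if mode == "closed_book":
        # No source document at all: the model sees only the question.
        return f"Question: {question}\nAnswer:"
    if mode == "oracle":
        # Only the exact passage that contains the answer is supplied.
        return f"Source text:\n{relevant_passage}\n\nQuestion: {question}\nAnswer:"
    if mode == "long_context":
        # The whole filing is pasted in alongside the question.
        return f"Filing:\n{filing_text}\n\nQuestion: {question}\nAnswer:"
    raise ValueError(f"unknown mode: {mode}")


# Placeholder inputs, not taken from any real filing.
question = "What was the company's total revenue?"
for mode in ("closed_book", "oracle", "long_context"):
    prompt = build_prompt(question, mode,
                          filing_text="<full filing text>",
                          relevant_passage="<passage containing the answer>")
    print(f"--- {mode} ---\n{prompt}\n")
```

Seen this way, Oracle mode is the easiest setting because the model does not have to find the relevant passage, while long context requires it to locate the answer inside a whole filing.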

GPT-4-Turbo

GPT-4-Turbo failed the startup’s “closed book” test, in which the model was given no access to any SEC source document. It failed to answer 88 per cent of the 150 questions it was asked and produced a correct answer only 14 times.

However, it performed better when given access to the underlying filings. In Oracle mode, GPT-4-Turbo answered the questions correctly 85 per cent of the time.
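The closed-book figures can be cross-checked with a little arithmetic. The sketch below assumes the reported 88 per cent is a rounded share of the 150 questions; the number of wrong answers it derives is an inference from the other two figures, not a number given in the article.

```python
TOTAL_QUESTIONS = 150   # size of the question subset, as reported
CORRECT = 14            # correct closed-book answers, as reported
UNANSWERED_RATE = 0.88  # share of questions not answered, assumed rounded

unanswered = round(TOTAL_QUESTIONS * UNANSWERED_RATE)  # 132 questions
wrong = TOTAL_QUESTIONS - unanswered - CORRECT         # inferred remainder: 4
correct_rate = CORRECT / TOTAL_QUESTIONS               # about 0.093

print(f"unanswered: {unanswered}")
print(f"correct:    {CORRECT} ({correct_rate:.1%})")
print(f"wrong:      {wrong} (inferred)")
```

On these assumptions, the 14 correct answers amount to roughly 9.3 per cent of the questions, against 85 per cent in Oracle mode.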

Llama 2

Meta’s open-source AI model hallucinated frequently, producing wrong answers 70 per cent of the time. It managed correct answers only 19 per cent of the time when given access to underlying documents.

Claude 2

Anthropic’s Claude 2 performed well when the researchers included the entire relevant SEC filing along with the question. It answered 75 per cent of the questions accurately and gave wrong answers for 21 per cent of the queries it was asked.

Despite these shortcomings, Patronus AI’s co-founders believe language models like GPT can help people in the finance industry.

“We definitely think that the results can be pretty promising. Models will continue to get better over time. We’re very hopeful that in the long term, a lot of this can be automated,” Kannappan said.

Copyright for syndicated content belongs to the linked Source : IBTimes – https://www.ibtimes.co.uk/gpt-other-ai-models-cant-decode-sec-filings-new-research-finds-1722295

© 2023 earth-news.info
