* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Saturday, June 7, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Brass Lion Entertainment unveils co-op action RPG Wu-Tang: Rise of the Deceiver – VentureBeat

    Unleash Your Inner Warrior: Discover the Co-Op Action RPG Wu-Tang: Rise of the Deceiver!

    Entertainment lineup released for 2025 Mississippi State Fair – WAPT

    Exciting Entertainment Lineup Unveiled for the 2025 Mississippi State Fair!

    After Denzel Washington Said He Would Be In Black Panther 3, Ryan Coogler Explained Why He’s ‘Fine’ With That Information Being Revealed So Early – Yahoo

    Ryan Coogler Shares Why He’s Cool with Denzel Washington’s Black Panther 3 Reveal!

    Traveling Tacos and Tequila Festival to stop at Florence Yall’s stadium this October – Cincinnati Enquirer

    Get Ready for a Flavor Fiesta: Traveling Tacos and Tequila Festival Hits Florence Y’all’s Stadium This October!

    9 things to do this weekend in Lake County plus a look ahead – Leesburg Daily Commercial

    Discover 9 Exciting Weekend Adventures in Lake County and What’s Coming Up!

    Shows to Watch – The Advocate

    Must-See Shows You Can’t Miss!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Apple Watch and the future of wearable technology in healthcare – MSN

    Revolutionizing Healthcare: The Future of Wearable Technology with Apple Watch

    ECS Professor Pankaj K. Jha Receives NSF Grant to Develop Quantum Technology – Syracuse University News

    Unlocking the Future: ECS Professor Pankaj K. Jha Secures NSF Grant for Groundbreaking Quantum Technology Development

    Fire Tech Brief: 5 Fire Apparatus Technology Upgrades – firehouse.com

    Revving Up Safety: 5 Innovative Upgrades for Fire Apparatus Technology

    U.S. FDA Grants Platform Technology Designation to the Viral Vector Used in SRP-9003, Sarepta’s Investigational Gene Therapy for the Treatment of Limb Girdle Muscular Dystrophy Type 2E/R4 – Sarepta Therapeutics

    Breakthrough for Gene Therapy: FDA Designates Viral Vector in Sarepta’s SRP-9003 for Limb Girdle Muscular Dystrophy Treatment

    Waunakee Fifth-Graders Dive into the Future at Exciting Tech Day!

    Property Technology Magazine Unveils “PropTech Top 50 Index” and the “2025 PropTech Trends Report – The Great Rebuild.” – Business Wire

    Property Technology Magazine Unveils “PropTech Top 50 Index” and the “2025 PropTech Trends Report – The Great Rebuild.” – Business Wire

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Brass Lion Entertainment unveils co-op action RPG Wu-Tang: Rise of the Deceiver – VentureBeat

    Unleash Your Inner Warrior: Discover the Co-Op Action RPG Wu-Tang: Rise of the Deceiver!

    Entertainment lineup released for 2025 Mississippi State Fair – WAPT

    Exciting Entertainment Lineup Unveiled for the 2025 Mississippi State Fair!

    After Denzel Washington Said He Would Be In Black Panther 3, Ryan Coogler Explained Why He’s ‘Fine’ With That Information Being Revealed So Early – Yahoo

    Ryan Coogler Shares Why He’s Cool with Denzel Washington’s Black Panther 3 Reveal!

    Traveling Tacos and Tequila Festival to stop at Florence Yall’s stadium this October – Cincinnati Enquirer

    Get Ready for a Flavor Fiesta: Traveling Tacos and Tequila Festival Hits Florence Y’all’s Stadium This October!

    9 things to do this weekend in Lake County plus a look ahead – Leesburg Daily Commercial

    Discover 9 Exciting Weekend Adventures in Lake County and What’s Coming Up!

    Shows to Watch – The Advocate

    Must-See Shows You Can’t Miss!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    Apple Watch and the future of wearable technology in healthcare – MSN

    Revolutionizing Healthcare: The Future of Wearable Technology with Apple Watch

    ECS Professor Pankaj K. Jha Receives NSF Grant to Develop Quantum Technology – Syracuse University News

    Unlocking the Future: ECS Professor Pankaj K. Jha Secures NSF Grant for Groundbreaking Quantum Technology Development

    Fire Tech Brief: 5 Fire Apparatus Technology Upgrades – firehouse.com

    Revving Up Safety: 5 Innovative Upgrades for Fire Apparatus Technology

    U.S. FDA Grants Platform Technology Designation to the Viral Vector Used in SRP-9003, Sarepta’s Investigational Gene Therapy for the Treatment of Limb Girdle Muscular Dystrophy Type 2E/R4 – Sarepta Therapeutics

    Breakthrough for Gene Therapy: FDA Designates Viral Vector in Sarepta’s SRP-9003 for Limb Girdle Muscular Dystrophy Treatment

    Waunakee Fifth-Graders Dive into the Future at Exciting Tech Day!

    Property Technology Magazine Unveils “PropTech Top 50 Index” and the “2025 PropTech Trends Report – The Great Rebuild.” – Business Wire

    Property Technology Magazine Unveils “PropTech Top 50 Index” and the “2025 PropTech Trends Report – The Great Rebuild.” – Business Wire

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home General

Meta’s Llama 2 LLM is still prone to hallucinations and other severe security vulnerabilities

April 18, 2024
in General
Meta’s Llama 2 LLM is still prone to hallucinations and other severe security vulnerabilities
Share on FacebookShare on Twitter

Serving tech enthusiasts for over 25 years.

TechSpot means tech analysis and advice you can trust.

In context: Unless you are directly involved with developing or training a large language model, you don’t think about or even realize their potential security vulnerabilities. Whether it’s providing misinformation or leaking personal data, these weaknesses pose risks for LLM providers and users.

Meta’s Llama LLM performed poorly in a recent third-party evaluation by AI security firm DeepKeep. Researchers tested the model in 13 risk-assessment categories, but it only managed to pass in four. The severity of its performance was particularly evident in the categories of hallucinations, prompt injection, and PII/data leakage, where it demonstrated significant weaknesses.

When speaking of LLMs, hallucinations are when the model presents inaccurate or made-up information as if it is fact, sometimes even insisting that it is true when confronted about it. In DeepKeep’s test, Llama 2 7B scored “extremely high” for hallucinations, with a hallucination rate of 48 percent. In other words, your odds of getting an accurate answer amount to a coin flip.

“The results indicate a significant propensity for the model to hallucinate, presenting approximately a 50 percent likelihood of either providing the correct answer or fabricating a response,” said DeepKeep. “Typically, the more widespread the misconception, the higher the chance the model will echo that incorrect information.”

Hallucinations are a long-known problem for Llama. Stanford University removed its Llama-based chatbot “Alpaca” from the internet last year due to its tendency to hallucinate. So the fact that it is as bad as ever in this category reflects poorly on Meta’s efforts to address the matter.

Llama’s vulnerabilities in prompt injection and PII/data leakage are also particularly concerning.

Prompt injection involves manipulating the LLM into overwriting its internal programming to perform the attacker’s instructions. In tests, prompt injection successfully manipulated Llama’s output 80 percent of the time, a worrying statistic considering the potential for bad actors using it to direct users to malicious websites.

“For the prompts that included context with the Prompt Injection, the model was manipulated in 80 percent of instances, meaning it followed the Prompt Injection instructions and ignored the system’s instructions,” DeepKeep said. “[Prompt injection] can take many forms, ranging from the exfiltration of personally identifiable information (PII) to triggering denial of service and facilitating phishing attacks.”

Llama also has a propensity for data leakage. It mostly avoids leaking personally identifiable information, like phone numbers, email addresses, or street addresses. However, it appears overzealous when redacting information, often erroneously removing benign items unnecessarily. It is highly restrictive with queries regarding race, gender, sexual orientation, and other classes, even when the context is appropriate.

In other areas of PII, such as health and financial information, Llama suffers from almost “random” data leakages. The model frequently acknowledges that information may be confidential but then exposes it anyway. This category of security was another coin flip regarding reliability.

“The performance of LlamaV2 7B closely mirrors randomness, with data leakage and unnecessary data removal occurring in approximately half of the instances,” the study revealed. “On occasion, the model claims certain information is private and cannot be disclosed, yet it proceeds to quote the context regardless. This indicates that while the model may recognize the concept of privacy, it does not consistently apply this understanding to effectively redact sensitive information.”

On the bright side, DeepKeep says that Llama’s responses to queries are mostly grounded, meaning that when it is not producing hallucinations, its answers are sound and accurate. It also effectively handles toxicity, harmfulness, and semantic jailbreaks. However, it tends to flip-flop between being excessively elaborate and overly ambiguous in its responses.


While Llama seems strong against prompts that leverage language ambiguity to get the LLM to go against its filters or programming (semantic jailbreaking), the model is still moderately susceptible to other types of adversarial jailbreaking. As perviously mentioned, it is highly prone to direct and indirect prompt injections, a standard method for overwriting the model’s hard-coded functions (jailbreaking).

Meta is not the only LLM provider with security risks like these. Last June, Google warned its employees not to trust Bard with confidential information, presumably because of the potential for leakage. Unfortunately, companies employing these models are in a terrible rush to be the first, so many weaknesses can persist for extended periods without seeing a fix.

In at least one instance, an automated menu bot got customer orders wrong 70 percent of the time. Instead of addressing the issue or pulling its product, it masked the failure rate by outsourcing human help to correct the orders. The company, Presto Automation, downplayed the bot’s poor performance by revealing it needed help with 95 percent of the orders it took when first launched. It’s an unflattering stance, no matter how you look at it.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : TechSpot – https://www.techspot.com/news/102653-meta-llama-2-llm-prone-hallucinations-other-severe.html

Previous Post

Astronomers detected the most massive stellar black hole in the Milky Way

Next Post

Sony’s Ghost of Tsushima brings familiar PlayStation features to PC

Groundbreaking study maps the movements of marine megafauna – EurekAlert!

Revolutionary Research Unveils the Migrations of Marine Megafauna

June 7, 2025
The science behind having perfect lake days – wtol.com

Unlocking the Secrets to Your Perfect Lake Day!

June 7, 2025
For both artists and scientists, slow looking allows surprising connections to surface – The Conversation

Unlocking Creativity: How Slow Looking Sparks Unexpected Connections for Artists and Scientists

June 7, 2025
Less colorful, more meaningful: Sean O’Malley thinks lifestyle changes key to reclaim UFC gold – MMA Junkie

Sean O’Malley: Embracing Lifestyle Changes to Reclaim UFC Gold

June 7, 2025
World Cup qualifying: Haaland leads Norway to its first win vs. Italy in 25 years – FOX Sports

Historic Victory: Haaland Guides Norway to First Win Over Italy in 25 Years!

June 7, 2025
City of Albertville Breaks Ground on Alleyway Entertainment Venue – WHNT.com

Albertville Unveils Exciting New Alleyway Entertainment Venue!

June 7, 2025
Eliminating Waste, Fraud, and Abuse in Medicaid – The White House (.gov)

Eliminating Waste, Fraud, and Abuse in Medicaid – The White House (.gov)

June 7, 2025
After Trump pulled NASA nomination, Musk ally Jared Isaacman says stint in politics was ‘thrilling’ – CNBC

After Trump pulled NASA nomination, Musk ally Jared Isaacman says stint in politics was ‘thrilling’ – CNBC

June 7, 2025
Apple Watch and the future of wearable technology in healthcare – MSN

Revolutionizing Healthcare: The Future of Wearable Technology with Apple Watch

June 7, 2025
Letters to Sports: Dodgers must figure out their injured pitcher problem – Los Angeles Times

Dodgers Face a Pitching Dilemma: How to Tackle Their Injury Woes

June 7, 2025

Categories

Archives

June 2025
MTWTFSS
 1
2345678
9101112131415
16171819202122
23242526272829
30 
« May    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (675)
  • Economy (688)
  • Entertainment (21,594)
  • General (15,269)
  • Health (9,730)
  • Lifestyle (692)
  • News (22,149)
  • People (689)
  • Politics (696)
  • Science (15,907)
  • Sports (21,192)
  • Technology (15,674)
  • World (674)

Recent News

Groundbreaking study maps the movements of marine megafauna – EurekAlert!

Revolutionary Research Unveils the Migrations of Marine Megafauna

June 7, 2025
The science behind having perfect lake days – wtol.com

Unlocking the Secrets to Your Perfect Lake Day!

June 7, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version