Thursday, March 12, 2026
Earth-News

Scraped images of sexually abused children found in AI training database

December 21, 2023
in Business

Thousands of images of sexually abused children scraped from the internet are part of a widely used database for training artificial intelligence image generators, according to a report, which warns that AI applications can use such photos to create realistic-looking fake child exploitation images that can be sold.

The report, released today by the Stanford University Internet Observatory (SIO), says removal of the source images is under way because researchers reported the image URLs to the National Center for Missing and Exploited Children (NCMEC) in the U.S. and the Canadian Centre for Child Protection (C3P).

The investigation found the worrisome images in the biggest repository of images used by AI developers for training, known as LAION-5B, containing billions of images scraped from a wide array of sources, including mainstream social media websites and popular adult video sites.

According to the Associated Press, LAION, which stands for the nonprofit Large-scale Artificial Intelligence Open Network, said in a statement that it “has a zero tolerance policy for illegal content and in an abundance of caution” has taken down the datasets until the offending images can be deleted.

The SIO study of LAION-5B was primarily conducted using hashing tools such as Microsoft’s PhotoDNA, which match a fingerprint of an image to databases maintained by nonprofits that receive and process reports of online child sexual exploitation and abuse. Researchers did not view abuse content, and matches were reported to NCMEC and confirmed by C3P where possible.
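As a rough illustration of matching a fingerprint against a list of known hashes, the sketch below uses SHA-256 from Python's standard library as a stand-in. This is an assumption made for illustration only: PhotoDNA is a proprietary perceptual hash that can match altered copies of an image, whereas SHA-256 matches only byte-identical files. The function names, URLs, and placeholder byte strings are hypothetical and do not come from the report.

```python
import hashlib


def fingerprint(image_bytes: bytes) -> str:
    """Return a hex SHA-256 digest of the raw bytes.

    Stand-in only: a real perceptual hash such as PhotoDNA is computed
    from image content, not from the exact bytes of the file.
    """
    return hashlib.sha256(image_bytes).hexdigest()


def find_matches(images: dict[str, bytes], known_hashes: set[str]) -> list[str]:
    """Return the URLs whose fingerprint appears in the known-hash set."""
    return [url for url, data in images.items() if fingerprint(data) in known_hashes]


# Hypothetical data: placeholder byte strings instead of real image files.
images = {
    "https://example.org/a.jpg": b"placeholder-bytes-a",
    "https://example.org/b.jpg": b"placeholder-bytes-b",
}
# In practice the hash list would come from a child safety organization.
known_hashes = {fingerprint(b"placeholder-bytes-b")}

flagged = find_matches(images, known_hashes)
```

Only the URLs and hashes are compared in this sketch, so nobody has to view the flagged items. That is in line with the article's note that researchers did not view abuse content.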

There are methods to minimize child sexual abuse material (CSAM) in datasets used to train AI models, the SIO said in a statement, but it is challenging to clean or stop the distribution of open datasets with no central authority that hosts the actual data.

The report outlines safety recommendations for collecting datasets, training models, and hosting models trained on scraped datasets. Images collected in future datasets should be checked against known lists of CSAM by using detection tools such as Microsoft’s PhotoDNA or partnering with child safety organizations such as NCMEC and C3P.

The LAION‐5B dataset is derived from a broad cross‐section of the web, and has been used to train various visual generative machine learning models. This dataset was built by taking a snapshot of the Common Crawl repository, downloading images referenced in the HTML, reading the “alt” attributes of the images, and using CLIP interrogation to discard images that did not sufficiently match the captions. The developers of LAION‐5B did attempt to classify whether content was sexually explicit as well as to detect some degree of underage explicit content.
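The build steps described above (read image references and their “alt” text from HTML, then drop pairs whose caption does not match the image well enough) can be sketched roughly as follows. This is a minimal illustration, not LAION's actual pipeline: the class and function names, the sample HTML, the toy scores, and the 0.5 threshold are all hypothetical, and the scoring function is a placeholder for a real CLIP image-text similarity model.

```python
from html.parser import HTMLParser


class ImgAltParser(HTMLParser):
    """Collect (src, alt) pairs from <img> tags that have non-empty alt text."""

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            attributes = dict(attrs)
            src, alt = attributes.get("src"), attributes.get("alt")
            if src and alt:
                self.pairs.append((src, alt.strip()))


def keep_matching(pairs, score_fn, threshold):
    """Keep pairs whose image-caption score meets the threshold."""
    return [(src, alt) for src, alt in pairs if score_fn(src, alt) >= threshold]


# Hypothetical page fragment standing in for crawled HTML.
html = (
    '<p><img src="cat.jpg" alt="a cat on a sofa">'
    '<img src="logo.png" alt="">'
    '<img src="dog.jpg" alt="a red car"></p>'
)
parser = ImgAltParser()
parser.feed(html)

# Placeholder scores; a real pipeline would compute these with CLIP.
toy_scores = {"cat.jpg": 0.9, "dog.jpg": 0.1}
kept = keep_matching(parser.pairs, lambda src, alt: toy_scores[src], 0.5)
```

In this sketch the image with empty alt text is never collected, and the mismatched caption is dropped by the threshold, leaving a single image-caption pair.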

However, the report notes, version 1.5 of one of the most popular AI image-generating models, Stable Diffusion, was also trained on a wide array of content, both explicit and otherwise. LAION datasets have also been used to train other models, says the report, such as Google’s Imagen, which was trained on a combination of internal datasets and the previous-generation LAION‐400M.

“Notably,” the report says, “during an audit of the LAION‐400M, Imagen’s developers found ‘a wide range of inappropriate content including pornographic imagery, racist slurs, and harmful social stereotypes’, and deemed it unfit for public use.”

Despite its best efforts to find all CSAM in LAION-5B, the SIO says its work was a “significant undercount” due to the incompleteness of industry hash sets, attrition of live hosted content, lack of access to the original LAION reference image sets, and the limited accuracy of “unsafe” content classifiers.

Web-scale datasets are highly problematic for a number of reasons, even with attempts at safety filtering, says the report. Ideally, such datasets should be restricted to research settings only, with more curated and well‐sourced datasets used for publicly distributed AI models.

Copyright for syndicated content belongs to the linked Source : ITBusiness.ca – https://www.itbusiness.ca/news/scraped-images-of-sexually-abused-children-found-in-ai-training-database/126848

© 2023 earth-news.info
