* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Wednesday, February 18, 2026
Earth-News
  • Home
  • Business
  • Entertainment

    Discover Can’t-Miss Arts and Entertainment Events Happening February 19 in Vallejo and Vacaville!

    How to remember actor Robert Duvall – CNN

    Air Cambodia Elevates Passenger Experience with AirFi’s Wireless In-Flight Entertainment

    Celebrate Mardi Gras, Black History Month, and More Exciting Events This Week in Coral Springs!

    QVC on the Brink of Bankruptcy, Negotiating Major Debt Restructuring

    LSU School of Music Unveils Newly Renovated Recital Hall – Find Out the Reopening Date!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    Discover the VISION EQXX: Mercedes-Benz’s Most Efficient Electric Vehicle Ever

    Yeast Enzyme Unlocks DNA Synthesis Independent of Mitochondrial Respiration

    UK Occupiers Embrace Advanced Building Technology to Transform Employee Experience

    Drone, LPR technology lead to arrest of suspected diesel fuel thieves in Murfreesboro – WKRN News 2

    ProShare Advisors LLC Offloads Shares of GigaCloud Technology Inc. $GCT

    TS Skin Clinic Transforms GTA Beauty Scene with Groundbreaking Lipolift Pro Technology Launch

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment

    Discover Can’t-Miss Arts and Entertainment Events Happening February 19 in Vallejo and Vacaville!

    How to remember actor Robert Duvall – CNN

    Air Cambodia Elevates Passenger Experience with AirFi’s Wireless In-Flight Entertainment

    Celebrate Mardi Gras, Black History Month, and More Exciting Events This Week in Coral Springs!

    QVC on the Brink of Bankruptcy, Negotiating Major Debt Restructuring

    LSU School of Music Unveils Newly Renovated Recital Hall – Find Out the Reopening Date!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology

    Discover the VISION EQXX: Mercedes-Benz’s Most Efficient Electric Vehicle Ever

    Yeast Enzyme Unlocks DNA Synthesis Independent of Mitochondrial Respiration

    UK Occupiers Embrace Advanced Building Technology to Transform Employee Experience

    Drone, LPR technology lead to arrest of suspected diesel fuel thieves in Murfreesboro – WKRN News 2

    ProShare Advisors LLC Offloads Shares of GigaCloud Technology Inc. $GCT

    TS Skin Clinic Transforms GTA Beauty Scene with Groundbreaking Lipolift Pro Technology Launch

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home Business

Scraped images of sexually abused children found in AI training database

December 21, 2023
in Business
Scraped images of sexually abused children found in AI training database
Share on FacebookShare on Twitter

Thousands of images of sexually abused children scraped from the internet are part of a commonly-used database used to train artificial intelligence image generators, according to a report, which warns that AI applications can use offensive photos to create realistic-looking fake child exploitation images that can be sold.

The report, released today by the Stanford University Internet Observatory (SIO), says removal of the source images is going on now because researchers reported the image URLs to the National Center for Missing and Exploited Children (NCMEC) in the U.S. and the Canadian Centre for Child Protection (C3P).

The investigation found the worrisome images in the biggest repository of images used by AI developers for training, known as LAION-5B, containing billions of images scraped from a wide array of sources, including mainstream social media websites and popular adult video sites.

According to the Associated Press, LAION, which stands for the nonprofit Large-scale Artificial Intelligence Open Network, said in a statement that it “has a zero tolerance policy for illegal content and in an abundance of caution” has taken down the datasets until the offending images can be deleted.

The SIO study of LAION-5B was primarily conducted using hashing tools such as Microsoft’s PhotoDNA, which match a fingerprint of an image to databases maintained by nonprofits that receive and process reports of online child sexual exploitation and abuse. Researchers did not view abuse content, and matches were reported to NCMEC and confirmed by C3P where possible.

There are methods to minimize child sexual abuse material (CSAM) in datasets used to train AI models, the SIO said in a statement, but it is challenging to clean or stop the distribution of open datasets with no central authority that hosts the actual data.

The report outlines safety recommendations for collecting datasets, training models, and hosting models trained on scraped datasets. Images collected in future datasets should be checked against known lists of CSAM by using detection tools such as Microsoft’s PhotoDNA or partnering with child safety organizations such as NCMEC and C3P.

The LAION‐5B dataset is derived from a broad cross‐section of the web, and has
been used to train various visual generative machine learning models. This dataset
was built by taking a snapshot of the Common Crawl5 repository, downloading
images referenced in the HTML, reading the “alt” attributes of the images, and using CLIP6
interrogation to discard images that did not sufficiently match the captions. The developers of LAION‐5B did attempt to classify whether content was sexually explicit as well as to detect some degree of underage explicit content.

However, the report notes, version 1.5 of one of the most popular AI image-generating models, Stable Diffusion, was also trained on a wide array of content, both explicit and otherwise. LAION datasets have also been used to train other models, says the report, such as Google’s Imagen, which was trained on a combination of internal datasets and the previous generation LAION‐400M.17.

“Notably,” the report says, “during an audit of the LAION‐400M, Imagen’s developers found
‘a wide range of inappropriate content including pornographic imagery, racist slurs, and harmful social stereotypes’, and deemed it unfit for public use.”

Despite its best efforts to find all CSAM in LAION-5B, the SIO says its work was a “significant undercount” due to the incompleteness of industry hash sets, attrition of live hosted content, lack of access to the original LAION reference image sets, and the limited accuracy of “unsafe” content classifiers.

Web-scale datasets are highly problematic for a number of reasons, even with
attempts at safety filtering, says the report. Ideally, such datasets should be restricted to research settings only, with more curated and well‐sourced datasets used for publicly distributed AI models.

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : ITBusiness.ca – https://www.itbusiness.ca/news/scraped-images-of-sexually-abused-children-found-in-ai-training-database/126848

Tags: businessimagesscraped
Previous Post

Threat actors still exploiting old unpatched vulnerabilities, says Cisco

Next Post

The Best Video Game Surprises Of 2023

Dive Into the Hidden World of Coral Reef Soundscapes with Immersive Spatial Audio and 360° Video

February 18, 2026

Is Biology Intelligently Engineered? Captivating Perspectives from an Engineer

February 18, 2026

Why Fewer Students Are Choosing Computer Science Majors: Exploring the Decline in Interest

February 18, 2026

Fort Recovery Middle School Honors Exceptional Honor Roll Students in Grand Celebration

February 18, 2026

Stephen Vogt’s Ultimate Goal: Winning the World Series with the Guardians

February 18, 2026

How West Virginia Is Poised to Lead the 21st-Century Outdoor Economy Revolution

February 18, 2026

Discover Can’t-Miss Arts and Entertainment Events Happening February 19 in Vallejo and Vacaville!

February 18, 2026

FLARE: Unlocking the Power of Essential Mental Health Literacy

February 18, 2026

Susan Blanco Takes the Oath as Colorado’s Newest Supreme Court Justice

February 18, 2026

Discover the VISION EQXX: Mercedes-Benz’s Most Efficient Electric Vehicle Ever

February 18, 2026

Categories

Archives

February 2026
M T W T F S S
 1
2345678
9101112131415
16171819202122
232425262728  
« Jan    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (1,078)
  • Economy (1,095)
  • Entertainment (21,972)
  • General (19,955)
  • Health (10,136)
  • Lifestyle (1,111)
  • News (22,149)
  • People (1,102)
  • Politics (1,112)
  • Science (16,310)
  • Sports (21,598)
  • Technology (16,077)
  • World (1,087)

Recent News

Dive Into the Hidden World of Coral Reef Soundscapes with Immersive Spatial Audio and 360° Video

February 18, 2026

Is Biology Intelligently Engineered? Captivating Perspectives from an Engineer

February 18, 2026
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version