* . *
  • About
  • Advertise
  • Privacy & Policy
  • Contact
Tuesday, June 10, 2025
Earth-News
  • Home
  • Business
  • Entertainment
    Cisco Partners with Monumental Sports & Entertainment to Power New D.C. Arena – Cisco Newsroom

    Cisco Teams Up with Monumental Sports & Entertainment to Revolutionize the New D.C. Arena Experience

    Middle Eastern Entertainment Headlines at 5:49 a.m. GMT – Yahoo

    Exciting Updates from the Middle Eastern Entertainment Scene!

    Ceramic Dalmatian Entertainment is WLAF’s Business of the Week – WLAF

    Spotlight on Success: Ceramic Dalmatian Entertainment Shines as This Week’s Featured Business!

    Brass Lion Entertainment unveils co-op action RPG Wu-Tang: Rise of the Deceiver – VentureBeat

    Unleash Your Inner Warrior: Discover the Co-Op Action RPG Wu-Tang: Rise of the Deceiver!

    Entertainment lineup released for 2025 Mississippi State Fair – WAPT

    Exciting Entertainment Lineup Unveiled for the 2025 Mississippi State Fair!

    After Denzel Washington Said He Would Be In Black Panther 3, Ryan Coogler Explained Why He’s ‘Fine’ With That Information Being Revealed So Early – Yahoo

    Ryan Coogler Shares Why He’s Cool with Denzel Washington’s Black Panther 3 Reveal!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    SunHydrogen Unveils Large Hydrogen Reactor in Houston – Fuel Cells Works

    SunHydrogen Launches Groundbreaking Large-Scale Hydrogen Reactor in Houston

    Technology, Labor Rights, and Political Power in Kenya and Across Africa – Tech Policy Press

    How Technology is Shaping Labor Rights and Political Power Across Africa

    Reeves to Announce £86 Billion for Science and Technology in Spending Review – Bloomberg

    Reeves Set to Unveil Groundbreaking £86 Billion Investment in Science and Technology!

    Innovation at Scale: How P&G Transforms Business Through Technology – Procter & Gamble

    Revolutionizing Business: P&G’s Bold Journey into Technological Innovation

    Drag racer survives frightening airborne crash at World Wide Technology Raceway – FOX 2

    Drag racer survives frightening airborne crash at World Wide Technology Raceway – FOX 2

    Apple Watch and the future of wearable technology in healthcare – MSN

    Revolutionizing Healthcare: The Future of Wearable Technology with Apple Watch

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
  • Home
  • Business
  • Entertainment
    Cisco Partners with Monumental Sports & Entertainment to Power New D.C. Arena – Cisco Newsroom

    Cisco Teams Up with Monumental Sports & Entertainment to Revolutionize the New D.C. Arena Experience

    Middle Eastern Entertainment Headlines at 5:49 a.m. GMT – Yahoo

    Exciting Updates from the Middle Eastern Entertainment Scene!

    Ceramic Dalmatian Entertainment is WLAF’s Business of the Week – WLAF

    Spotlight on Success: Ceramic Dalmatian Entertainment Shines as This Week’s Featured Business!

    Brass Lion Entertainment unveils co-op action RPG Wu-Tang: Rise of the Deceiver – VentureBeat

    Unleash Your Inner Warrior: Discover the Co-Op Action RPG Wu-Tang: Rise of the Deceiver!

    Entertainment lineup released for 2025 Mississippi State Fair – WAPT

    Exciting Entertainment Lineup Unveiled for the 2025 Mississippi State Fair!

    After Denzel Washington Said He Would Be In Black Panther 3, Ryan Coogler Explained Why He’s ‘Fine’ With That Information Being Revealed So Early – Yahoo

    Ryan Coogler Shares Why He’s Cool with Denzel Washington’s Black Panther 3 Reveal!

  • General
  • Health
  • News

    Cracking the Code: Why China’s Economic Challenges Aren’t Shaking Markets, Unlike America’s” – Bloomberg

    Trump’s Narrow Window to Spread the Truth About Harris

    Trump’s Narrow Window to Spread the Truth About Harris

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    Israel-Gaza war live updates: Hamas leader Ismail Haniyeh assassinated in Iran, group says

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    PAP Boss to Niger Delta Youths, Stay Away from the Protest

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Court Restricts Protests In Lagos To Freedom, Peace Park

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Fans React to Jazz Jennings’ Inspiring Weight Loss Journey

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Science
  • Sports
  • Technology
    SunHydrogen Unveils Large Hydrogen Reactor in Houston – Fuel Cells Works

    SunHydrogen Launches Groundbreaking Large-Scale Hydrogen Reactor in Houston

    Technology, Labor Rights, and Political Power in Kenya and Across Africa – Tech Policy Press

    How Technology is Shaping Labor Rights and Political Power Across Africa

    Reeves to Announce £86 Billion for Science and Technology in Spending Review – Bloomberg

    Reeves Set to Unveil Groundbreaking £86 Billion Investment in Science and Technology!

    Innovation at Scale: How P&G Transforms Business Through Technology – Procter & Gamble

    Revolutionizing Business: P&G’s Bold Journey into Technological Innovation

    Drag racer survives frightening airborne crash at World Wide Technology Raceway – FOX 2

    Drag racer survives frightening airborne crash at World Wide Technology Raceway – FOX 2

    Apple Watch and the future of wearable technology in healthcare – MSN

    Revolutionizing Healthcare: The Future of Wearable Technology with Apple Watch

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
No Result
View All Result
Earth-News
No Result
View All Result
Home General

Why publishers are questioning the effectiveness of blocking AI web crawlers

October 1, 2023
in General
Why publishers are questioning the effectiveness of blocking AI web crawlers
Share on FacebookShare on Twitter

This article is part of Digiday’s coverage of its Digiday Publishing Summit. More from the series →

A number of publishers — including Bloomberg and The New York Times — were quick to block OpenAI’s web crawler from accessing their sites, to protect their content from getting scraped and used to feed the artificial intelligence tech company’s large language models (LLMs). But whether this tactic is actually effective is debatable, according to conversations with five publishing executives.

“It’s a symbolic gesture,” said a senior tech executive at a media company, who requested anonymity to speak freely.

In August, OpenAI announced that publishers can now block its GPTBot web crawler from accessing their web pages’ content. Since then, 26 of the 100 most-visited sites (and 242 of the top 1,000 sites) have done so, according to Originality.ai.

However, publishers’ content distribution models might make the protective strategy moot. One publishing exec told Digiday their company publishes on eight different syndication apps and websites. Because the content is already so discoverable, it feels like the protective measure to block OpenAI’s web crawler was a futile effort, they said.

“I think it was kind of a wasted effort on my part. It’s an inevitability that this stuff is ingested and crawled and learned from,” the exec said during a closed-door session at the Digiday Publishing Summit in Key Biscayne, Fla. last week.

Publishers have had a hard time protecting against generative AI tools like OpenAI’s chatbot ChatGPT from bypassing their paywalls and scraping their content to train their LLMs. Though publishers can now block OpenAI’s crawler, some publishing execs aren’t convinced it’s enough to protect their IP.

“It’s a long-term problem, and there isn’t a short-term solution,” said Matt Rogerson, director of public policy at Guardian Media Group. “It’s a sign that publishers are taking back a bit more control and are going to start demanding more control over other folks that are scraping for different purposes.”

Google and Microsoft are listening

OpenAI is just one of the tech companies using web crawlers to feed their LLMs for AI tools and systems. Google and Microsoft’s web crawlers are essential for publishers’ content to get indexed and surfaced in search results on Google Search and Bing — but those crawlers also scrape content to train those tech companies’ LLMs and AI chatbots. The Guardian’s Rogerson called these “bundled scrapers.”

“They treat it all as one big search product,” the first tech exec said. “They’re like, ‘No, you don’t get the granularity choice. We give you the opportunity to opt out.’ But obviously, we don’t want to opt out of all web crawling.”

Those tech companies are listening to publishers’ concerns. In July, Google announced it was exploring alternatives to its robots.txt protocol — the file that tells search engine crawlers which URLs they can access — to give publishers more control over how their IP is used in different contexts. And just Thursday, Google released a new tool called Google-Extended that gives website owners the ability to opt out of having their sites crawled for data used to train Google’s AI systems and its generative AI chatbot Bard. (The execs interviewed for this story spoke to Digiday before that announcement.)

Microsoft has chosen to go another route. Last week, the company announced that publishers can add a piece of code to their web pages to communicate that the content should not be used for LLMs (a bit like a copyright tag). Microsoft is giving website owners two options: a “NOCACHE” tag that allows only titles, snippets and URLs to appear in the Bing chatbot or to train its AI models, or a “NOARCHIVE” tag, which prevents any usage in its chatbot or AI training.

“They are signaling that they will add more granularity,” Rogerson said. “We’re examining that in detail.”

The New York Times took matters into their own hands and added language to its Terms of Service last month prohibiting the use of its content to train machine learning or AI systems, giving the Times the ability to pursue legal action against companies using their data.

A negotiation tactic

So why are publishers blocking OpenAI’s web crawler at all, if the move doesn’t ensure protection of their content?

Execs told Digiday it’s a negotiation tactic.

“Putting the blocker in place is at least one… starting point for the inevitable negotiations that we’ll have as publishers with OpenAI and other companies. We’ll be able to have that as a point of leverage and say, we’ll take it off if we can reach a deal or an agreement,” said the publishing exec at the Digiday Publishing Summit.

Publishers’ protective actions are creating a “market for licenses for data mining,” with a potential for compensation for sharing their data, Rogerson said. OpenAI struck a licensing partnership with the Associated Press in July, wherein OpenAI is paying to license part of the AP’s text archive to train its models.

But not all publishers feel like they’re powerful enough to negotiate the use of their content with these large tech companies.

“We’re not big enough to flex our muscles and block it,” said a second publishing executive who asked to remain anonymous. The exec was also unsure if blocking OpenAI’s web crawler would affect their use of GPT, the AI technology ChatGPT is built on that OpenAI has made available for outside developers to license.

“If you start blocking the crawler, do they cut you off from using the tool? Does the tool stop working as well? It’s really unclear,” the publishing exec said. “There probably is a way to eventually figure it out, but not without a ton of detective work,” they added.

https://digiday.com/?p=519789

>>> Read full article>>>
Copyright for syndicated content belongs to the linked Source : DigiDay – https://digiday.com/media/a-symbolic-gesture-publishers-question-the-effectiveness-of-blocking-ai-web-crawlers/?utm_campaign=digidaydis&utm_medium=rss&utm_source=general-rss

Previous Post

Why security, scalability and a data-driven mindset are crucial for enterprise analytics

Next Post

Actor Michael Gambon, Known For ‘Harry Potter’ Dumbledore Role, Passes Away At Age 82

SunHydrogen Unveils Large Hydrogen Reactor in Houston – Fuel Cells Works

SunHydrogen Launches Groundbreaking Large-Scale Hydrogen Reactor in Houston

June 9, 2025
Coco Gauff Makes Request After French Open – Yahoo Sports

Coco Gauff’s Unexpected Request After French Open Exit

June 9, 2025
Alcaraz tops Sinner in a French Open final for the ages – Yahoo Sports

Alcaraz tops Sinner in a French Open final for the ages – Yahoo Sports

June 9, 2025

Ecology expands Washington state’s drought emergency – Columbia Basin Herald

June 9, 2025
Jane Goodall: ‘We must let local wisdom and science be ou… – observer.co.uk

Jane Goodall: ‘We must let local wisdom and science be ou… – observer.co.uk

June 9, 2025
Nanowires replace lost retinal cells – Science | AAAS

Nanowires Spark Breakthrough in Restoring Lost Retinal Cells and Vision

June 9, 2025
Cancer Daily Horoscope Today (June 21 – July 22) June 9, 2025: Lifestyle will improve! – India Today

Cancer Daily Horoscope for June 9, 2025: Get Ready for Exciting Lifestyle Changes!

June 9, 2025
Luka Mijatovic on Qualifying for World Champs: “I didn’t want to put any pressure on myself” – SwimSwam

Luka Mijatovic Opens Up on His World Championships Qualifying: “I Didn’t Want to Put Any Pressure on Myself

June 9, 2025
North America Creator Economy Market Size | CAGR of 19% – Market.us

North America’s Creator Economy Poised for Explosive 19% Growth

June 9, 2025

Three Trailblazing Women Celebrate Victory with Prestigious Frank Prize in Performing Arts

June 9, 2025

Categories

Archives

June 2025
MTWTFSS
 1
2345678
9101112131415
16171819202122
23242526272829
30 
« May    
Earth-News.info

The Earth News is an independent English-language daily published Website from all around the World News

Browse by Category

  • Business (20,132)
  • Ecology (678)
  • Economy (692)
  • Entertainment (21,597)
  • General (15,293)
  • Health (9,734)
  • Lifestyle (696)
  • News (22,149)
  • People (693)
  • Politics (699)
  • Science (15,911)
  • Sports (21,195)
  • Technology (15,679)
  • World (677)

Recent News

SunHydrogen Unveils Large Hydrogen Reactor in Houston – Fuel Cells Works

SunHydrogen Launches Groundbreaking Large-Scale Hydrogen Reactor in Houston

June 9, 2025
Coco Gauff Makes Request After French Open – Yahoo Sports

Coco Gauff’s Unexpected Request After French Open Exit

June 9, 2025
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

No Result
View All Result

© 2023 earth-news.info

Go to mobile version