Home Tech Reddit to Update Web Standard to Block Automated Data Scraping From Its...

Tech

Reddit to Update Web Standard to Block Automated Data Scraping From Its Website

June 26, 2024

Social media platform Reddit said on Tuesday it will update a Web standard used by the platform to block automated data scraping from its website, following reports that AI startups were bypassing the rule to gather content for their systems.

The move comes at a time when artificial intelligence firms have been accused of plagiarizing content from publishers to create AI-generated summaries without giving credit or asking for permission.

Reddit said that it would update the Robots Exclusion Protocol, or “robots.txt,” a widely accepted standard meant to determine which parts of a site are allowed to be crawled.

The company also said it will maintain rate-limiting, a technique used to control the number of requests from one particular entity, and will block unknown bots and crawlers from data scraping – collecting and saving raw information – on its website.

More recently, robots.txt has become a key tool that publishers employ to prevent tech companies from using their content free-of-charge to train AI algorithms and create summaries in response to some search queries.

Last week, a letter to publishers by the content licensing startup TollBit said that several AI firms were circumventing the web standard to scrape publisher sites.

This follows a Wired investigation which found that AI search startup Perplexity likely bypassed efforts to block its Web crawler via robots.txt.

Earlier in June, business media publisher Forbes accused Perplexity of plagiarizing its investigative stories for use in generative AI systems without giving credit.

Reddit said on Tuesday that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.

Affiliate links may be automatically generated – see our ethics statement for details.

Source link

Reddit to Update Web Standard to Block Automated Data Scraping From Its Website

MOST READ NEWS

Faith Ladies unbeaten in Malta Guinness Women’s Premier League

Limited Voter Registration: NDC Supporters Being Disenfranchised Is Palpable Falsehood – EC | Politics

Adane Best goes ‘Topless’ on stage at 2024 African Legends Night [Video]

Are you aware that Ghana has obtained second spyware from QuaDream, another Israeli company?

Patapaa releases ‘My Grandfada’ song featuring Ada Gh (Check Out)

When You Skip School to Be A Slay Queen – Video...

Nollywood actress Nancy Isime explains why she started working at 17

Photos : DJ Lord Gives Patrons A Classic Black Party Experience

“I’m single” – Fella Makafui states in a trending video amidst...

Afenyo-Markin Warns NDC on Potential Pitfalls of Supermajority in Upcoming Parliament

Milwaukee Bucks crown champions of 2024 NBA Cup with dominant victory...

President Akufo-Addo Celebrates Ghana’s Democratic Progress and Global Diplomacy at Ministry...

United Cadres Front congratulates John Mahama, Opoku-Agyemang

Vim Lady Criticizes Captain Smart for Disrespectful Remarks About Business Mogul...

EVEN MORE NEWS

Afenyo-Markin Warns NDC on Potential Pitfalls of Supermajority in Upcoming Parliament

ORAL’s mandate is evidence gathering, not parallel prosecution – Ablakwa

Zelenskyy now voicing the reality that’s been apparent for a long...

POPULAR CATEGORY