Internet Archive Sued for Copyright Infringement

SAN FRANCISCO — The Internet Archive, a nonprofit that acts as a library with snapshots of old versions of websites, is being sued by a company that says the Archive has no right to store and make available pages that have been removed by their rightful owners.

Started in 1996, the Archive uses web-crawling bot programs to make copies of publicly accessible sites. The copies are then available for research purposes via a search tool called the Wayback Machine.

The site has so far accumulated 40 billion pages, about 1 petabyte, or 1 million gigabytes, of data and is growing at a rate of 20 terabytes per month. The Archive includes millions of pages from adult websites.

At the center of the current dispute is Philadelphia-based Healthcare Advocates, a company that recently lost a trade secrets lawsuit when attorneys for the defendant produced archived copies that showed the information in question had been made publicly available on a 1999 version of the company’s site.

The pages, Healthcare Advocates claims, were protected against unauthorized indexing and viewing by use of a robots.txt file, which are supposed to tell web crawlers when certain pages are not to be stored. The company says the Archive infringed its copyrights by not doing enough to block access to the pages.

In its suit, filed in U.S. District Court in Philadelphia, Healthcare Advocates said a representative of the Archive brushed off charges of wrongdoing and said the problem was probably caused by a glitch related to the robots.txt files and, therefore, was not the Archives concern.

Danny Sullivan of Search Engine Watch said he believes the Archive representative was right, adding that, while any outcome in the case is possible, he would be surprised if a judge doesn’t dismiss it summarily.

“Robots.txt is a voluntary opt-out option. It has no legal bearing,” Sullivan said.

If the court sides with the Archive, as Sullivan predicts, the decision could have far-reaching implications for adult webmasters who rely on nonbinding opt-out provisions of robots.txt to prevent search engines from copying and distributing their intellectual property.

Apparently, doing so is not as reliable as many might think. Attorneys for the defendant in the initial Healthcare Advocates case were able to access at least 92 pages that had supposedly been protected by robots.txt files.

And once a technology such as the Archive stores a page, webmasters may not have the right to make them disappear at a later date, for example, if they are lacking 2257 records for the models on the page.

Copyright © 2026 Adnet Media. All Rights Reserved. XBIZ is a trademark of Adnet Media.
Reproduction in whole or in part in any form or medium without express written permission is prohibited.

More News

Final IRS 'No Tax on Tips' Rule Excludes Pornography

The Internal Revenue Service on Monday published final regulations on the “No Tax on Tips” provision included in the “One Big Beautiful Bill Act,” offering new tax deductions for tip workers but excluding revenue received for “pornographic activity.”

Pennsylvania Legislature Weighs 'Porn Tax' Bill

The Pennsylvania State Senate is considering a bill that would impose a 10% tax on the revenue of adult websites doing business in that state.

Trump Tariffs Refund Process to Launch April 20

WASHINGTON — U.S. Customs and Border Protection (CBP) will begin the process of refunding duties paid under the Trump administration’s sweeping program of tariffs by providing, starting April 20, an online tool for submitting refund claims.

BranditScan Rolls Out 2 New Platform Features

BranditScan has introduced its new Traffic Optimization and Doxing Protection features for creators.

NMG Management Partners With Cosplayground to Scale Distribution

NMG Management has partnered with Cosplayground to expand the studio’s digital distribution and licensing operations.

Dreamcam Adds Real-Time Speech Translation

Dreamcam has introduced Voice Translator AI to its livestreaming platform.

UK Government May Limit 'Step' Porn Ban With New Amendments

The U.K. Ministry of Justice on Friday revealed new government amendments to the pending Crime and Policing Bill, potentially limiting a planned ban on “step” content to apply only if adult performers role-play as minors.

Arizona Senate Removes 'Catch-22' Provision From Consent Bill

The Arizona State Senate has amended a bill that would impose new requirements for adult content uploaded online, removing a seemingly contradictory provision that could have effectively made it impossible for adult sites to operate in the state.

Climaxx Media Launches Networking Platform

Climaxx Media has officially launched its new networking platform.

Italian Court in Aylo Case Limits International Reach of AV Rules

An Italian administrative court has ruled that Italy’s recently-enacted age verification rules for adult content may not currently be enforced against sites based in other EU member states, pending further procedural action under the EU’s Directive on Electronic Commerce.

Show More