Internet Archive Sued for Copyright Infringement

SAN FRANCISCO — The Internet Archive, a nonprofit that acts as a library with snapshots of old versions of websites, is being sued by a company that says the Archive has no right to store and make available pages that have been removed by their rightful owners.

Started in 1996, the Archive uses web-crawling bot programs to make copies of publicly accessible sites. The copies are then available for research purposes via a search tool called the Wayback Machine.

The site has so far accumulated 40 billion pages, about 1 petabyte, or 1 million gigabytes, of data and is growing at a rate of 20 terabytes per month. The Archive includes millions of pages from adult websites.

At the center of the current dispute is Philadelphia-based Healthcare Advocates, a company that recently lost a trade secrets lawsuit when attorneys for the defendant produced archived copies that showed the information in question had been made publicly available on a 1999 version of the company’s site.

The pages, Healthcare Advocates claims, were protected against unauthorized indexing and viewing by use of a robots.txt file, which are supposed to tell web crawlers when certain pages are not to be stored. The company says the Archive infringed its copyrights by not doing enough to block access to the pages.

In its suit, filed in U.S. District Court in Philadelphia, Healthcare Advocates said a representative of the Archive brushed off charges of wrongdoing and said the problem was probably caused by a glitch related to the robots.txt files and, therefore, was not the Archives concern.

Danny Sullivan of Search Engine Watch said he believes the Archive representative was right, adding that, while any outcome in the case is possible, he would be surprised if a judge doesn’t dismiss it summarily.

“Robots.txt is a voluntary opt-out option. It has no legal bearing,” Sullivan said.

If the court sides with the Archive, as Sullivan predicts, the decision could have far-reaching implications for adult webmasters who rely on nonbinding opt-out provisions of robots.txt to prevent search engines from copying and distributing their intellectual property.

Apparently, doing so is not as reliable as many might think. Attorneys for the defendant in the initial Healthcare Advocates case were able to access at least 92 pages that had supposedly been protected by robots.txt files.

And once a technology such as the Archive stores a page, webmasters may not have the right to make them disappear at a later date, for example, if they are lacking 2257 records for the models on the page.

Copyright © 2026 Adnet Media. All Rights Reserved. XBIZ is a trademark of Adnet Media.
Reproduction in whole or in part in any form or medium without express written permission is prohibited.

More News

SCOTUS Won't Hear Appeal in NYC Adult Businesses Zoning Case

The U.S. Supreme Court has declined to hear an appeal by a group of adult businesses of a lower court’s decision allowing enforcement of a 2001 zoning law aimed at forcing adult retail stores out of most parts of New York City.

AEBN Publishes Popular Searches for November, December

AEBN has published the top search terms for November and December from its straight and gay theaters in all 50 states and the District of Columbia.

X3 Expo Day 2 Delivers Stars, Screenings and Fan Favorites

The sun once again shone brightly on the historic Hollywood Palladium as throngs of avid fans made their way through the doors, ready to experience Day 2 of the 2026 X3 Expo.

X3 Expo Kicks Into Gear With an All-Star Lineup

Outside the historic Hollywood Palladium on Friday, a huge crowd of fans lined Sunset Boulevard, eagerly awaiting the opening of the 2026 X3 Expo and their big chance to meet the cream of the crop of adult stars.

2026 XBIZ Honors Salutes Resilience Across the Online Adult Industry

The 2026 XBIZ Honors packed house Wednesday night, turning the Kimpton Everly Hotel’s Nichols Ballroom into a gala celebration of industry excellence.

Elevated X Integrates CCBill for Payment Processing

Elevated X has added CCBill payment processing integration to its ELXNexus traffic management and affiliate software.

Florida Congressman Files Latest Bill to Repeal Section 230

Rep. Jimmy Patronis of Florida has become the latest member of Congress to propose legislation that would repeal Section 230 of the Communications Decency Act, which protects interactive computer services — including adult platforms — from liability for user-generated content.

Irish Parliamentary Committee Weighs Stricter AV Laws

The Irish national parliament’s Joint Committee on Arts, Media, Communications, Culture and Sport met Wednesday to discuss regulation of online platforms and improving online safety, including calls for stricter age verification by adult sites.

Ofcom Issues Guidance on Age Check Placement for Adult Sites

U.K. media regulator Ofcom on Wednesday published its recommendations for where and how adult sites should deploy age checks as required for compliance with the Online Safety Act.

Tubes Booster Launches Web Hosting Solutions

Content hosting platform Tubes Booster has launched two new hosting solutions.

Show More