educational

Robot Wars

Stephen Yagielowicz Apr 25, 2002

No, this article is not about one of those increasingly popular television shows that feature large metallic automatons bashing each other into submission with heavy, spinning, pointy things. Rather, it is a hands-on look at ways in which Webmasters can control Search Engine Spiders visiting their sites:

As is the case with all such articles, I must begin with my usual 'I am not a techno-geek, so take all of this advice with a big grain of salt, and use these techniques at your own risk' disclaimer. Having said that, this is an inside look at an often misunderstood application: the 'robots.txt' file. This is a simple text document that can help keep surfers from finding and directly entering your protected members area as well as other 'sensitive' areas of your site, and help focus attention on those parts of your site that need it and are prepared to handle it.

To make this easier to understand, consider many of the search results listings you've seen. Oftentimes the pages that you are directed to are not the site's home pages, but often 'inside' pages that can easily be taken out of context — or even out of framesets, hampering navigation and the natural 'flow' of information that the site's designer intended. Free site owners, for one example, do not really want people hitting their galleries directly, bypassing their warning pages, FPAs and other marketing tools; yet without specific instructions to the contrary, SE spiders are more than happy to provide direct links to these areas. These 'awkward' results can be avoided and manipulated through the use of the robots.txt file.

The Robots Exclusion Protocol
The mechanics of spider manipulation are carried out through the "Robots Exclusion Protocol," which allows Webmasters to tell visiting robots which areas of the site they should, and should not, visit and index. When a spider enters a site, the first thing it does is check the root directory for the robots.txt file. If it finds this file, it will attempt to follow the instructions included in it. If it doesn't find this file, it will have its way with your site, according to the parameters of the spider's individual programming.

It is vitally important that this robots.txt file be placed in your domain's root directory, i.e.: https://pornworks.com/robots.txt and should not be placed in any other sub-directory, such as https://pornworks.com/galleries/robots.txt — since it (unlike .htaccess files) won't work there because the robot simply won't look for it there, or obey it even if it finds this file outside your site's domain root directory. While I won't promise you this, that appears to mean that free-hosted and other sites that are not on their own domain will not be able to use this technique.

These non-domain sites do have an available option, however, in the use of the robots META tag. While not universally accepted, its use by spiders is now quite commonplace, and provides an alternative for those without domain root access. Here's the code:

META name="robots" content="index,follow">

META name="robots" content="noindex,follow">

META name="robots" content="index,nofollow">

META name="robots" content="noindex,nofollow"> Each listing must be on a separate line, is case-sensitive, and cannot contain blank spaces.

These four META tags illustrate the possibilities, and tell the spider whether or not to index the page this tag appears on, and whether or not to follow any links it finds on the page that this tag appears on. Of these four examples, only one should be used, and placed within the document's HEAD /HEAD tag. While some Search Engines may recognize additional parameters within these tags, the listed examples detail the most commonly accepted values. For those site's with domain root access, a simple robots.txt file is formatted thusly (but should be modified to suit your site's individual needs and directory structure):

User-agent: *
Disallow: /cgi-bin/
Disallow: /htsdata/
Disallow: /logs/
Disallow: /admin/
Disallow: /images/
Disallow: /includes/

In the above example, all robots are instructed to follow the file's instructions, as indicated by the "User-agent: *" wildcard. More advanced files could tailor the robot's actions according to its source, for example, individual spiders could be limited to those pages that are specifically optimized for the Search Engine that sent them, a subject well beyond the scope of this article, but perhaps the subject of a future follow-up.

Back to the above example, the 'Disallow:' command tells the robot not to enter or index the contents of the directories that follow this command. Each listing must be on a separate line, is case-sensitive, and cannot contain blank spaces. The rest of the site is now free for the robot to explore and index.

I hope this brief tutorial helps you to understand how robots interact with your site, and allows you to gain a degree of control over their actions. If you have any questions or comments about these techniques, click on the link below. ~ Stephen

Copyright © 2025 Adnet Media. All Rights Reserved. XBIZ is a trademark of Adnet Media.
Reproduction in whole or in part in any form or medium without express written permission is prohibited.

opinion

What Adult Businesses Need to Know About Florida's Age Verification Law

The rise and proliferation of age verification laws has changed the landscape for the online adult industry. A recent and compelling example is the state of Florida, where Attorney General James Uthmeier has filed multiple complaints against major platforms as well as affiliates accused of violating the state’s AV law.

Corey D. Silverstein · Dec 2, 2025

opinion

Maintaining Brand Trust in the Face of Negative Press

Over the last year, several of our merchants have found themselves caught up in litigation over compliance with state age verification laws. Recently, Segpay itself was pulled into the spotlight, facing scrutiny over Florida’s AV statute, HB 3. These stories inevitably get picked up by both industry and mainstream news outlets.

Cathy Beardsley · Nov 25, 2025

opinion

How to Switch Payment Processors Without Disrupting Business

For many merchants, the idea of switching payment processors can feel pretty overwhelming. That’s understandable. After all, downtime can stall sales, recurring subscriptions can suddenly fail, or compliance gaps can put accounts at risk. Operating in a high-risk sector like the adult industry can further amplify the stress of transition.

Jonathan Corona · Nov 21, 2025

profile

WIA Profile: Katie

Katie is the ultimate girl’s girl. As community manager at Chaturbate, she answers DMs, remembers names, and shows up for creators and fellow businesswomen when it counts. She’s quick to credit the people around her, and careful to make space for others in every room she enters.

Women in Adult · Nov 10, 2025

opinion

How to Stay Legally Protected When Policies Get Outdated

The adult industry has long operated in a complex legal environment subject to rapid change. Now, a confluence of age verification laws, lawsuits, credit card processing and data privacy rules has created an urgent need for all industry participants — from major platforms to independent creators — to review and potentially overhaul their legal and operational policies.

Corey D. Silverstein · Nov 7, 2025

opinion

From Compliance Chaos to Crypto Clarity: Making the Case for Digital Payments in Adult

These are uncertain times for adult merchants. With compliance tightening and age verification mandates rising, the barrier to entry keeps getting higher.

Cathy Beardsley · Nov 3, 2025

opinion

Real-Time Insights to Streamline E-Payments and Stop Lost Sales

A slow checkout process is more than just annoying — it’s expensive. In a high-risk sector like the adult industry, even small delays or declined transactions can cost businesses thousands in lost revenue every month.

Jonathan Corona · Oct 31, 2025

profile

FSC's Valentine Leads Charge for Sex Worker Rights and Financial Access

Before ever stepping into a courtroom, Valentine already understood the power of presence. After all, they’ve shimmied on stages as a burlesque performer, consulted behind the scenes for creative businesses and moved through the adult industry not just as an advocate, but as a participant.

Jackie Backman · Oct 27, 2025

opinion

Breaking Down HB 805 and How it Affects the Adult Industry

North Carolina House Bill 805 was enacted July 29, after the state legislature overrode Governor Josh Stein’s veto. The provisions that relate to the adult industry, imposing requirements for age verification, consent and content removal, are scheduled to become effective Dec. 1. Platforms have until then to update their policies and systems to comply with the new regulations.

Corey D. Silverstein · Oct 6, 2025

opinion

Staying Compliant With Payment Standards Across Europe and Australia

So, you’ve got your eye on international growth. Smart move. No matter where adult-industry merchants operate, however, one requirement remains consistent: regulatory compliance. This isn’t just a legal checkbox — it’s a critical component of keeping payments flowing and business operations intact.

Jonathan Corona · Oct 3, 2025

Movies & Stars

Web & Tech

Pleasure & Retail

Gay

Cams & Clips

Robot Wars

More Articles

What Adult Businesses Need to Know About Florida's Age Verification Law

Maintaining Brand Trust in the Face of Negative Press

How to Switch Payment Processors Without Disrupting Business

WIA Profile: Katie

How to Stay Legally Protected When Policies Get Outdated

From Compliance Chaos to Crypto Clarity: Making the Case for Digital Payments in Adult

Real-Time Insights to Streamline E-Payments and Stop Lost Sales

FSC's Valentine Leads Charge for Sex Worker Rights and Financial Access

Breaking Down HB 805 and How it Affects the Adult Industry

Staying Compliant With Payment Standards Across Europe and Australia

Events

XBIZ.net

Don't Miss a Beat

More Articles