Search
Recommended Products
Related Links


 

 

Informative Articles

Dead domains - how to give them a decent burial
If you've registered a domain name, built a web site and have had visitors but no sales, you might feel like abandoning the site and the domain.  Find out why there is value in your domain names and what to do about it. How many...

How to Choose a Web Hosting Company?
Nowadays, anybody can afford to order a feature-packed web hosting plan for next to nothing; however, affordable price is not the only criterion which should be considered when purchasing web hosting services. In this article, I'll explain the...

One In A Million - Make Your Site Stand Out
"One in a million". What do I mean by that? Well, for those of you who have dabbled in internet businesses such as affiliate programs and network marketing businesses, you would know that these programs generally provide an entire website for you to...

Spam Fighter's Toolkit
Fighting the never ending tide of spam mail can turn into a very frustrating experience if you don't know the real tricks of the trade. After all, there's a whole lot more to it than simply responding to a (usually bogus) From: address with a...

TRANSLATING YOUR WEB
Considering that 43% of Internet users cannot read English, translating your web site into a foreign language is a powerful way to promote your business. But getting your web site translated can be an expensive exercise. Rates can vary between 0.04...

 
The Proper Way To Use The robots.txt File Update

In my last article about the robots.txt file I had spelled it wrong. It should have been robots.txt instead of robot.txt. The article should read like this:

When optimizing your web site most webmasters don’t consider using the robots.txt file.This is a very important file for your site. It let the spiders and crawlers know what they can and can not index. This is helpful in keeping them out of folders that you do not want index like the admin or stats folder.

Here is a list of variables that you can include in a robot.txt file and there meaning:

1) User-agent: In this field you can specify a specific robot to describe access policy

for or a “*” for all robots more explained in example.

2) Disallow: In the field you specify the files and folders not to include in the crawl.

3) The # is to represent comments

Here are some examples of a


robots.txt file
User-agent: *
Disallow:

The above would let all spiders index all content.

Here another example

User-agent: *
Disallow: /cgi-bin/

The above would block all spiders from indexing the cgi-bin directory.

User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/

In the above example googlebot can index everything while all other spiders can not index admin.php, cgi-bin, admin, and stats directory. Notice that you can block single files like admin.php.

About The Author

Jimmy Whisenhunt is the webmaster at VIP Enterprises http://www.vipenterprises.org.
vipenter@vipenterprises.org