SE Optimization: Creating Robots.txt File and its Importance

Written by San Christopher

If you think you have built a truly great website for the search engines, keyword-rich, full of unique content and fully optimized, and an attractive one for visitors too, that's fine. But do you know you may be missing something? A robots.txt file. Did you include it? And by the way, do you know why a robots.txt file is important?

The success of big companies lies in keeping their confidential data secret, hidden from all. They tell the world one thing and do another. This lets them carry out their plans easily and change them as the situation demands. The job of a robots.txt file is much the same: it can allow or disallow a search engine to visit some or all of your web pages. A human visitor, of course, is still free to visit these pages. That being the case, your website may look different to the search engines than it does to a visitor. If you think one or more of your pages or files should not be visited by a particular search engine, or by any of them, you can arrange that.

This is not recommended, though: your website should be built so that it has nothing to hide from the search engines. Nevertheless, it is always better to know the basics of writing a robots.txt file, and, as we will discuss further down, the file is important. I repeat: don't make pages that you think should be hidden from the search engines. If a search engine thinks you are up to some tricks, it may penalize your site and leave it unranked, in the worst case forever!

Every search engine has a "robot" (a software program) that does the job of visiting websites. Its purpose is to "know" a website: what it is all about, and what information it holds. Search engine robots gather this information and bring it back to their databases so that it can be shown in search results. So if your site is not in a search engine's database, it never shows up in that engine's search results.

Web robots are sometimes referred to as web crawlers or spiders, so the process of a robot visiting your website is called "spidering" or "crawling". When somebody says "the search engines have spidered my website," it means the search engine robots have visited that website. Each robot is known by a name and has its own IP address. The IP address is of no importance to us, but knowing the names will help, since these names are used when we create a robots.txt file. This is why the file is called "robots.txt." Given below is a list of the robots of some of the very popular search engines:

Search Engine - Robot
Alexa.com - ia_archiver
Altavista.com - Scooter (Bought by Yahoo)
UK.Altavista.com - AltaVista-Intranet (Bought by Yahoo)
Alltheweb.com - FAST-WebCrawler (Bought by Yahoo)
Excite.com - ArchitextSpider
Euroseek.net - Arachnoidea
Gendoor.com (Genealogical Search Engine) - GenCrawler
Google.com - Googlebot (http://www.google.com/bot.html)
Hotbot.com (uses Inktomi's robot) - Slurp
Inktomi.com - Slurp (slurp@inktomi.com) (Bought by Yahoo)
Infoseek.com - UltraSeek
Looksmart.com - MantraAgent
Lycos.com - Lycos_Spider_(T-Rex)
Northernlight.com - Gulliver
Nationaldirectory.com - NationalDirectory-SuperSpider
UKSearcher.co.uk - UK Searcher Spider

Writing Robots.txt:

Let's learn to write robots commands. There are two ways to write them. One is to put all the commands in a text file called "robots.txt"; the other is to write the robots command in a meta tag.

We will learn both ways.

Writing robots command in Meta tag:

There are 4 things you can tell a search engine robot when it requests (visits) your page:

1) Do not index this page - the search engines will not index the page.
2) Do not follow any links on this page - the search engines will not follow the links included in the page, i.e. they will not index any page that this page links to.
3) Do index this page - the search engines will index the page.
4) Do follow the links - the search engines will index the pages that this page links to.

Note that "indexing" is different from "spidering". A search engine first spiders a page and then indexes it. Indexing means giving the page a certain importance on the basis of its content, information, meta tags and link popularity with respect to the searched keyword. All this is decided at run time. When you tell the search engines not to index a page, they know that the page exists but do not rank it. That is, a no-index page will never be shown in their search results. This does not mean a no-index page will get no visitors: it may still get visitors indirectly, from a page that links to it. It simply gets no direct visitors from the search engines.

Suppose you want the search engines to index a page and also follow its links (that is, index its linked pages). Then include the following meta tag:

<meta name="robots" content="index,follow">

Suppose you want the search engines to index a page but not follow its links. Then include the following meta tag:

<meta name="robots" content="index,nofollow">

Suppose you do not want the search engines to index a page but do want them to follow its links. Then include the following meta tag:

<meta name="robots" content="noindex,follow">

Suppose you do not want the search engines to either index a particular page or follow its links. Then include the following meta tag:

<meta name="robots" content="noindex,nofollow">

Note: Google keeps a "Cached" copy of every page it spiders. It's a snapshot of the page. Want to stop Google from doing so? Include the following meta tag:

<meta name="robots" content="noarchive">

Like any meta tag, the tags written above should be placed in the HEAD section of an HTML page:

<head>
<title>your title</title>
<meta name="robots" content="index,follow">
</head>
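To get a feel for how a robot might read such a tag, here is a minimal Python sketch that uses only the standard library. It is not how any real search engine works internally; the class name, the function name and the sample page are all made up for illustration, and the sketch assumes that a page without a robots meta tag may be both indexed and followed.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives of a <meta name="robots"> tag, if the page has one."""

    def __init__(self):
        super().__init__()
        self.directives = None  # stays None when no robots meta tag is found

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        if (attributes.get("name") or "").lower() == "robots":
            content = attributes.get("content") or ""
            # Split "noindex, follow" into {"noindex", "follow"}.
            self.directives = {
                part.strip().lower() for part in content.split(",") if part.strip()
            }


def robots_policy(html):
    """Return (may_index, may_follow) for a page; both are True by default (assumption)."""
    parser = RobotsMetaParser()
    parser.feed(html)
    directives = parser.directives or set()
    return ("noindex" not in directives, "nofollow" not in directives)


# Hypothetical page, used only for this example.
page = """<html><head><title>your title</title>
<meta name="robots" content="noindex,follow">
</head><body>Hello</body></html>"""

print(robots_policy(page))  # prints (False, True)
```

For the sample page the result says: do not index this page, but do follow its links.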

Creating robots.txt file:

A robots.txt file is an independent file and should be written in a plain text editor like Notepad. Do not use MS Word or any other word processor to create it. The bottom line is that this file must be plain text with the extension ".txt", or else it will be useless.

Let's begin. Open Notepad (it comes free with Microsoft Windows) and save the file with the name "robots.txt". Make sure that the extension is .txt.
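If you would rather create the file from a script than from Notepad, the same step can be sketched in Python. This is only an illustration: the function name is made up, the file is written to a temporary folder rather than to a real website, and the two lines of rules (which let every robot visit everything) are an assumption, since the rule syntax has not been covered yet.

```python
from pathlib import Path
import tempfile

# Assumed rules for illustration: every robot may visit every page.
RULES = "User-agent: *\nDisallow:\n"


def write_robots_txt(directory, rules=RULES):
    """Save the rules as plain text in a file named exactly robots.txt."""
    target = Path(directory) / "robots.txt"
    target.write_text(rules, encoding="utf-8")
    return target


# A temporary folder stands in for the top-level folder of a website.
with tempfile.TemporaryDirectory() as site_root:
    robots_file = write_robots_txt(site_root)
    print(robots_file.name)
    print(robots_file.read_text(encoding="utf-8"))
```

Robots look for the file under that exact name at the top level of a site, so on a real server it would go next to your home page.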

By the way, did you notice that we did not use the name of any robot in the meta tag? What does that indicate? Simple: with a meta tag you direct all the search engines to do, or not do, something on a page. You do not have control over any one search engine. The solution is robots.txt.
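As a rough illustration of that per-robot control, the sketch below feeds a small robots.txt to Python's standard urllib.robotparser module and asks which robots may visit which pages. The rules, the /drafts/ folder, the may_visit helper and the example.com addresses are assumptions made up for this example; only the robot names Googlebot and Slurp come from the list of robots given earlier.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep Googlebot out of /drafts/, leave all other robots unrestricted.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /drafts/

User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def may_visit(robot_name, url):
    """Ask the parsed rules whether the named robot may fetch the URL."""
    return parser.can_fetch(robot_name, url)


print(may_visit("Googlebot", "http://www.example.com/drafts/page.html"))  # False
print(may_visit("Googlebot", "http://www.example.com/index.html"))        # True
print(may_visit("Slurp", "http://www.example.com/drafts/page.html"))      # True
```

Unlike the meta tag, the rules here name one robot and restrict only that robot, while every other robot remains free to visit the whole site.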

It can always happen that you do not want a particular search engine to index a page, for whatever reason. In that case a robots.txt file will help, even though I do not recommend such a thing. The search engines get you traffic, so why hate them? Stop them from doing their job and they will hate you. I repeat: keep your pages smart for the search engines and welcome them. Fine, then why take the trouble to learn robots.txt? Why should you include a robots.txt file at all?

Search Engines: Different Types, Different Strategies

Written by Terry Nicholls

There are four basic types of Search Engines:

Free Search Engines

Pay-For-Inclusion Search Engines

Pay-Per-Click (PPC) Search Engines

Directories

Because each type does things a little differently, you need to adapt your strategy to take advantage of their differences.

Free Search Engines

You can submit your pages to these engines free, but be careful. You must make sure not to over-submit (submit too often) or you'll be banned and never get listed.

Always check to see if your site is listed before submitting it.

Pay-For-Inclusion Search Engines

With this type of Search Engine, you pay to have your web site listed in their database. Pay-for-inclusion Search Engines (and the paid section of free engines) are a quick way to get listed in some major databases -- for a price, literally. The cost varies from engine to engine.

The advantages are threefold:

Faster inclusion into Search Engine's index.

Repeated, regular spiderings.

Guaranteed continuous inclusion.

Pay-Per-Click (PPC) Search Engines

Pay-per-click Search Engines allow you to bid for keyword placement. For example, if one of your pages focuses on the topic of "fashion models," you can bid for the #1 (or any other number) placement on the first page of search results. You only pay when someone actually clicks on your ad.
