Follow @pkumar54992987 badge

Monday, 1 April 2013

Using Robots.txt in your Website, Wordpress, Blog etc

One simple thing you can do help to website crawlers crawl your site is have a robots.txt file. This is especially important if you have sections of your website you do not want indexed by search engines.
In addition your robots.txt file can store the location of your site’s sitemap, making it easy for crawlers to find and crawl every page on your site.
“The robots exclusion standard, also known as the Robots Exclusion Protocol or robots.txt protocol is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is, otherwise, publicly viewable.” – WikiPedia
robots.txt files are quite simple and easy to create.
An example robots.txt file:
User-agent: *Disallow: /tmp/Disallow: /private/Sitemap: http://www.example.com/sitemap.xml.gz
The first line file “User-agent: *” tells crawlers that any crawler can crawl the site.
The next 2 lines tell crawlers not to crawl anything in the tmp and private folders.
The last line tells the crawler where to find the site’s sitemap.
A robots.txt file is always found in the top level directory on your domain.
Example: http://www.example.com/robot.txt
I used to create my own robots.txt file but I recently found a site that will generate one for me for free:
http://www.mcanerin.com/EN/search-engine/robots-txt.asp
Read more about robots.txt:

http://en.wikipedia.org/wiki/Robots.txt
SEO Checklist

Learn the basics of HTML
Choose your keywords
Find what other sites are competing for those keywords
Put keyword phrase in <title></title>, <h1></h1>, and <h2></h2> tags.
Include an image on the page with the file name & the alt attribute containing your keyword phrase
Make sure your keyword phrase is found at the very bottom of your page.
Embed your keyword phrase throughout your page at a rate of 1% to 2%.
Have your keyword phrase in domain and in the page’s file name.
Create your robots.txt file.
Other Suggested SEO Resources

Learn More about PageRank
http://www.seofaststart.com/blog/why-google-cant-just-dump-pagerank
Google Basic Searching
http://www.google.com/help/basics.html
Google Advanced Search
http://www.google.com/help/operators.html
Interpreting Google Search Results
http://www.google.com/help/interpret.html
Google Webmaster Central
http://www.google.com/webmasters/
Google SEO Basics
http://www.interspire.com/content/articles/13/1/Google-SEO-Basics-for-Beginners
WikiPedia & SEO
http://en.wikipedia.org/wiki/Search_engine_optimization
Google’s keyword search tool
https://adwords.google.com/select/KeywordToolExternal