Add a robots.txt file to your blog

Table of contents for Learning Robots.txt
  1. Add a robots.txt file to your blog
  2. Using Robots.txt to tell search engines what you want them to index

If you are on free hosting such as wordpress.com or blogspot, you don’t need to do this since they have already made this file for you.

However, if you are on paid hosting, you need to make a robots.txt file.

What is this robots.txt file?

I have limited knowledge about this so let me allow wikipedia to define what’s robots.txt.

The robots exclusion standard, also known as the Robots Exclusion Protocol or robots.txt protocol is a convention to prevent cooperating web spiders and other web robots from accessing all or part of a website which is, otherwise, publicly viewable. Robots are often used by search engines to categorize and archive web sites, or by webmasters to proofread source code. (Wikipedia)

Is this important?

Yes. Search engines use robots to crawl your sites. That means, it browses your files in your hosting. The robot crawls every bit of your site, public and private files all together. That will pose a big security risk for you since confidential data will be made available by the robots and be placed into the search engines. Imagine your password files can be found on google searches!

People visit your website through your web pages. What they are seeing are a result of html code that has been parsed/processed by your browser. Robots don’t have browser so they just crawl the files one by one. You have to restrict their access to your files. Robots should be limited only the files that you allow them to crawl.

So how do you limit these robots? By creating a robots.txt file.

How to create a robots.txt file?

  • You can write the contents of the robots.txt file yourself if you know how to code it.
  • You can check these websites out that will generate the contents for you.
    • http://www.mcanerin.com/EN/search-engine/robots-txt.asp
    • http://www.1-hit.com/all-in-one/tool-robots.txt-generator.htm
  • Use an external program to create the file
    • http://www.softsland.com/oven_fresh_robots_txt_maker.html
  • If you are using wordpress in your blog, these posts might prove useful.

The robots.txt file I’m using is what I copied from enblogopedia. http://www.enblogopedia.com/robots.txt

Related Items:

Best Practices for the Knowledge Society. Knowledge, Learning, Development and Technology for All: Second World Summit on the Knowledge Society, WSKS 2009, ... in Computer and Information Science)
Blogging All-in-One For Dummies
| how to upload robots txt into the blogspot | add robots txt to blogger | ROBOT TXT GENERATOR FOR BLOGGERS BLOG | robots code wikipedia | upload a robot file to blogspot | upload robots txt to wordpress com | upload robots txt wordpress com | Where do I put my robots txt file in my blog | where to put my robot txt file in blog | how to update robots txt on blogger | how to add robots txt for blogger blog | how to add robots txt file to my root directory | allow robots txt in blogger | best robot txt blogger | how do I put robots txt on a blogspot blog | how to add robot blogspot | how to add robot txt cpanel | how to add robot txt for your blog | how to add robot txt to blogger | how to add robots txt file on blogger
  • Back up your blog through Cpanel
  • Two reasons why my blog is unoptimized for search engines!
  • Using Robots.txt to tell search engines what you want them to index
  • OpenKore Error: unable to load file
  • How to file a Google Reinclusion Reconsideration Request
  • Posted in All on Blogging on Jul 16th, 2007 by Allen Gurrea   

    5 Responses

    1. tina (16 comments.)
      July 16th, 2007 | 4:34 pm

      ill remember this when i have my own. salamat sa tips

      Reply

    2. fionixe (20 comments.)
      July 18th, 2007 | 4:48 am

      just a question here, where would you put the robot.txt file?

      Reply

    3. Allen
      July 18th, 2007 | 9:30 am

      @Tina – no problem

      @fionixe – You put the robots.txt in the root directory. Usually under public_html. =)

      Reply

    4. ice (1 comments.)
      November 7th, 2008 | 12:03 pm

      My googlebot is blocked by my robots.txt I want to update the file but I don’t know how to upload the robots.txt to my blogspot account.

      Can you please help me how to FTP the file to my http://maalamat.blogspot.com

      Thanks.

      ices last blog post..The Ego has Landed

      Reply

    5. silkenhut (100 comments.)
      November 7th, 2008 | 12:35 pm

      @Ice – I’m not sure if you can upload a robots.txt file when you are hosted in blogger. But as far as I know, blogger won’t block Google Bot since they are just from the same company. :)

      Reply

    Leave a reply

     
    | |

    Powered by Yahoo! Answers