Subscribe to BlogGet all of our latest industry tips by email
Recent posts
- SEO in 2022
- New Google Privacy Changes – Digital Delight or Data Nightmare?
- Social Media for the Gambling Sector
- Search Plus Your World – What is it and what are its Implications?
- Google Offer Extensions Impact On Footfall
- Mission Accomplished (Sort of…) #1Man1Mission3
- Social Media Analytics – Are you tracking your interactions?
- Social media doesn’t need to be complicated
- The Future of Search: Reviewed
- 4Ps Marketing 2012 Social Media Outlook
Related Posts
BY CATEGORY
BY AUTHOR
- Aatif Basheer (1)
- Alina Katchi (10)
- Ashley Lindley (19)
- Chris Hyland (30)
- Eleanor Steele (22)
- Emma Pilcher (2)
- Gerard Harris (108)
- Hannah Miller (177)
- Ibeth Santos (3)
- Jack Mclaren (38)
- James Cornwall (11)
- James Green (7)
- Kia McSween (3)
- Laura Pagani (3)
- Luke Knight (14)
- Matt Stannard (16)
- Matthew Phelan (77)
- Nicola McAdam (2)
- Peter O'Neill (1)
- Rachel Kneen (146)
- Robert Laver (1)
- Serena Wong (1)
- Stephanie Villegas (45)
The robots.txt file checker
robots.txt history
Web robots, also known as web crawlers or web spiders, are programs that Search engines use to roam the web automatically to index its content. In the early 90s there were occasions where robots visited servers where they weren’t welcome for a number of reasons:
• swamping servers with requests reducing performance
• retrieved the same files repeatedly
• indexing duplicated content or temporary information
• scanning for email address to send spam
These incidents highlighted a need to develop the means to restrict what the robots could index.
This led to the creation of a file on your server which dictates where the robots can’t go. The file has to be accessible via HTTP on the local URL “/robots.txt“, this can be easily implemented and the robot can find tell where it’s not allowed to go by retrieving one single document.
robots.txt basics, covered in an earlier post, can be found here: Robots.txt – A Rough Guide
robots.txt checker
To ensure there are no errors in your robots.txt file, instead of wading through lines of html code, you can use this handy robots.txt checker tool from motoricera, an Italian non-profit SEO site.
In their own word: ‘This robots.txt checker is a “validator” that analyzes the syntax of a robots.txt file to see if its format is valid as established by Robot Exclusion Standard (please read the documentation and the tutorial to learn the basics) or if it contains errors. The validation process takes in account both Robots Exclusion Standard rules and spider-specific (Google, Inktomi, etc.) extensions (including the new “Sitemap” command).’
While this is a neat little tool that will save you some time checking your robots.txt file, if it does uncover errors that could mean your site is not being correctly indexed and could be affecting your presence in the web. Unless you know your html I wouldn’t recommend fiddling with your robots.txt yourself rather get in touch with a professional.
Save on Delicious