gogetquality.com
robots.txt

Robots Exclusion Standard data for gogetquality.com

Resource Scan

Scan Details

Site Domain gogetquality.com
Base Domain gogetquality.com
Scan Status Ok
Last Scan 2024-08-31T19:26:57+00:00
Next Scan 2024-09-30T19:26:57+00:00

Last Scan

Scanned 2024-08-31T19:26:57+00:00
URL https://gogetquality.com/robots.txt
Domain IPs 69.20.98.30
Response IP 69.20.98.30
Found Yes
Hash 8251d1b7d26d0b4e06885c39d75ca0370b5e78207047168fa3e99c380eb58aaa
SimHash a8149d034555

Groups

*

Rule Path
Disallow /

googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler

Rule Path
Disallow /App_Data/
Disallow /bin/
Disallow /Content/
Disallow /css/
Disallow /fonts/
Disallow /images/
Disallow /js/
Disallow /survey/
Disallow /GoGetQuality.asmx
Disallow /GoGetQuality.asmx.cs
Disallow /index_09_.html
Disallow /JpgImage.aspx
Disallow /JpgImage.aspx.cs
Disallow /newserver.txt
Disallow /send_mail.php
Disallow /404.html
Disallow /web.config
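
The effect of the two groups above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch under stated assumptions: the robots.txt text is reconstructed from the groups listed in this scan and abbreviated to a few agents and paths, not copied from the live file, and `SomeOtherBot` and `/images/logo.png` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed and abbreviated from the groups in this scan,
# not the raw file served by the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /

User-agent: googlebot
User-agent: msnbot
User-agent: slurp
Disallow: /App_Data/
Disallow: /bin/
Disallow: /images/
Disallow: /web.config
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

base = "https://gogetquality.com"

# A crawler named in the second group is only blocked from the listed paths.
listed_root = parser.can_fetch("googlebot", base + "/")
listed_images = parser.can_fetch("googlebot", base + "/images/logo.png")

# Any crawler not named falls back to the "*" group, which disallows everything.
other_root = parser.can_fetch("SomeOtherBot", base + "/")

print(listed_root, listed_images, other_root)
```

Under this reading, the named crawlers may fetch anything outside the listed directories and files, while every other crawler is blocked from the whole site.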

Comments

  • $Id: robots.txt,v 1.9.2.1 2017/07/26 20:12:19 goba Exp $
  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://gogetquality.com/robots.txt
  • Ignored: http://ggq.stagingsoftware.com/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • disallow all
  • but allow only important bots
  • Directories
  • Files
  • Sitemap
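
The comment above about file location can be illustrated with a short sketch: crawlers look for robots.txt only at the root of a host, so the lookup URL depends on the scheme and host of a page, not on its path. The helper name `robots_url` and the example page path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace the path and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical page path on the scanned host.
result = robots_url("http://gogetquality.com/survey/index.html")
print(result)
```

A robots.txt placed anywhere other than the path this helper returns would not be consulted for that host.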

Warnings

  • `http` is not a known field.