hoards.com
robots.txt

Robots Exclusion Standard data for hoards.com

Resource Scan

Scan Details

Site Domain hoards.com
Base Domain hoards.com
Scan Status Ok
Last Scan2024-10-15T11:28:35+00:00
Next Scan 2024-11-14T11:28:35+00:00

Last Scan

Scanned2024-10-15T11:28:35+00:00
URL https://hoards.com/robots.txt
Domain IPs 70.35.197.40
Response IP 70.35.197.40
Found Yes
Hash 73ec2b05e566c1a647b742be9abfbd9c68b785c44d9648f240d7240cf33c61c0
SimHash 000549608633

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /control
Disallow /dev.tools
Disallow /cache
Disallow /engines
Disallow /extras
Disallow /importApi
Disallow /inc
Disallow /js
Disallow /post.by.email
Disallow /run
Disallow /wcp
Disallow /themes
Disallow /wcp.v4
Disallow /wysiwyg
Disallow /z
Disallow /Zend
Disallow /themes
Disallow /openx
Allow /dating
Allow /*.html$
Allow /*.xml$

Other Records

Field Value
sitemap https://hoards.com/news.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Disallow:/files