hoards.com
robots.txt
Robots Exclusion Standard data for hoards.com
Resource Scan
Scan Details
Site Domain | hoards.com |
Base Domain | hoards.com |
Scan Status | Ok |
Last Scan | 2024-10-15T11:28:35+00:00 |
Next Scan | 2024-11-14T11:28:35+00:00 |
Last Scan
Scanned | 2024-10-15T11:28:35+00:00 |
URL | https://hoards.com/robots.txt |
Domain IPs | 70.35.197.40 |
Response IP | 70.35.197.40 |
Found | Yes |
Hash | 73ec2b05e566c1a647b742be9abfbd9c68b785c44d9648f240d7240cf33c61c0 |
SimHash | 000549608633 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /control |
Disallow | /dev.tools |
Disallow | /cache |
Disallow | /engines |
Disallow | /extras |
Disallow | /importApi |
Disallow | /inc |
Disallow | /js |
Disallow | /post.by.email |
Disallow | /run |
Disallow | /wcp |
Disallow | /themes |
Disallow | /wcp.v4 |
Disallow | /wysiwyg |
Disallow | /z |
Disallow | /Zend |
Disallow | /themes |
Disallow | /openx |
Allow | /dating |
Allow | /*.html$ |
Allow | /*.xml$ |
Other Records
Field | Value |
---|---|
sitemap | https://hoards.com/news.xml |
Comments