thescxchange.com
robots.txt
Robots Exclusion Standard data for thescxchange.com
Resource Scan
Scan Details
Site Domain | thescxchange.com |
Base Domain | thescxchange.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-04T15:46:24+00:00 |
Next Scan | 2025-01-03T15:46:24+00:00 |
Last Successful Scan
Scanned | 2024-09-06T12:15:24+00:00 |
URL | https://thescxchange.com/robots.txt |
Redirect | https://www.thescxchange.com/robots.txt |
Redirect Domain | www.thescxchange.com |
Redirect Base | thescxchange.com |
Domain IPs | 208.91.62.10, 208.91.62.11, 208.91.62.12, 208.91.62.13 |
Redirect IPs | 208.91.62.10, 208.91.62.11, 208.91.62.12, 208.91.62.13 |
Response IP | 208.91.62.10 |
Found | Yes |
Hash | 5d1c07ffc7c077024189220fcf8be72ba8ce6bc6421d290bd84a2da1c6db2c1e |
SimHash | eb9c0f1df652 |
Groups
*
Rule | Path |
---|---|
Disallow | /comments/flag/ |
Disallow | /search |
Disallow | /articles/comment/abuse |
Disallow | /articles/email |
Disallow | /articles/preview |
Disallow | /articles/print |
Disallow | /products/email |
Disallow | /products/print |
Disallow | /cart |
Disallow | /user/* |
Disallow | /*/log_view |
Disallow | /query/* |
Disallow | /media/video/ |
Allow | /media/videos/play/* |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.thescxchange.com/sitemap.xml |
Comments