theclaudenews.com
robots.txt
Robots Exclusion Standard data for theclaudenews.com
Resource Scan
Scan Details
Site Domain | theclaudenews.com |
Base Domain | theclaudenews.com |
Scan Status | Ok |
Last Scan | 2024-04-30T12:24:13+00:00 |
Next Scan | 2024-05-30T12:24:13+00:00 |
Last Scan
Scanned | 2024-04-30T12:24:13+00:00 |
URL | https://theclaudenews.com/robots.txt |
Redirect | https://www.theclaudenews.com/robots.txt |
Redirect Domain | www.theclaudenews.com |
Redirect Base | theclaudenews.com |
Domain IPs | 199.34.228.183 |
Redirect IPs | 199.34.228.183 |
Response IP | 199.34.228.183 |
Found | Yes |
Hash | 38efa0867a986fdd037e7e4ea0729f680c81f71a1828f0439ce55c8e3721ec2e |
SimHash | e8291904fc97 |
Groups
*
Rule | Path |
---|---|
Disallow | /s/search |
Disallow | /s/cart/ |
Disallow | /store/checkout |
Disallow | /store/status |
Disallow | /product/*/*/leave-review |
Other Records
Field | Value |
---|---|
sitemap | https://www.theclaudenews.com/sitemap.xml |