cutimes.com
robots.txt
Robots Exclusion Standard data for cutimes.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cutimes.com |
Base Domain | cutimes.com |
Scan Status | Ok |
Last Scan | 2024-11-18T01:09:46+00:00 |
Next Scan | 2024-11-25T01:09:46+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-18T01:09:46+00:00 |
URL | https://cutimes.com/robots.txt |
Redirect | https://www.cutimes.com/robots.txt |
Redirect Domain | www.cutimes.com |
Redirect Base | cutimes.com |
Domain IPs | 104.18.30.177, 104.18.31.177, 2606:4700::6812:1eb1, 2606:4700::6812:1fb1 |
Redirect IPs | 104.18.30.177, 104.18.31.177, 2606:4700::6812:1eb1, 2606:4700::6812:1fb1 |
Response IP | 104.18.30.177 |
Found | Yes |
Hash | e0b171505aba0026cffc5d747c59729a4b7407a51896a27fe14ed1a86c142289 |
SimHash | 6d04dc70cd91 |
Groups
*
Rule | Path |
---|---|
Disallow | /*/?printer-friendly |
Disallow | (empty) |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://www.cutimes.com/sitemap.xml |
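
The records above can be read back programmatically. The following is a minimal sketch, not the scanner's own method: `ROBOTS_TXT` is a reconstruction of the file from the tables in this report (an assumption, since the raw file is not shown here), and `rule_matches` is a hypothetical helper that applies the common `*` and `$` wildcard conventions. The standard library's `urllib.robotparser` is used only for the crawl delay and sitemap, because it compares rule paths as plain prefixes and does not expand `*`.

```python
import re
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan tables in this report (an assumption,
# not a copy of the fetched file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /*/?printer-friendly
Disallow:
Crawl-delay: 1

Sitemap: https://www.cutimes.com/sitemap.xml
"""

# The standard-library parser handles the non-path records.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())
crawl_delay = parser.crawl_delay("*")  # seconds between requests
sitemaps = parser.site_maps()          # list of sitemap URLs, or None


def rule_matches(pattern: str, path_and_query: str) -> bool:
    """Hypothetical helper: test a path against one rule pattern.

    '*' matches any run of characters and a trailing '$' anchors the
    end of the path. An empty pattern matches nothing, which is how an
    empty Disallow line is conventionally read.
    """
    if not pattern:
        return False
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece and join the pieces with '.*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, so the rule behaves as a prefix.
    return re.match(regex, path_and_query) is not None


blocked = rule_matches("/*/?printer-friendly", "/news/2024/?printer-friendly")
allowed = not rule_matches("/*/?printer-friendly", "/news/2024/")
```

Under these assumptions, a printer-friendly URL one or more directories deep matches the first Disallow rule, while the same path without the query string does not. The example paths are illustrative and are not taken from the site.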