Robots Exclusion Standard data for copykat.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | copykat.com |
Base Domain | copykat.com |
Scan Status | Ok |
Last Scan | 2024-11-15T11:56:28+00:00 |
Next Scan | 2024-11-22T11:56:28+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-15T11:56:28+00:00 |
URL | https://copykat.com/robots.txt |
Domain IPs | 104.18.4.29, 104.18.5.29, 2606:4700::6812:41d, 2606:4700::6812:51d |
Response IP | 104.18.5.29 |
Found | Yes |
Hash | 558c5ac529b43bf85f90a03ea55f932c5a2516a3ed3c5c858e62ba53c3a04a77 |
SimHash | 09058a14a353 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /wp-admin |
Disallow | *?print* |
Disallow | */comments/ |
Disallow | */comments-page* |
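
To illustrate how the rules in this group interact, here is a minimal Python sketch. It assumes a crawler that follows the longest-match convention described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. Not every crawler behaves this way. The names `RULES`, `pattern_to_regex`, and `is_allowed`, and the example paths, are hypothetical and are not part of the scan.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin"),
    ("disallow", "*?print*"),
    ("disallow", "*/comments/"),
    ("disallow", "*/comments-page*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumes longest-match precedence, with Allow winning ties.
    A path matched by no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for example in ["/some-recipe/", "/wp-admin/options.php",
                    "/some-recipe/?print=1", "/some-recipe/comments-page-2/"]:
        print(example, is_allowed(example))
```

Under these assumptions, `Allow: /` matches every path but is the shortest pattern, so any of the four `Disallow` patterns overrides it whenever one of them also matches.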
Other Records
Field | Value |
---|---|
sitemap | https://copykat.com/sitemap_index.xml |
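
The parsed groups and records above can be rendered back into robots.txt syntax. The sketch below is only an approximation: the scan does not show the original file's comments, blank lines, field casing, or ordering, so the output is a plausible equivalent rather than the actual file served by copykat.com. The function name `render_robots_txt` and its argument layout are hypothetical.

```python
def render_robots_txt(groups, other_records):
    """Render parsed groups and extra records as robots.txt text.

    groups: dict mapping a user-agent token to a list of (rule, path) pairs.
    other_records: list of (field, value) pairs, such as sitemap entries.
    """
    lines = []
    for user_agent, rules in groups.items():
        lines.append(f"User-agent: {user_agent}")
        for rule, path in rules:
            lines.append(f"{rule}: {path}")
        lines.append("")  # blank line separates groups
    for field, value in other_records:
        lines.append(f"{field.capitalize()}: {value}")
    return "\n".join(lines) + "\n"


# Values taken from the Groups and Other Records tables in this report.
GROUPS = {
    "*": [
        ("Allow", "/"),
        ("Disallow", "/wp-admin"),
        ("Disallow", "*?print*"),
        ("Disallow", "*/comments/"),
        ("Disallow", "*/comments-page*"),
    ],
}
OTHER_RECORDS = [("sitemap", "https://copykat.com/sitemap_index.xml")]

if __name__ == "__main__":
    print(render_robots_txt(GROUPS, OTHER_RECORDS), end="")
```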