copykat.com
robots.txt

Robots Exclusion Standard data for copykat.com

Resource Scan

Scan Details

Site Domain copykat.com
Base Domain copykat.com
Scan Status Ok
Last Scan2024-11-15T11:56:28+00:00
Next Scan 2024-11-22T11:56:28+00:00

Last Scan

Scanned2024-11-15T11:56:28+00:00
URL https://copykat.com/robots.txt
Domain IPs 104.18.4.29, 104.18.5.29, 2606:4700::6812:41d, 2606:4700::6812:51d
Response IP 104.18.5.29
Found Yes
Hash 558c5ac529b43bf85f90a03ea55f932c5a2516a3ed3c5c858e62ba53c3a04a77
SimHash 09058a14a353

Groups

*

Rule Path
Allow /
Disallow /wp-admin
Disallow *?print*
Disallow */comments/
Disallow */comments-page*

Other Records

Field Value
sitemap https://copykat.com/sitemap_index.xml