cloneyourself.cc
robots.txt

Robots Exclusion Standard data for cloneyourself.cc

Resource Scan

Scan Details

Site Domain cloneyourself.cc
Base Domain cloneyourself.cc
Scan Status Ok
Last Scan2025-09-23T11:32:09+00:00
Next Scan 2025-10-07T11:32:09+00:00

Last Scan

Scanned2025-09-23T11:32:09+00:00
URL https://www.cloneyourself.cc/robots.txt
Domain IPs 162.159.128.53, 162.159.138.52
Response IP 162.159.128.53
Found Yes
Hash 3772d83e089a2d378d3ee101a25dbf817fd38e8b3431b85b0704da08b174f877
SimHash 3a5dd850a9b2

Groups

botify
spider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /search?
Disallow /list?
Disallow /sign_in?
Disallow /sign_up?
Disallow /*?*filters=*
Disallow /*comments
Disallow /spaces/16616995*
Disallow /spaces/16616916*
Disallow /spaces/16616901*
Disallow /spaces/16364369*
Disallow /spaces/16147133*
Disallow /communities/

Other Records

Field Value
sitemap https://www.cloneyourself.cc/sitemap.xml