curiosity2connect.com
robots.txt

Robots Exclusion Standard data for curiosity2connect.com

Resource Scan

Scan Details

Site Domain curiosity2connect.com
Base Domain curiosity2connect.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-21T23:56:20+00:00
Next Scan 2025-10-21T23:56:20+00:00

Last Successful Scan

Scanned 2025-08-16T23:12:27+00:00
URL https://curiosity2connect.com/robots.txt
Domain IPs 147.185.161.77, 147.185.161.78
Response IP 147.185.161.77
Found Yes
Hash 6bb03a9637ce5a03296bc0d929a4f41611559f4a580aa9c2c6066a972f302f45
SimHash 724dd850fbb8
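
The timestamps in the scan details can be compared directly. The short Python sketch below parses the three ISO 8601 values listed above and computes the gap between the last scan and the next scheduled scan, and between the last successful scan and the failed one. The variable names are illustrative, and treating the gap as a fixed rescan interval is only an assumption drawn from this single pair of dates.

```python
from datetime import datetime

# Timestamps copied from the scan details above.
last_scan = datetime.fromisoformat("2025-09-21T23:56:20+00:00")
next_scan = datetime.fromisoformat("2025-10-21T23:56:20+00:00")
last_success = datetime.fromisoformat("2025-08-16T23:12:27+00:00")

# Gap between the failed scan and the next scheduled attempt.
interval = next_scan - last_scan

# How long the last successful copy had been on record when the scan failed.
since_success = last_scan - last_success

print(f"Next scan is scheduled {interval.days} days after the last scan")
print(f"Last successful scan was {since_success.days} days before the failed scan")
```

The group and rule data below therefore reflects the file as fetched on 2025-08-16, not the state of the site at the failed scan.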

Groups

botify
spider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /search?
Disallow /list?
Disallow /sign_in?
Disallow /sign_up?
Disallow /*?*filters=*
Disallow /*comments
Disallow /spaces/17063932*
Disallow /spaces/16167917*
Disallow /spaces/17463383*
Disallow /spaces/16906809*
Disallow /spaces/17141942*
Disallow /spaces/17053122*
Disallow /spaces/17069743*
Disallow /spaces/16993464*
Disallow /spaces/16156427*
Disallow /spaces/16912959*
Disallow /spaces/16660773*
Disallow /spaces/16580861*
Disallow /spaces/18661506*
Disallow /spaces/18970599*
Disallow /spaces/16581195*
Disallow /spaces/16580659*
Disallow /communities/

Other Records

Field Value
sitemap https://curiosity2connect.com/sitemap.xml
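
To see how the groups above would apply to a given crawler and path, the following Python sketch encodes them and evaluates a request. It is a simplified illustration, not the scanner's or any crawler's actual logic: it assumes exact, case-insensitive matching of the user-agent token, treats `*` in a path as "any sequence of characters", and matches rules as path prefixes. Since the file contains no Allow rules, any matching Disallow rule blocks the path. The names `GROUPS`, `pattern_to_regex`, `select_rules`, and `is_allowed` are hypothetical.

```python
import re

# Rules copied from the groups listed above; keys are user-agent tokens.
GROUPS = {
    ("botify", "spider"): [],          # no rules: all paths allowed
    ("mj12bot",): ["/"],               # everything disallowed
    ("*",): [
        "/search?", "/list?", "/sign_in?", "/sign_up?",
        "/*?*filters=*", "/*comments",
        "/spaces/17063932*", "/spaces/16167917*", "/spaces/17463383*",
        "/spaces/16906809*", "/spaces/17141942*", "/spaces/17053122*",
        "/spaces/17069743*", "/spaces/16993464*", "/spaces/16156427*",
        "/spaces/16912959*", "/spaces/16660773*", "/spaces/16580861*",
        "/spaces/18661506*", "/spaces/18970599*", "/spaces/16581195*",
        "/spaces/16580659*", "/communities/",
    ],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Turn a robots.txt path pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")   # a trailing "$" pins the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def select_rules(user_agent: str) -> list:
    """Pick the group naming this agent, falling back to the "*" group."""
    token = user_agent.lower()
    for names, rules in GROUPS.items():
        if token in names:
            return rules
    return GROUPS[("*",)]


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if no Disallow rule in the selected group matches the path."""
    rules = select_rules(user_agent)
    return not any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("MJ12bot", "/about"),
        ("botify", "/search?q=x"),
        ("Googlebot", "/search?q=x"),
        ("Googlebot", "/about"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Python's standard `urllib.robotparser` does not interpret `*` wildcards inside paths, which is why this sketch uses its own pattern conversion for rules such as `/*?*filters=*` and `/*comments`.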