curious.social
robots.txt

Robots Exclusion Standard data for curious.social

Resource Scan

Scan Details

Site Domain curious.social
Base Domain curious.social
Scan Status Ok
Last Scan2025-11-16T22:31:56+00:00
Next Scan 2025-11-17T22:31:56+00:00

Last Scan

Scanned2025-11-16T22:31:56+00:00
URL https://curious.social/robots.txt
Domain IPs 104.21.19.68, 172.67.185.152, 2606:4700:3030::ac43:b998, 2606:4700:3035::6815:1344
Response IP 104.21.19.68
Found Yes
Hash fba77ace66a67c92b8c2a06cf676a2eb4c0ac4d3bb53b524a57ba3f482b7e01e
SimHash a874bba4f763

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
Disallow /api/v1/instance/domain_blocks

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file