embracing.space
robots.txt

Robots Exclusion Standard data for embracing.space

Resource Scan

Scan Details

Site Domain embracing.space
Base Domain embracing.space
Scan Status Ok
Last Scan2025-10-17T02:04:14+00:00
Next Scan 2025-10-18T02:04:14+00:00

Last Scan

Scanned2025-10-17T02:04:14+00:00
URL https://embracing.space/robots.txt
Domain IPs 104.21.41.148, 172.67.147.168, 2606:4700:3033::6815:2994, 2606:4700:3035::ac43:93a8
Response IP 104.21.41.148
Found Yes
Hash fba77ace66a67c92b8c2a06cf676a2eb4c0ac4d3bb53b524a57ba3f482b7e01e
SimHash a874bba4f763

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
Disallow /api/v1/instance/domain_blocks

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file