cau.se
robots.txt

Robots Exclusion Standard data for cau.se

Resource Scan

Scan Details

Site Domain cau.se
Base Domain cau.se
Scan Status Ok
Last Scan2026-02-10T13:12:04+00:00
Next Scan 2026-02-11T13:12:04+00:00

Last Scan

Scanned2026-02-10T13:12:04+00:00
URL https://cau.se/robots.txt
Domain IPs 104.21.68.2, 172.67.183.145, 2606:4700:3030::6815:4402, 2606:4700:3037::ac43:b791
Response IP 104.21.68.2
Found Yes
Hash fba77ace66a67c92b8c2a06cf676a2eb4c0ac4d3bb53b524a57ba3f482b7e01e
SimHash a874bba4f763

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
Disallow /api/v1/instance/domain_blocks

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file