eldritch.cafe
robots.txt

Robots Exclusion Standard data for eldritch.cafe

Resource Scan

Scan Details

Site Domain eldritch.cafe
Base Domain eldritch.cafe
Scan Status Ok
Last Scan 2024-10-01T06:52:33+00:00
Next Scan 2024-10-02T06:52:33+00:00

Last Scan

Scanned 2024-10-01T06:52:33+00:00
URL https://eldritch.cafe/robots.txt
Domain IPs 2001:41d0:305:2100::13fc, 2a03:4000:37:737::1, 51.75.122.114, 91.132.147.113
Response IP 91.132.147.113
Found Yes
Hash 7d5bbcefa7d9a270953c9b850b8389715c356459553205a18f7556d0c0c5df40
SimHash 08745a70a153

Groups

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

ccbot

Rule Path
Disallow /

fedicrawl/1.0

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
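
The groups above can be checked programmatically. The following is a minimal sketch, assuming the groups are rendered back into conventional robots.txt syntax; the exact text of the original file is not shown in this scan, so the reconstructed layout is an assumption. The names GROUPS, build_robots_txt, and allowed are hypothetical helpers introduced for illustration, and the parser is Python's standard urllib.robotparser.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in the scan: user-agent token -> disallowed paths.
GROUPS = {
    "*": ["/media_proxy/", "/interact/"],
    "ccbot": ["/"],
    "fedicrawl/1.0": ["/"],
    "gptbot": ["/"],
    "chatgpt-user": ["/"],
}


def build_robots_txt(groups):
    """Render the groups as robots.txt text (layout is an assumption)."""
    blocks = []
    for agent, paths in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    # Separate groups with a blank line and end with a newline.
    return "\n\n".join(blocks) + "\n"


# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt(GROUPS).splitlines())


def allowed(agent, path):
    """Return True if `agent` may fetch `path` on eldritch.cafe."""
    return parser.can_fetch(agent, "https://eldritch.cafe" + path)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/"),
        ("CCBot/2.0", "/about"),
        ("ExampleBot", "/media_proxy/123"),
        ("ExampleBot", "/about"),
    ]:
        print(agent, path, allowed(agent, path))
```

ExampleBot stands in for any crawler without a dedicated group, so it falls under the * rules. How a token with a version suffix such as fedicrawl/1.0 is matched varies between parsers, so that group is not exercised here.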