geocitiesarchive.com
robots.txt

Robots Exclusion Standard data for geocitiesarchive.com

Resource Scan

Scan Details

Site Domain geocitiesarchive.com
Base Domain geocitiesarchive.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-31T15:11:16+00:00
Next Scan 2026-01-29T15:11:16+00:00

Last Successful Scan

Scanned2023-12-27T00:06:03+00:00
URL https://geocitiesarchive.com/robots.txt
Redirect http://www.geocitiesarchive.com/robots.txt
Redirect Domain www.geocitiesarchive.com
Redirect Base geocitiesarchive.com
Domain IPs 104.21.83.181, 172.67.180.125, 2606:4700:3033::6815:53b5, 2606:4700:3034::ac43:b47d
Redirect IPs 104.21.83.181, 172.67.180.125, 2606:4700:3033::6815:53b5, 2606:4700:3034::ac43:b47d
Response IP 172.67.180.125
Found Yes
Hash c1ef86287c87376e5fcf8b3ea03cd8cb0ab6ee3549e28019899323e087c993e6
SimHash b8005cc6e1f2

Groups

*

Rule Path
Disallow /cache/
Disallow /css/
Disallow /skin/
Disallow /fonts/
Disallow /icon/
Disallow /js/