googlepages.com
robots.txt

Robots Exclusion Standard data for googlepages.com

Resource Scan

Scan Details

Site Domain googlepages.com
Base Domain googlepages.com
Scan Status Ok
Last Scan2024-09-09T22:54:54+00:00
Next Scan 2024-10-09T22:54:54+00:00

Last Scan

Scanned2024-09-09T22:54:54+00:00
URL http://googlepages.com/robots.txt
Redirect http://sites.google.com/robots.txt
Redirect Domain sites.google.com
Redirect Base google.com
Domain IPs 2404:6800:4003:c03::79, 74.125.200.121
Redirect IPs 142.251.175.100, 142.251.175.101, 142.251.175.102, 142.251.175.113, 142.251.175.138, 142.251.175.139, 2404:6800:4003:c02::65, 2404:6800:4003:c02::66, 2404:6800:4003:c02::71, 2404:6800:4003:c02::8a
Response IP 142.251.10.101
Found Yes
Hash 06f2f341f0db6020f8d8ecbe9e162411d8fb6d9df4569ca78631eb30fa83554e
SimHash f0050058c131

Groups

*

Rule Path
Disallow /feeds
Allow /*/_/rsrc/
Allow /_/atari/*
Disallow /*/_/