earth.google.com
robots.txt

Robots Exclusion Standard data for earth.google.com

Resource Scan

Scan Details

Site Domain earth.google.com
Base Domain google.com
Scan Status Ok
Last Scan2024-11-05T21:35:03+00:00
Next Scan 2024-11-19T21:35:03+00:00

Last Scan

Scanned2024-11-05T21:35:03+00:00
URL https://earth.google.com/robots.txt
Domain IPs 2404:6800:4003:c1c::64, 2404:6800:4003:c1c::66, 2404:6800:4003:c1c::71, 2404:6800:4003:c1c::8a, 64.233.170.100, 64.233.170.101, 64.233.170.102, 64.233.170.113, 64.233.170.138, 64.233.170.139
Response IP 74.125.200.113
Found Yes
Hash b14e51d04906c9bf61989e4623b6cf03e5502328a3cd8c2963b5bb3608656c03
SimHash 660d1e404d92

Groups

*

Rule Path
Disallow /static
Disallow /web/search/

Other Records

Field Value
sitemap https://earth.google.com/sitemap.xml
sitemap https://earth.google.com/sitemap-website.xml