earth.google.com
robots.txt

Robots Exclusion Standard data for earth.google.com

Resource Scan

Scan Details

Site Domain earth.google.com
Base Domain google.com
Scan Status Ok
Last Scan 2024-05-07T20:42:19+00:00
Next Scan 2024-05-21T20:42:19+00:00

Last Scan

Scanned 2024-05-07T20:42:19+00:00
URL https://earth.google.com/robots.txt
Domain IPs 142.251.175.100, 142.251.175.101, 142.251.175.102, 142.251.175.113, 142.251.175.138, 142.251.175.139, 2404:6800:4003:c1c::65, 2404:6800:4003:c1c::71, 2404:6800:4003:c1c::8a, 2404:6800:4003:c1c::8b
Response IP 172.253.118.100
Found Yes
Hash b14e51d04906c9bf61989e4623b6cf03e5502328a3cd8c2963b5bb3608656c03
SimHash 660d1e404d92
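
The report does not say which algorithm produced the Hash value. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body. The function names and the sample bodies are hypothetical, and nothing here reproduces the scanner's actual method or the SimHash value.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex SHA-256 digest of a robots.txt response body.

    Assumption: the report's Hash field is a SHA-256 digest of the raw
    bytes. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_fingerprint(body) != previous_hash


# Hypothetical bodies, used only to show the comparison.
old_body = b"User-agent: *\nDisallow: /static\n"
new_body = b"User-agent: *\nDisallow: /static\nDisallow: /web/search/\n"

stored = body_fingerprint(old_body)
```

Comparing digests between scans is one simple way to detect that a file changed without storing every copy; an exact digest changes on any byte difference, including whitespace.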

Groups

*

Rule Path
Disallow /static
Disallow /web/search/
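
The group above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch: the robots.txt text is reconstructed from the rules listed in this report rather than fetched from the site, and the agent name ExampleBot and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report; not the fetched file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /static
Disallow: /web/search/
"""

BASE_URL = "https://earth.google.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the given agent may fetch BASE_URL + path."""
    return parser.can_fetch(agent, BASE_URL + path)


# Hypothetical paths chosen to exercise each rule.
for sample in ("/static/app.js", "/web/search/Paris", "/web/search", "/web/"):
    print(sample, is_allowed(sample))
```

Rules are matched as path prefixes, so "Disallow /static" also covers anything beneath /static/, while "/web/search" without the trailing slash is not covered by "Disallow /web/search/".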

Other Records

Field Value
sitemap https://earth.google.com/sitemap.xml
sitemap https://earth.google.com/sitemap-website.xml
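
The sitemap records can be read back with the same standard-library parser. The sketch below assumes Python 3.8 or later, where RobotFileParser.site_maps() is available, and again uses robots.txt text reconstructed from this report rather than the fetched file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table in this report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /static
Disallow: /web/search/
Sitemap: https://earth.google.com/sitemap.xml
Sitemap: https://earth.google.com/sitemap-website.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the Sitemap values in file order, or None if absent.
sitemaps = parser.site_maps() or []

for url in sitemaps:
    print(url)
```

Sitemap records are independent of any user-agent group, which is why the report lists them separately from the "*" rules.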