Robots Exclusion Standard data for earth.com

Resource Scan

Scan Details

Site Domain earth.com
Base Domain earth.com
Scan Status Ok
Last Scan 2024-06-29T13:23:12+00:00
Next Scan 2024-07-06T13:23:12+00:00

Last Scan

Scanned 2024-06-29T13:23:12+00:00
URL https://earth.com/robots.txt
Redirect https://www.earth.com/robots.txt
Redirect Domain www.earth.com
Redirect Base earth.com
Domain IPs 13.33.88.126, 13.33.88.33, 13.33.88.70, 13.33.88.91
Redirect IPs 13.33.88.126, 13.33.88.33, 13.33.88.70, 13.33.88.91
Response IP 13.33.88.70
Found Yes
Hash 5e159d148b8ea58b4569c926244d34d8a3439fe49dc6a1976f411f8fa770f3d6
SimHash ed5d5800cd92
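
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a content hash of a fetched robots.txt body could be computed as follows and compared between scans to detect changes. The function name `content_hash` is hypothetical, and whether the scanner hashes the raw bytes or a normalized form is not known.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner's "Hash" field is SHA-256 over the raw
    response bytes. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Two scans with identical bodies give identical digests, so a changed
# digest signals that the file content changed between scans.
previous = content_hash(b"User-agent: *\nDisallow: /shop/\n")
current = content_hash(b"User-agent: *\nDisallow: /shop/\n")
changed = previous != current
```

A digest computed this way will match the reported Hash only if the assumptions about algorithm and input bytes hold for the real scanner.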

Groups

*

Rule Path
Disallow /shop/

Other Records

Field Value
sitemap https://www.earth.com/sitemap_index.xml
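
The group and sitemap records above can be checked against concrete URLs with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the scanned fields; the exact text of the live file is an assumption, and the helper `is_allowed` and the sample URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's fields (group "*", one Disallow rule,
# one sitemap record). The live file's exact wording is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /shop/
Sitemap: https://www.earth.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(url: str, agent: str = "*") -> bool:
    """Return True if the given user agent may fetch the URL."""
    return parser.can_fetch(agent, url)


# Hypothetical sample URLs: one under /shop/, one elsewhere.
shop_ok = is_allowed("https://www.earth.com/shop/item")
news_ok = is_allowed("https://www.earth.com/news/")
sitemaps = parser.site_maps()
```

Under this reconstruction, paths beginning with /shop/ are blocked for every crawler and all other paths are allowed, since the file contains no other rules.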