search.earth911.com
robots.txt

Robots Exclusion Standard data for search.earth911.com

Resource Scan

Scan Details

Site Domain search.earth911.com
Base Domain earth911.com
Scan Status Ok
Last Scan 2024-05-21T06:58:24+00:00
Next Scan 2024-06-20T06:58:24+00:00
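
The gap between the Last Scan and Next Scan timestamps above can be checked with a short sketch. The variable names are illustrative, and reading the gap as a fixed rescan interval is an assumption; the report itself does not state a schedule.

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing.
last_scan = datetime.fromisoformat("2024-05-21T06:58:24+00:00")
next_scan = datetime.fromisoformat("2024-06-20T06:58:24+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the difference is a whole number of days, with no leftover seconds.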

Last Scan

Scanned 2024-05-21T06:58:24+00:00
URL https://search.earth911.com/robots.txt
Domain IPs 34.192.250.59
Response IP 34.192.250.59
Found Yes
Hash 9da01b2003306d81097fda826b260a6c3ff2bf6d9aad5d8ab5141ac07bf50045
SimHash 651ddc56eb43

Groups

bender

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

urlappendbot

Rule Path
Disallow /

*

Rule Path
Disallow /material
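
As a rough illustration of how the groups above would be applied, the sketch below rebuilds them as robots.txt text and queries them with Python's standard `urllib.robotparser`. The reconstructed file body is an assumption (the original file's exact layout, ordering, and comments are not shown in this report), and `examplebot` is a hypothetical crawler name used only to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the four groups listed in this report.
# The real file may differ in ordering, spacing, and comments.
ROBOTS_TXT = """\
User-agent: bender
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: urlappendbot
Disallow: /

User-agent: *
Disallow: /material
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://search.earth911.com"

# A named group with "Disallow /" blocks the whole site for that agent.
print(parser.can_fetch("baiduspider", BASE + "/"))

# Any other agent falls through to the "*" group, which only blocks
# paths starting with /material.
print(parser.can_fetch("examplebot", BASE + "/"))
print(parser.can_fetch("examplebot", BASE + "/material"))
```

Under this reconstruction, the three named agents are refused every path, while other agents are refused only paths beginning with `/material`. The standard-library parser uses simple prefix matching, so other crawlers may interpret the same rules slightly differently.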

Other Records

Field Value
crawl-delay 5

Comments

  • Sitemap: https://search.earth911.com/sitemap.xml