earth911.com
robots.txt

Robots Exclusion Standard data for earth911.com

Resource Scan

Scan Details

Site Domain earth911.com
Base Domain earth911.com
Scan Status Ok
Last Scan 2024-11-16T13:59:16+00:00
Next Scan 2024-11-23T13:59:16+00:00

Last Scan

Scanned 2024-11-16T13:59:16+00:00
URL https://earth911.com/robots.txt
Domain IPs 104.196.133.3
Response IP 104.196.133.3
Found Yes
Hash fb56b33492a0ae54b74d1517f329c31da6cebc61ce13596b8be342c54011a0a3
SimHash 196458c0a193
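
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body; `sha256_hex` and `matches_report` are hypothetical helper names, not part of any scanner API. It shows how a freshly fetched robots.txt body could be compared against the recorded hash to detect a change between scans.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "fb56b33492a0ae54b74d1517f329c31da6cebc61ce13596b8be342c54011a0a3"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value.

    Assumption: the report hashes the unmodified response body with SHA-256.
    If the scanner normalizes the file first, the digests will not match.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch only means the body differs from what was hashed at scan time, or that the assumption about the algorithm is wrong; it does not say which.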

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow (empty path: nothing is disallowed)

Other Records

Field Value
sitemap https://earth911.com/sitemap_index.xml
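
The groups and sitemap record listed above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt text is reconstructed from this report, so the ordering, blank lines, and omitted comments are assumptions and may differ from the live file. `ExampleBot` is a made-up user agent standing in for any crawler that is not named in a group.

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with "Disallow: /".
BLOCKED_AGENTS = [
    "chatgpt-user", "gptbot", "anthropic-ai", "claude-web",
    "piplbot", "ccbot", "facebookbot",
]


def build_robots_lines():
    """Rebuild robots.txt lines from the report (layout is an assumption)."""
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # The wildcard group has an empty Disallow, which blocks nothing.
    lines += ["User-agent: *", "Disallow:", ""]
    lines += ["Sitemap: https://earth911.com/sitemap_index.xml"]
    return lines


def make_parser():
    """Parse the reconstructed lines without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_lines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    for agent in ["GPTBot", "CCBot", "ExampleBot"]:
        allowed = parser.can_fetch(agent, "https://earth911.com/")
        print(agent, "allowed" if allowed else "blocked")
    print(parser.site_maps())
```

`RobotFileParser` matches user-agent names case-insensitively, so `GPTBot` falls under the `gptbot` group, while an unlisted agent falls through to the `*` group and is allowed. `site_maps()` requires Python 3.8 or later.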

Comments

  • ======Raptive Begin======
  • ======Raptive End======
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK