eartharchives.org
robots.txt

Robots Exclusion Standard data for eartharchives.org

Resource Scan

Scan Details

Site Domain eartharchives.org
Base Domain eartharchives.org
Scan Status Ok
Last Scan2024-10-18T22:49:44+00:00
Next Scan 2024-11-17T22:49:44+00:00

Last Scan

Scanned2024-10-18T22:49:44+00:00
URL https://eartharchives.org/robots.txt
Domain IPs 13.215.144.61, 2406:da18:880:3800::c8, 2406:da18:880:3801::c8, 52.74.166.77
Response IP 13.215.144.61
Found Yes
Hash 92067bd014a8cbb15f13abe1be92faa515daaae2ff11a67fbf19a9498c498c5d
SimHash b2850d8d2550

Groups

*

Rule Path
Disallow /tagged*

Comments

  • See //www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: