awareearth.org
robots.txt

Robots Exclusion Standard data for awareearth.org

Resource Scan

Scan Details

Site Domain awareearth.org
Base Domain awareearth.org
Scan Status Ok
Last Scan 2025-11-16T21:05:54+00:00
Next Scan 2025-11-23T21:05:54+00:00

Last Scan

Scanned 2025-11-16T21:05:54+00:00
URL https://awareearth.org/robots.txt
Domain IPs 104.21.70.94, 172.67.222.121, 2606:4700:3033::ac43:de79, 2606:4700:3036::6815:465e
Response IP 172.67.222.121
Found Yes
Hash 78d3d11cf3e6d0df503ebc5a989204b743c49c489ff888636fb95412fbc5d7dc
SimHash fb81986100b6

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Allow /
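
How the two * groups above combine for a given path can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's own logic: it assumes the groups are merged into one rule list and that the longest matching path wins, with Allow winning a tie, as RFC 9309 describes. The names RULES and is_allowed are hypothetical, and wildcard characters (* and $) in paths are not handled because none appear in these rules.

```python
# Rules copied from the two "*" groups listed above, merged into one list.
# Merging groups for the same user agent is an assumption based on RFC 9309.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if a crawler may fetch `path`.

    The longest matching prefix decides; on equal length, allow wins;
    a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/", "/wp-admin/", "/wp-admin/admin-ajax.php", "/wp-admin/options.php"):
        print(p, is_allowed(p))
```

Parsers that apply the first matching rule instead of the longest one can give a different answer for /wp-admin/admin-ajax.php, since the Disallow line is listed before the Allow line.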

Other Records

Field Value
sitemap https://awareearth.org/sitemap.xml
sitemap https://awareearth.org/sitemap-news.xml
sitemap https://awareearth.org/sitemap-posttype-web-story.2023.xml
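
The sitemap records above are independent of any user-agent group, so they can be collected with a simple line scan. The sketch below shows one way to do that in Python. The function name sitemap_urls and the SAMPLE text are hypothetical: SAMPLE is a reconstruction from the fields listed in this report, not the actual file body, and treating everything after # as a comment is a simplifying assumption.

```python
def sitemap_urls(robots_txt: str) -> list[str]:
    """Collect the values of all Sitemap records in a robots.txt body."""
    urls = []
    for raw in robots_txt.splitlines():
        # Drop trailing comments, then split "field: value" on the first colon
        # so the colon inside "https://" stays in the value.
        line = raw.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Hypothetical body rebuilt from the records in this report.
SAMPLE = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://awareearth.org/sitemap.xml
sitemap: https://awareearth.org/sitemap-news.xml
"""

if __name__ == "__main__":
    for url in sitemap_urls(SAMPLE):
        print(url)
```

Field names are matched case-insensitively, which is why both the Sitemap and sitemap spellings in the sample are picked up.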

Comments

  • XML Sitemap & Google News version 5.3.3