freshair.org
robots.txt

Robots Exclusion Standard data for freshair.org

Resource Scan

Scan Details

Site Domain freshair.org
Base Domain freshair.org
Scan Status Ok
Last Scan2025-10-16T04:31:48+00:00
Next Scan 2025-11-15T04:31:48+00:00

Last Scan

Scanned2025-10-16T04:31:48+00:00
URL https://freshair.org/robots.txt
Domain IPs 34.72.112.150
Response IP 34.72.112.150
Found Yes
Hash 7833c034c3065a374d362d1deb11e043ff7e4a92fcdc7198ed31c257c4aff1d0
SimHash e11510440d13

Groups

*

Rule Path
Disallow %5E*/news/page/*

Other Records

Field Value
sitemap https://freshair.org/sitemap_index.xml