osceolaiowa.com
robots.txt
Robots Exclusion Standard data for osceolaiowa.com
Resource Scan
Scan Details
Site Domain | osceolaiowa.com |
Base Domain | osceolaiowa.com |
Scan Status | Ok |
Last Scan | 2024-06-24T13:46:36+00:00 |
Next Scan | 2024-07-01T13:46:36+00:00 |
Last Scan
Scanned | 2024-06-24T13:46:36+00:00 |
URL | https://osceolaiowa.com/robots.txt |
Redirect | https://www.osceolaiowa.com:443/robots.txt |
Redirect Domain | www.osceolaiowa.com |
Redirect Base | osceolaiowa.com |
Domain IPs | 35.71.182.24, 52.223.10.247 |
Redirect IPs | 23.45.207.199, 23.45.207.202, 2600:1413:a000::17d2:fa98, 2600:1413:a000::17d2:fab0 |
Response IP | 23.202.33.120 |
Found | Yes |
Hash | 4432831bc8f19b2ac82b752850812da0a0839dc850f0184c799fcc5d573ed1e5 |
SimHash | 711e1cc9cd93 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /examples/ |
Disallow | /search/ |
Disallow | /test/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.osceolaiowa.com/arc/outboundfeeds/sitemap-index?outputType=xml |
sitemap | https://www.osceolaiowa.com/arc/outboundfeeds/news-sitemap-index?outputType=xml |
sitemap | https://www.osceolaiowa.com/arc/outboundfeeds/news-sitemap?outputType=xml |