newsforjax.com
robots.txt
Robots Exclusion Standard data for newsforjax.com
Resource Scan
Scan Details
Site Domain | newsforjax.com |
Base Domain | newsforjax.com |
Scan Status | Ok |
Last Scan | 2024-06-14T08:07:24+00:00 |
Next Scan | 2024-06-21T08:07:24+00:00 |
Last Scan
Scanned | 2024-06-14T08:07:24+00:00 |
URL | http://newsforjax.com/robots.txt |
Redirect | https://www.news4jax.com/robots.txt |
Redirect Domain | www.news4jax.com |
Redirect Base | news4jax.com |
Domain IPs | 3.222.159.162, 35.170.218.86, 52.204.147.206, 54.205.175.85 |
Redirect IPs | 23.45.207.165, 23.45.207.169, 2600:1413:b000:13::b857:c197, 2600:1413:b000:13::b857:c19c |
Response IP | 42.99.140.152 |
Found | Yes |
Hash | e9283f0d57949f4f2516001ea517ef7718076b4ba2f28dd5b16491cf338b0deb |
SimHash | 690c9c608991 |
Groups
*
Rule | Path |
---|---|
Disallow | /appfeeds/* |
Disallow | /meta/* |
Disallow | /climate-collaborative/* |
Disallow | *outputType%3DrawHTML* |
Disallow | *outputType%3Dappfeeds* |
Other Records
Field | Value |
---|---|
sitemap | https://www.news4jax.com/sitemap.xml |
sitemap | https://www.news4jax.com/arcio/sitemap/ |
sitemap | https://www.news4jax.com/arcio/news-sitemap/ |
sitemap | https://www.news4jax.com/arcio/google-news-feed/ |