justnews.com
robots.txt
Robots Exclusion Standard data for justnews.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | justnews.com |
Base Domain | justnews.com |
Scan Status | Ok |
Last Scan | 2024-04-24T07:09:00+00:00 |
Next Scan | 2024-05-24T07:09:00+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-04-24T07:09:00+00:00 |
URL | http://justnews.com/robots.txt |
Redirect | https://www.local10.com/robots.txt |
Redirect Domain | www.local10.com |
Redirect Base | local10.com |
Domain IPs | 96.45.82.194, 96.45.82.26, 96.45.83.139, 96.45.83.47 |
Redirect IPs | 184.87.193.79, 184.87.193.84, 2600:1413:b000:14::b857:c14f, 2600:1413:b000:14::b857:c154 |
Response IP | 42.99.140.152 |
Found | Yes |
Hash | 555e55ce2548860c4882b141a1a37b26ffdc12e6a85f33768ec7b8d2a99943e2 |
SimHash | 890d9a408991 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /appfeeds/* |
Disallow | /meta/* |
Disallow | /climate-collaborative/* |
Disallow | *outputType%3DrawHTML* |
Disallow | *outputType%3Dappfeeds* |
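The Disallow paths in this group rely on `*` wildcards, which Python's standard `urllib.robotparser` does not expand. As a rough illustration of how such patterns might be evaluated, the following is a minimal sketch, not the crawler logic of any particular search engine. The names `DISALLOW_PATTERNS`, `pattern_to_regex`, and `is_disallowed` are hypothetical. Handling of a trailing `$` follows the common convention and is not taken from this scan. The comparison is literal, so percent-encoding is not normalized.

```python
import re

# Disallow paths copied from the "*" group in the scan above.
DISALLOW_PATTERNS = [
    "/appfeeds/*",
    "/meta/*",
    "/climate-collaborative/*",
    "*outputType%3DrawHTML*",
    "*outputType%3Dappfeeds*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: "*" matches any run of characters and a trailing "$"
    anchors the end of the path; everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then rejoin the pieces with ".*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW_PATTERNS):
    """Return True if the path (plus query string) matches any pattern.

    re.match anchors at the start of the string, which gives the
    prefix-style matching used for robots.txt rules.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under this sketch, `is_disallowed("/appfeeds/top")` is true, while `/news/local/` matches none of the listed patterns.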
Other Records
Field | Value |
---|---|
sitemap | https://www.local10.com/sitemap.xml |
sitemap | https://www.local10.com/arcio/sitemap/ |
sitemap | https://www.local10.com/arcio/news-sitemap/ |
sitemap | https://www.local10.com/arcio/google-news-feed/ |
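The group and sitemap records above can be turned back into robots.txt syntax. The sketch below shows one way to do that and then reads the sitemap URLs back with the standard library's `urllib.robotparser` (its `site_maps()` method needs Python 3.8 or later). The helper `render_robots_txt` and the constants `GROUP_RULES` and `SITEMAPS` are hypothetical. The rendered text is an approximation: the original file's exact layout, ordering, and comments are not recoverable from this scan, so it should not be expected to reproduce the recorded hash.

```python
from urllib.robotparser import RobotFileParser

# Records copied from the scan above.
GROUP_RULES = [
    ("Disallow", "/appfeeds/*"),
    ("Disallow", "/meta/*"),
    ("Disallow", "/climate-collaborative/*"),
    ("Disallow", "*outputType%3DrawHTML*"),
    ("Disallow", "*outputType%3Dappfeeds*"),
]
SITEMAPS = [
    "https://www.local10.com/sitemap.xml",
    "https://www.local10.com/arcio/sitemap/",
    "https://www.local10.com/arcio/news-sitemap/",
    "https://www.local10.com/arcio/google-news-feed/",
]


def render_robots_txt(user_agent, rules, sitemaps):
    """Render one group plus sitemap records as robots.txt text."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"{field}: {path}" for field, path in rules]
    lines.append("")  # a blank line closes the group
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines) + "\n"


text = render_robots_txt("*", GROUP_RULES, SITEMAPS)

# Parse the reconstructed text and list the sitemap URLs it declares.
parser = RobotFileParser()
parser.parse(text.splitlines())
print(parser.site_maps())
```

Only the sitemap lookup is used here. `RobotFileParser.can_fetch` treats `*` in a path literally, so it is not a reliable check for the wildcard rules in this file.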