justnews.com
robots.txt

Robots Exclusion Standard data for justnews.com

Resource Scan

Scan Details

Site Domain justnews.com
Base Domain justnews.com
Scan Status Ok
Last Scan 2024-10-21T13:44:27+00:00
Next Scan 2024-11-20T13:44:27+00:00

Last Scan

Scanned 2024-10-21T13:44:27+00:00
URL http://justnews.com/robots.txt
Redirect https://www.local10.com/robots.txt
Redirect Domain www.local10.com
Redirect Base local10.com
Domain IPs 96.45.82.194, 96.45.82.26, 96.45.83.139, 96.45.83.47
Redirect IPs 23.209.46.25, 23.209.46.6, 2600:1413:b000:13::b857:c197, 2600:1413:b000:13::b857:c19c
Response IP 23.45.207.169
Found Yes
Hash 02dc697a9b72db3fe7abbb781dc70a81de2fe752399f5ba285f42bfc4da82b5f
SimHash 4b0d9e418183

Groups

*

Rule Path
Disallow /appfeeds/*
Disallow /meta/*
Disallow /climate-collaborative/*
Disallow *outputType%3DrawHTML*
Disallow *outputType%3Dappfeeds*

gptbot

Rule Path
Disallow /
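
The two groups above could be evaluated roughly as follows. This is a minimal sketch, not the scanner's own logic: it assumes the wildcard and longest-match semantics described in RFC 9309, compares paths literally (so `%3D` is not decoded to `=`), and expects the caller to pass a bare product token such as `gptbot` rather than a full User-Agent header. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical and exist only for illustration.

```python
import re

# Rules transcribed from the Groups listing above.
GROUPS = {
    "*": [
        ("disallow", "/appfeeds/*"),
        ("disallow", "/meta/*"),
        ("disallow", "/climate-collaborative/*"),
        ("disallow", "*outputType%3DrawHTML*"),
        ("disallow", "*outputType%3Dappfeeds*"),
    ],
    "gptbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` (a product token)."""
    # Assumption: exact, case-insensitive token match, else the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); the longest match wins
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix rules require.
        if pattern and pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("GPTBot", "/news/")` is false because the `gptbot` group disallows `/`, while `is_allowed("Googlebot", "/news/")` is true because no rule in the `*` group matches that path.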

Other Records

Field Value
sitemap https://www.local10.com/sitemap.xml
sitemap https://www.local10.com/arc/outboundfeeds/sitemap/?outputType=xml
sitemap https://www.local10.com/arc/outboundfeeds/news-sitemap/?outputType=xml
sitemap https://www.local10.com/arc/outboundfeeds/google-news-feed/?outputType=xml