news4jax.co
robots.txt

Robots Exclusion Standard data for news4jax.co

Resource Scan

Scan Details

Site Domain news4jax.co
Base Domain news4jax.co
Scan Status Ok
Last Scan 2024-09-21T01:28:36+00:00
Next Scan 2024-09-28T01:28:36+00:00

Last Scan

Scanned 2024-09-21T01:28:36+00:00
URL https://news4jax.co/robots.txt
Redirect https://www.news4jax.com/robots.txt
Redirect Domain www.news4jax.com
Redirect Base news4jax.com
Domain IPs 13.35.18.14, 13.35.18.56, 13.35.18.8, 13.35.18.80
Redirect IPs 23.52.171.106, 23.52.171.75, 2600:1413:b000:13::b857:c197, 2600:1413:b000:13::b857:c19c
Response IP 23.52.171.145
Found Yes
Hash 32884646e8d14204ad256219cc9096484f04fe7c54b4957da9317d4d2cf08e4f
SimHash 6b0d1c208093

Groups

*

Rule Path
Disallow /appfeeds/*
Disallow /meta/*
Disallow /climate-collaborative/*
Disallow *outputType%3DrawHTML*
Disallow *outputType%3Dappfeeds*

gptbot

Rule Path
Disallow /
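
As a rough illustration of how the two groups above might be evaluated, here is a minimal Python sketch. The names GROUPS, pattern_to_regex and is_allowed are hypothetical and not part of the scan. It assumes "*" matches any run of characters, a trailing "$" anchors the end of the path, paths are compared literally (so "%3D" in a rule is not treated as "="), and a crawler with no group of its own falls back to the "*" group. Group lookup is simplified to an exact, case-insensitive name match, and Allow rules are not handled because none are listed.

```python
import re

# Groups as listed in the scan above (user-agent -> disallowed path patterns).
GROUPS = {
    "*": [
        "/appfeeds/*",
        "/meta/*",
        "/climate-collaborative/*",
        "*outputType%3DrawHTML*",
        "*outputType%3Dappfeeds*",
    ],
    "gptbot": ["/"],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if no Disallow pattern in the selected group matches path."""
    # Simplification: exact lowercase lookup, falling back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(pattern_to_regex(p).match(path) for p in rules)
```

Under these assumptions, is_allowed("GPTBot", "/news/local/") is False because the gptbot group disallows everything, while any other crawler is blocked only on the feed, meta, climate-collaborative and outputType paths.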

Other Records

Field Value
sitemap https://www.news4jax.com/sitemap.xml
sitemap https://www.news4jax.com/arc/outboundfeeds/sitemap/?outputType=xml
sitemap https://www.news4jax.com/arc/outboundfeeds/news-sitemap/?outputType=xml
sitemap https://www.news4jax.com/arc/outboundfeeds/google-news-feed/?outputType=xml
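
For readers who want to see the records above in robots.txt form, the following Python sketch renders them back into file text. It is a reconstruction under assumptions, not the file the scanner fetched: the original field casing, line order, comments and blank lines are unknown, and render_robots_txt is a hypothetical helper name.

```python
# Records copied from the Groups and Other Records sections of the scan.
GROUPS = [
    ("*", [
        "/appfeeds/*",
        "/meta/*",
        "/climate-collaborative/*",
        "*outputType%3DrawHTML*",
        "*outputType%3Dappfeeds*",
    ]),
    ("gptbot", ["/"]),
]

SITEMAPS = [
    "https://www.news4jax.com/sitemap.xml",
    "https://www.news4jax.com/arc/outboundfeeds/sitemap/?outputType=xml",
    "https://www.news4jax.com/arc/outboundfeeds/news-sitemap/?outputType=xml",
    "https://www.news4jax.com/arc/outboundfeeds/google-news-feed/?outputType=xml",
]


def render_robots_txt(groups, sitemaps) -> str:
    """Render groups and sitemap records as robots.txt text (assumed layout)."""
    lines = []
    for agent, paths in groups:
        lines.append(f"User-agent: {agent}")
        lines.extend(f"Disallow: {path}" for path in paths)
        lines.append("")  # blank line between groups
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, SITEMAPS), end="")
```

Because the layout is assumed, the rendered text should not be expected to reproduce the Hash or SimHash values reported for the scanned file.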