katctv.com
robots.txt

Robots Exclusion Standard data for katctv.com

Resource Scan

Scan Details

Site Domain katctv.com
Base Domain katctv.com
Scan Status Ok
Last Scan 2024-05-25T23:18:09+00:00
Next Scan 2024-06-01T23:18:09+00:00

Last Scan

Scanned 2024-05-25T23:18:09+00:00
URL https://katctv.com/robots.txt
Redirect https://www.katc.com/robots.txt
Redirect Domain www.katc.com
Redirect Base katc.com
Domain IPs 18.207.17.124, 52.22.220.90
Redirect IPs 13.33.88.109, 13.33.88.119, 13.33.88.43, 13.33.88.71
Response IP 13.33.88.43
Found Yes
Hash 6d99e61ef9f48b1b700321bb5e6b6167497c41249e2baeb150cc435cf7670b45
SimHash 2b04c840e753
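
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The scan record does not say which algorithm was used or exactly which bytes were hashed, so the sketch below is only an assumption: it shows how a content fingerprint of that shape could be computed from a fetched robots.txt body. The function name `fingerprint` is hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a SHA-256 hex digest of the raw response body.

    Assumption: the scanner hashes the unmodified bytes with SHA-256.
    The record itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    sample = b"User-agent: *\nDisallow: /cms/\n"
    digest = fingerprint(sample)
    # A SHA-256 hex digest is always 64 characters long.
    print(len(digest), digest)
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file changed, without storing or diffing the full content.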

Groups

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /_debug/
Disallow /_plugins/
Disallow /ajax/
Disallow /cms/

Other Records

Field Value
sitemap https://www.katc.com/sitemap.xml
sitemap https://www.katc.com/news-sitemap-content.xml
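
The groups and records listed above can be checked with Python's standard `urllib.robotparser`. The robots.txt text in this sketch is reconstructed from the parsed rules on this page, so the original file's exact layout, casing, and comments are assumptions. `ExampleBot` and the sample paths are hypothetical and stand in for any crawler and URL not named in the file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the parsed groups above; the real file may differ
# in ordering, casing, and comments.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: *
Disallow: /_debug/
Disallow: /_plugins/
Disallow: /ajax/
Disallow: /cms/

Sitemap: https://www.katc.com/sitemap.xml
Sitemap: https://www.katc.com/news-sitemap-content.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    # gptbot and google-extended are blocked from the whole site.
    print(parser.can_fetch("GPTBot", "https://www.katc.com/news/"))
    # Any other agent falls through to the "*" group.
    print(parser.can_fetch("ExampleBot", "https://www.katc.com/news/"))
    print(parser.can_fetch("ExampleBot", "https://www.katc.com/cms/login"))
    # site_maps() requires Python 3.8 or later.
    print(parser.site_maps())
```

Agent names are matched case-insensitively, so `GPTBot` hits the `gptbot` group. Agents without their own group are only kept out of the four listed directories.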