khabarindiatv.com
robots.txt

Robots Exclusion Standard data for khabarindiatv.com

Resource Scan

Scan Details

Site Domain khabarindiatv.com
Base Domain khabarindiatv.com
Scan Status Ok
Last Scan2024-05-26T08:14:48+00:00
Next Scan 2024-06-02T08:14:48+00:00

Last Scan

Scanned2024-05-26T08:14:48+00:00
URL https://khabarindiatv.com/robots.txt
Redirect https://www.indiatv.in:443/robots.txt
Redirect Domain www.indiatv.in
Redirect Base indiatv.in
Domain IPs 13.33.88.2, 13.33.88.44, 13.33.88.69, 13.33.88.93
Redirect IPs 108.156.133.15, 108.156.133.33, 108.156.133.61, 108.156.133.82
Response IP 108.156.133.82
Found Yes
Hash 49f2537f6e10d21f0320da81dfb46638162b499fb5e39d4decb82767797e540f
SimHash 4804d8f2d313

Groups

*

Rule Path
Allow /
Disallow /testfiles/
Disallow /8323530/

twitterbot

Rule Path
Allow /
Disallow /speedtest

adsbot-google

Rule Path
Disallow /testfiles/

googlebot-video

Rule Path
Disallow /search-video/
Disallow /testfiles/

gptbot

Rule Path
Disallow /

ccbots

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.indiatv.in/news-sitemap.xml
sitemap https://www.indiatv.in/xmlsitemap/sitemap/paisa-generic-index.xml
sitemap https://www.indiatv.in/sitemap.xml
sitemap https://www.indiatv.in/news-sitemap-paisa.xml