indiatv.in
robots.txt

Robots Exclusion Standard data for indiatv.in

Resource Scan

Scan Details

Site Domain indiatv.in
Base Domain indiatv.in
Scan Status Ok
Last Scan 2024-11-13T12:14:16+00:00
Next Scan 2024-11-20T12:14:16+00:00

Last Scan

Scanned 2024-11-13T12:14:16+00:00
URL https://indiatv.in/robots.txt
Redirect https://www.indiatv.in:443/robots.txt
Redirect Domain www.indiatv.in
Redirect Base indiatv.in
Domain IPs 15.207.52.18, 35.154.57.255
Redirect IPs 13.33.183.76, 13.33.183.86, 13.33.183.87, 13.33.183.93
Response IP 3.165.102.129
Found Yes
Hash f799110536395cc1ddd8660944935567b6978c41a39b050dbd40b33eff5be154
SimHash 480c49e2d731
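
The Hash field is 64 hexadecimal characters long, which is consistent with a hex-encoded SHA-256 digest of the fetched file. The scan does not name the algorithm, so that is an assumption. Under that assumption, a content fingerprint for detecting changes between scans could be sketched as follows; `robots_fingerprint` and the sample body are hypothetical and are not the scanner's own code or the real file contents.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual indiatv.in file.
sample = b"User-agent: *\nDisallow: /testfiles/\n"
digest = robots_fingerprint(sample)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest))
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all. The shorter SimHash value presumably serves a different purpose (similarity rather than exact equality) and is not reproduced here.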

Groups

*

Rule Path
Allow /
Disallow /testfiles/
Disallow /8323530/
Disallow /weatherapis/*
Disallow /relatedvideos/635623/45/694
Disallow /cricket/getmatchlist
Disallow /widgets/playerlivetv*
Disallow /widgets/liveblogresult/*
Disallow /widgets/playervod*
Disallow /newsdata/poll-indiatv-english/*
Disallow /widgets/getgallery/*
Disallow /topic/np-100k-tng-88k*
Disallow /print/*
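
Because this group combines `Allow /` with more specific `Disallow` paths, the outcome depends on how a crawler resolves overlapping rules. The following is a minimal sketch of the longest-match rule described in RFC 9309 (the most specific pattern wins, and an Allow wins a tie), assuming a crawler that follows that rule. The `RULES` list is a subset of the group above, and `pattern_to_regex` and `is_allowed` are hypothetical helper names.

```python
import re

# A subset of the "*" group above, as (directive, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/testfiles/"),
    ("disallow", "/weatherapis/*"),
    ("disallow", "/widgets/playerlivetv*"),
    ("disallow", "/print/*"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path ('*' wildcard, optional trailing '$') to a regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal parts and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Longest matching pattern wins; an Allow beats a Disallow of equal length."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, path in rules:
        if pattern_to_regex(path).match(url_path):
            is_allow = directive == "allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed


print(is_allowed("/india/some-story"))         # True
print(is_allowed("/print/some-story"))         # False
print(is_allowed("/widgets/playerlivetv-hd"))  # False
```

Under this reading, `Allow /` acts only as a default, and each listed `Disallow` path overrides it because its pattern is longer.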

twitterbot

Rule Path
Allow /
Disallow /speedtest

adsbot-google

Rule Path
Disallow /testfiles/

googlebot-video

Rule Path
Disallow /search-video/
Disallow /testfiles/

gptbot

Rule Path
Disallow /

ccbots

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /
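
To illustrate how a crawler picks the group that applies to it, here is a small sketch using Python's standard `urllib.robotparser`. The `ROBOTS_LINES` text is a hand-written reconstruction of three of the groups listed above, not the file as served, and the URLs being checked are made-up examples.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above; not the file as served.
ROBOTS_LINES = """\
User-agent: gptbot
Disallow: /

User-agent: googlebot-video
Disallow: /search-video/
Disallow: /testfiles/

User-agent: ahrefsbot
Allow: /
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# User-agent matching is case-insensitive in the standard-library parser.
print(parser.can_fetch("GPTBot", "https://www.indiatv.in/india/"))                   # False
print(parser.can_fetch("Googlebot-Video", "https://www.indiatv.in/search-video/x"))  # False
print(parser.can_fetch("Googlebot-Video", "https://www.indiatv.in/india/"))          # True
print(parser.can_fetch("AhrefsBot", "https://www.indiatv.in/india/"))                # True
```

Note that `urllib.robotparser` evaluates the rules of a group in file order and stops at the first match, so its answers can differ from crawlers that use longest-match precedence when a group mixes `Allow` and `Disallow` lines.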

Other Records

Field Value
sitemap https://www.indiatv.in/news-sitemap.xml
sitemap https://www.indiatv.in/xmlsitemap/sitemap/paisa-generic-index.xml
sitemap https://www.indiatv.in/sitemap.xml
sitemap https://www.indiatv.in/news-sitemap-paisa.xml
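
The sitemap records above are independent of any user-agent group. As a rough sketch, they can be read back with `RobotFileParser.site_maps()` from Python's standard library (available from Python 3.8). The input lines below are a shortened, hand-written stand-in for the real file.

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
# Shortened stand-in for the real file, using two of the records listed above.
parser.parse([
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://www.indiatv.in/news-sitemap.xml",
    "Sitemap: https://www.indiatv.in/sitemap.xml",
])

# site_maps() returns None when the file declares no sitemaps.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```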