indiatimes.com
robots.txt

Robots Exclusion Standard data for indiatimes.com

Resource Scan

Scan Details

Site Domain indiatimes.com
Base Domain indiatimes.com
Scan Status Ok
Last Scan2024-06-23T08:19:18+00:00
Next Scan 2024-06-30T08:19:18+00:00

Last Scan

Scanned2024-06-23T08:19:18+00:00
URL https://indiatimes.com/robots.txt
Redirect https://www.indiatimes.com/robots.txt
Redirect Domain www.indiatimes.com
Redirect Base indiatimes.com
Domain IPs 104.69.171.126, 2600:1413:1:98a::3621
Redirect IPs 184.50.85.137, 184.50.85.138, 184.50.85.139, 184.50.85.147, 184.50.85.148, 2600:1413:1::b832:558b, 2600:1413:1::b832:5593, 2600:1413:1::b832:5594, 2600:1413:1::b832:55a2, 2600:1413:1::b832:55b2, 96.17.180.179, 96.17.180.182, 96.17.180.187, 96.17.180.188
Response IP 184.50.85.137
Found Yes
Hash ee5931041e3f96e014ed5bbf66cb51bd72f898de938042f26563a0e7480c2f25
SimHash 890c18f05110

Groups

*

Rule Path
Allow /
Disallow /*.php
Disallow /7176/*
Disallow /whatshot/*
Disallow /wattpad/*
Disallow /uc_feed/*
Disallow /brief/*
Disallow /search/*
Disallow /user/*
Disallow /idtoin/*
Disallow */video_player/*

Other Records

Field Value
sitemap https://www.indiatimes.com/sitemap.xml
sitemap https://www.indiatimes.com/sitemap-news.xml
sitemap https://www.indiatimes.com/sitemap-posts-index-2024.xml
sitemap https://www.indiatimes.com/hindi/sitemap-ampstories-2024.xml

Comments

  • robots.txt
  • Sitemaps