timesinternet.in
robots.txt

Robots Exclusion Standard data for timesinternet.in

Resource Scan

Scan Details

Site Domain timesinternet.in
Base Domain timesinternet.in
Scan Status Ok
Last Scan 2024-09-25T15:31:18+00:00
Next Scan 2024-10-09T15:31:18+00:00
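
The gap between the two scan timestamps above can be checked with a short calculation. This is a minimal sketch using only the values shown in this report; the variable names are illustrative and not part of the scanner.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-09-25T15:31:18+00:00")
next_scan = datetime.fromisoformat("2024-10-09T15:31:18+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 14
```

For this record the next scan is scheduled exactly 14 days after the last one. The report itself does not say whether that interval is fixed for every domain.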

Last Scan

Scanned 2024-09-25T15:31:18+00:00
URL https://timesinternet.in/robots.txt
Domain IPs 184.87.193.73, 184.87.193.86, 2600:1413:b000:13::b857:c197, 2600:1413:b000:13::b857:c1a1
Response IP 23.45.207.165
Found Yes
Hash d15ed90046a53ec5cd4771d05320eafb84b3cb07d67fc20c8334628f3f5ad7e9
SimHash 285d897a6c13
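
A sketch of how a freshly fetched copy of the file could be compared against the Hash value above. The report does not name the hash algorithm: the 64 hexadecimal characters are consistent with SHA-256, so SHA-256 over the raw response body is assumed here. The function name `matches_recorded_hash` is hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan details above.
RECORDED_HASH = "d15ed90046a53ec5cd4771d05320eafb84b3cb07d67fc20c8334628f3f5ad7e9"


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 hex digest of body equals the recorded hash.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

If the assumption holds, a mismatch on a later fetch would suggest the file changed since the scan. If it does not hold, the comparison will fail even for an unchanged file.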

Groups

*

Rule Path
Disallow /search?
Disallow /search/?
Disallow /careers/job-detail/
Disallow /careers/job-apply/
Disallow /blog/client-journey-listings/
Disallow /blog/tifm-banner-listings/
Disallow /blog/tifmtube-listings/
Disallow /offers/
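
To illustrate how the rules in this group apply, here is a minimal sketch of prefix matching for the `*` user agent. It assumes plain starts-with matching, which is enough here because none of the listed paths use `*` or `$` wildcards. It is not a full robots.txt parser, and `is_allowed` is a hypothetical helper name.

```python
# Disallow prefixes copied from the "*" group listed above.
DISALLOWED_PREFIXES = [
    "/search?",
    "/search/?",
    "/careers/job-detail/",
    "/careers/job-apply/",
    "/blog/client-journey-listings/",
    "/blog/tifm-banner-listings/",
    "/blog/tifmtube-listings/",
    "/offers/",
]


def is_allowed(path_and_query: str) -> bool:
    """Return True if no Disallow prefix matches the start of the URL path.

    path_and_query is the part of the URL after the host, for example
    "/search?q=news". The group has no Allow rules, so any prefix match
    means the URL is disallowed.
    """
    return not any(path_and_query.startswith(p) for p in DISALLOWED_PREFIXES)


print(is_allowed("/search?q=news"))   # False: matches "/search?"
print(is_allowed("/search"))          # True: no "?" so no prefix matches
print(is_allowed("/offers/summer"))   # False: matches "/offers/"
print(is_allowed("/careers/"))        # True: only job-detail and job-apply are blocked
```

Note that `/search?` and `/search/?` block only search URLs that carry a query string. The bare `/search` and `/search/` paths stay crawlable.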

Other Records

Field Value
sitemap https://timesinternet.in/sitemap.xml
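
The groups and records in this report can be rendered back into robots.txt syntax. The sketch below is a reconstruction under assumptions: the original file's exact line order, comments, casing, and whitespace are not shown in the report, so the output is equivalent in meaning but will not necessarily reproduce the recorded hash. `render_robots_txt` is a hypothetical helper.

```python
# Rules and records copied from the report above.
GROUPS = {
    "*": [
        ("Disallow", "/search?"),
        ("Disallow", "/search/?"),
        ("Disallow", "/careers/job-detail/"),
        ("Disallow", "/careers/job-apply/"),
        ("Disallow", "/blog/client-journey-listings/"),
        ("Disallow", "/blog/tifm-banner-listings/"),
        ("Disallow", "/blog/tifmtube-listings/"),
        ("Disallow", "/offers/"),
    ],
}
OTHER_RECORDS = [("Sitemap", "https://timesinternet.in/sitemap.xml")]


def render_robots_txt(groups, other_records) -> str:
    """Render groups and standalone records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates groups
    lines.extend(f"{field}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


print(render_robots_txt(GROUPS, OTHER_RECORDS))
```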