Robots Exclusion Standard data for theindiadaily.com

Resource Scan

Scan Details

Site Domain theindiadaily.com
Base Domain theindiadaily.com
Scan Status Ok
Last Scan 2024-06-12T14:58:38+00:00
Next Scan 2024-06-19T14:58:38+00:00

Last Scan

Scanned 2024-06-12T14:58:38+00:00
URL https://theindiadaily.com/robots.txt
Redirect https://www.theindiadaily.com/robots.txt
Redirect Domain www.theindiadaily.com
Redirect Base theindiadaily.com
Domain IPs 172.66.40.233, 172.66.43.23, 2606:4700:3108::ac42:28e9, 2606:4700:3108::ac42:2b17
Redirect IPs 172.66.40.233, 172.66.43.23, 2606:4700:3108::ac42:28e9, 2606:4700:3108::ac42:2b17
Response IP 172.66.43.23
Found Yes
Hash b4865c39065f279ecd65c39dda2f50f004076fded2e0a816acc5fc88c419c1fe
SimHash 2004c8640133

Groups

*

Rule Path
Allow /
Disallow /tag/*

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /
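How these groups would apply to a given crawler and path can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's own logic: it assumes the longest-match rule of the Robots Exclusion Protocol (RFC 9309), where the most specific matching pattern wins and Allow wins a tie, and it simplifies user-agent selection to a case-insensitive substring check. The names GROUPS, pattern_to_regex, select_group and is_allowed are hypothetical helpers introduced for this example.

```python
import re

# Rule groups transcribed from the Groups tables above.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/tag/*")],
    "baiduspider": [("disallow", "/")],
    "yandex": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def select_group(user_agent):
    """Pick the group whose token appears in the user agent, else '*'.

    Assumption: substring matching stands in for product-token matching.
    """
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent, path):
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for kind, pattern in select_group(user_agent):
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for ua, path in [
        ("Googlebot", "/news/story"),
        ("Googlebot", "/tag/cricket"),
        ("Baiduspider", "/news/story"),
    ]:
        print(ua, path, is_allowed(ua, path))
```

Under these assumptions, a generic crawler may fetch everything except paths under /tag/, while user agents containing "baiduspider" or "yandex" are blocked from the whole site. Python's standard urllib.robotparser is deliberately not used here, because it applies rules in file order and does not expand the * wildcard, which would give a different answer for /tag/ paths.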

Other Records

Field Value
sitemap https://www.theindiadaily.com/sitemap-news.xml
sitemap https://www.theindiadaily.com/sitemap-webstory.xml
sitemap https://www.theindiadaily.com/sitemap-video.xml
sitemap https://www.theindiadaily.com/sitemap-gallery.xml
sitemap https://www.theindiadaily.com/sitemap-category.xml

Comments

  • Baiduspider
  • Yandex