static.india.com
robots.txt

Robots Exclusion Standard data for static.india.com

Resource Scan

Scan Details

Site Domain static.india.com
Base Domain india.com
Scan Status Ok
Last Scan 2024-05-22T20:50:29+00:00
Next Scan 2024-06-21T20:50:29+00:00

Last Scan

Scanned 2024-05-22T20:50:29+00:00
URL https://static.india.com/robots.txt
Domain IPs 23.209.46.86, 23.209.46.97, 2600:1413:b000:1e::17d1:2e56, 2600:1413:b000:1e::17d1:2e61
Response IP 23.202.33.112
Found Yes
Hash dee0de0ddaa995f62b700c70b4ce0f417a9afc54f70bf9d59d962270abbe602a
SimHash 2a0099288bb3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /sponsored/
Disallow /independence.php
Disallow /mcd-election-2017
Disallow /mcd-election-2017/*
Allow /wp-admin/admin-ajax.php

urxbot/*

Rule Path
Disallow

urx-api/*

Rule Path
Disallow

twitterbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /
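The groups above can be checked with a short script. This is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is reconstructed from the listed rules, so its exact wording, casing, and ordering are assumptions, and "examplebot" is a hypothetical crawler name used to exercise the * group. Python's parser applies the first matching rule and does not expand wildcards, so results for paths such as /wp-admin/admin-ajax.php may differ from crawlers that use longest-match evaluation.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report's groups; the real file's exact text,
# casing, and ordering are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /sponsored/
Disallow: /independence.php
Disallow: /mcd-election-2017
Disallow: /mcd-election-2017/*
Allow: /wp-admin/admin-ajax.php

User-agent: urxbot/*
Disallow:

User-agent: urx-api/*
Disallow:

User-agent: twitterbot
Disallow:

User-agent: baiduspider
Disallow: /

User-agent: facebookexternalhit
Disallow: /
"""

BASE = "https://static.india.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "examplebot" is a hypothetical agent that falls through to the * group.
    checks = [
        ("examplebot", "/wp-admin/"),
        ("examplebot", "/news/"),
        ("twitterbot", "/sponsored/"),
        ("baiduspider", "/"),
        ("facebookexternalhit", "/news/"),
    ]
    for agent, path in checks:
        print(f"{agent:22} {path:14} allowed={allowed(agent, path)}")
```

An empty Disallow value, as in the twitterbot group, places no restriction on that agent, while Disallow / blocks the whole site for baiduspider and facebookexternalhit.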

Other Records

Field Value
sitemap https://www.india.com/sitemap.xml
sitemap https://www.india.com/all-image-sitemap.xml
sitemap https://www.india.com/google-news-sitemap.xml
sitemap https://www.india.com/hindi-news/sitemap.xml
sitemap https://www.india.com/hindi-news/hindi-news-sitemap.xml
sitemap https://www.india.com/special-sitemap.xml

Comments

  • Baiduspider