s3.india.com
robots.txt

Robots Exclusion Standard data for s3.india.com

Resource Scan

Scan Details

Site Domain s3.india.com
Base Domain india.com
Scan Status Ok
Last Scan2024-05-27T16:24:26+00:00
Next Scan 2024-06-26T16:24:26+00:00

Last Scan

Scanned2024-05-27T16:24:26+00:00
URL https://s3.india.com/robots.txt
Domain IPs 23.33.184.239, 23.33.184.240, 2600:140e:6::17ca:22ea, 2600:140e:6::b81a:5b0f
Response IP 23.202.33.114
Found Yes
Hash dee0de0ddaa995f62b700c70b4ce0f417a9afc54f70bf9d59d962270abbe602a
SimHash 2a0099288bb3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /sponsored/
Disallow /independence.php
Disallow /mcd-election-2017
Disallow /mcd-election-2017/*
Allow /wp-admin/admin-ajax.php

urxbot/*

Rule Path
Disallow

urx-api/*

Rule Path
Disallow

twitterbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.india.com/sitemap.xml
sitemap https://www.india.com/all-image-sitemap.xml
sitemap https://www.india.com/google-news-sitemap.xml
sitemap https://www.india.com/hindi-news/sitemap.xml
sitemap https://www.india.com/hindi-news/hindi-news-sitemap.xml
sitemap https://www.india.com/special-sitemap.xml

Comments

  • Baiduspider