s3.india.com
robots.txt
Robots Exclusion Standard data for s3.india.com
Resource Scan
Scan Details
Site Domain | s3.india.com |
Base Domain | india.com |
Scan Status | Ok |
Last Scan | 2024-05-27T16:24:26+00:00 |
Next Scan | 2024-06-26T16:24:26+00:00 |
Last Scan
Scanned | 2024-05-27T16:24:26+00:00 |
URL | https://s3.india.com/robots.txt |
Domain IPs | 23.33.184.239, 23.33.184.240, 2600:140e:6::17ca:22ea, 2600:140e:6::b81a:5b0f |
Response IP | 23.202.33.114 |
Found | Yes |
Hash | dee0de0ddaa995f62b700c70b4ce0f417a9afc54f70bf9d59d962270abbe602a |
SimHash | 2a0099288bb3 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /sponsored/ |
Disallow | /independence.php |
Disallow | /mcd-election-2017 |
Disallow | /mcd-election-2017/* |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://www.india.com/sitemap.xml |
sitemap | https://www.india.com/all-image-sitemap.xml |
sitemap | https://www.india.com/google-news-sitemap.xml |
sitemap | https://www.india.com/hindi-news/sitemap.xml |
sitemap | https://www.india.com/hindi-news/hindi-news-sitemap.xml |
sitemap | https://www.india.com/special-sitemap.xml |
Comments