indiasupernews.com
robots.txt
Robots Exclusion Standard data for indiasupernews.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | indiasupernews.com |
Base Domain | indiasupernews.com |
Scan Status | Ok |
Last Scan | 2024-09-26T14:15:56+00:00 |
Next Scan | 2024-10-03T14:15:56+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-26T14:15:56+00:00 |
URL | https://indiasupernews.com/robots.txt |
Redirect | https://www.indiasupernews.com/robots.txt |
Redirect Domain | www.indiasupernews.com |
Redirect Base | indiasupernews.com |
Domain IPs | 2600:1413:5000:e::1736:9b85, 2600:1413:5000:e::1736:9b92, 42.99.140.170, 42.99.140.217 |
Redirect IPs | 2600:1413:5000:e::1736:9b90, 2600:1413:5000:e::1736:9b92, 42.99.140.161, 42.99.140.193 |
Response IP | 42.99.140.193 |
Found | Yes |
Hash | 62214c03dfd3059f0e50e306a6acc5b1fec5ad9d7ddf69765b33e108b0572aad |
SimHash | 611159735614 |
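The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The scan does not state which algorithm or which bytes it hashes, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The function names `content_fingerprint` and `has_changed` are hypothetical and exist only for this illustration of detecting a change between two scans.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_fingerprint(body) != previous_hash.lower()
```

A stored digest from the previous scan can be passed as `previous_hash` together with the newly fetched body; a `True` result would indicate that the file differs byte for byte.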
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /preview?article |
*
Rule | Path |
---|---|
Disallow | */can/evnt/click* |
*
Rule | Path |
---|---|
Disallow | *?col_ci=* |
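Three of the rules above rely on `*` wildcards, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way to evaluate these rules against a request path, assuming the wildcard and longest-match conventions described in RFC 9309 and using the pattern length as the measure of specificity. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the rule list is copied from the Groups tables rather than from the verbatim file.

```python
import re

# Rules as listed in the Groups section (all under user-agent "*").
RULES = [
    ("allow", "/"),
    ("disallow", "/preview?article"),
    ("disallow", "*/can/evnt/click*"),
    ("disallow", "*?col_ci=*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters and a trailing '$' anchors the end;
    everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # The longer pattern wins; on a tie, the allow rule wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ("/news/story", "/preview?article=1", "/story?col_ci=abc"):
        print(sample, is_allowed(sample))
```

Under these assumptions, ordinary article paths fall through to `Allow: /`, while preview URLs, click-tracking URLs, and any URL carrying a `col_ci` query parameter are excluded.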
Other Records
Field | Value |
---|---|
sitemap | https://www.indiasupernews.com/sitemap_index.xml |
sitemap | https://www.indiasupernews.com/sitemap.xml |
sitemap | https://www.indiasupernews.com/news-sitemap.xml |
sitemap | https://www.indiasupernews.com/category-sitemap.xml |
sitemap | https://www.indiasupernews.com/image-sitemap.xml |
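The sitemap records above can be read programmatically with the standard library's `urllib.robotparser`, whose `site_maps()` method is available from Python 3.8. The snippet is a minimal sketch: the `ROBOTS_LINES` text is reconstructed from the scan records and is an assumption, since the verbatim file is not shown here and may differ in layout.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records; not the verbatim robots.txt.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://www.indiasupernews.com/sitemap_index.xml",
    "Sitemap: https://www.indiasupernews.com/sitemap.xml",
    "Sitemap: https://www.indiasupernews.com/news-sitemap.xml",
    "Sitemap: https://www.indiasupernews.com/category-sitemap.xml",
    "Sitemap: https://www.indiasupernews.com/image-sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns None when no Sitemap records are present.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```

Against the live site, `parser.set_url(...)` followed by `parser.read()` would replace the hard-coded lines, at the cost of a network request.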