epaper.thehindu.com
robots.txt
Robots Exclusion Standard data for epaper.thehindu.com
Resource Scan
Scan Details
Site Domain | epaper.thehindu.com |
Base Domain | thehindu.com |
Scan Status | Ok |
Last Scan | 2024-05-19T21:44:36+00:00 |
Next Scan | 2024-06-02T21:44:36+00:00 |
Last Scan
Scanned | 2024-05-19T21:44:36+00:00 |
URL | https://epaper.thehindu.com/robots.txt |
Domain IPs | 104.18.39.235, 172.64.148.21, 2606:4700:4400::6812:27eb, 2606:4700:4400::ac40:9415 |
Response IP | 104.18.39.235 |
Found | Yes |
Hash | 71cae851ae5505eda7ed8a4de637d7dcd7d9c5ea851dccbb90059436da6d054f |
SimHash | 3800e820e7e2 |
Groups
*
Rule | Path |
---|---|
Disallow | /ccidist-ws/ |
Disallow | *%3Bhttp%3A* |
Disallow | *%3Bhttps%3A* |
Disallow | *%20http%3A* |
Disallow | *%20https%3A* |
Disallow | */couponRedirect |
Disallow | *?redirect= |
Disallow | *?store= |
Disallow | /200* |
Disallow | /201* |
Disallow | */http%3A* |
Disallow | */https%3A* |
Disallow | */mailto%3A* |
Disallow | *.ecehttp* |
Disallow | *.ece1http* |
Disallow | *.ece2http* |
Disallow | */appwebview |
Disallow | /search/ |
Disallow | /SEARCH/ |
Disallow | /Search/ |
Disallow | *?tpcc= |
Disallow | */?_ptid=* |
Warnings
- 2 invalid lines.