epaper.thehindu.com
robots.txt

Robots Exclusion Standard data for epaper.thehindu.com

Resource Scan

Scan Details

Site Domain epaper.thehindu.com
Base Domain thehindu.com
Scan Status Ok
Last Scan2024-05-19T21:44:36+00:00
Next Scan 2024-06-02T21:44:36+00:00

Last Scan

Scanned2024-05-19T21:44:36+00:00
URL https://epaper.thehindu.com/robots.txt
Domain IPs 104.18.39.235, 172.64.148.21, 2606:4700:4400::6812:27eb, 2606:4700:4400::ac40:9415
Response IP 104.18.39.235
Found Yes
Hash 71cae851ae5505eda7ed8a4de637d7dcd7d9c5ea851dccbb90059436da6d054f
SimHash 3800e820e7e2

Groups

*

Rule Path
Disallow /ccidist-ws/
Disallow *%3Bhttp%3A*
Disallow *%3Bhttps%3A*
Disallow *%20http%3A*
Disallow *%20https%3A*
Disallow */couponRedirect
Disallow *?redirect=
Disallow *?store=
Disallow /200*
Disallow /201*
Disallow */http%3A*
Disallow */https%3A*
Disallow */mailto%3A*
Disallow *.ecehttp*
Disallow *.ece1http*
Disallow *.ece2http*
Disallow */appwebview
Disallow /search/
Disallow /SEARCH/
Disallow /Search/
Disallow *?tpcc=
Disallow */?_ptid=*

Warnings

  • 2 invalid lines.