theweek.in
robots.txt
Robots Exclusion Standard data for theweek.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | theweek.in |
Base Domain | theweek.in |
Scan Status | Ok |
Last Scan | 2024-05-01T04:33:00+00:00 |
Next Scan | 2024-05-08T04:33:00+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-01T04:33:00+00:00 |
URL | https://theweek.in/robots.txt |
Redirect | https://www.theweek.in/robots.txt |
Redirect Domain | www.theweek.in |
Redirect Base | theweek.in |
Domain IPs | 23.222.245.77 |
Redirect IPs | 23.52.112.217, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9 |
Response IP | 23.54.56.229 |
Found | Yes |
Hash | f0b0bcd892df93e59c0721bf1fd2a7ef0b402c756f1d951e82438a3e967b8b18 |
SimHash | 7875f3248113 |
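The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan record does not name the algorithm. The sketch below shows how such a fingerprint could be computed over a fetched robots.txt body to detect changes between scans. The function name `fingerprint` and the choice of SHA-256 are assumptions for illustration, not something the report confirms.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 over the unmodified response body. The scan
    record does not state which algorithm or normalization it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Two scans of the same file compare equal; any byte change alters the digest.
previous = fingerprint(b"User-agent: *\nDisallow: /cgi-bin/\n")
current = fingerprint(b"User-agent: *\nDisallow: /cgi-bin/\n")
changed = previous != current
```

Comparing digests across the Last Scan and Next Scan timestamps would flag an exact-byte change; the SimHash field presumably serves the separate purpose of detecting near-duplicates.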
Groups
*
Rule | Path |
---|---|
Disallow | /content/week/archival/ |
Disallow | /content/week/public-feed-configurations/ |
Disallow | /cgi-bin/ |
Disallow | /*/print.htm%C3%82 |
Disallow | /*jcr%3Acontent*%C3%82 |
Disallow | /_jcr_content*%C3%82 |
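To illustrate how the rules in this group might be applied, here is a minimal sketch of wildcard matching in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch compares paths literally without percent-encoding normalization, which real crawlers may handle differently.

```python
import re

# Disallow paths copied verbatim from the "*" group above,
# including the trailing %C3%82 sequences as reported by the scan.
DISALLOW = [
    "/content/week/archival/",
    "/content/week/public-feed-configurations/",
    "/cgi-bin/",
    "/*/print.htm%C3%82",
    "/*jcr%3Acontent*%C3%82",
    "/_jcr_content*%C3%82",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a prefix-matching regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal segments and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Under this literal interpretation, `/cgi-bin/test` and anything below `/content/week/archival/` are blocked, while a path such as `/a/print.htm` is not, because the pattern as recorded also requires the trailing `%C3%82`.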