theweek.in
robots.txt

Robots Exclusion Standard data for theweek.in

Resource Scan

Scan Details

Site Domain theweek.in
Base Domain theweek.in
Scan Status Ok
Last Scan2024-05-01T04:33:00+00:00
Next Scan 2024-05-08T04:33:00+00:00

Last Scan

Scanned2024-05-01T04:33:00+00:00
URL https://theweek.in/robots.txt
Redirect https://www.theweek.in/robots.txt
Redirect Domain www.theweek.in
Redirect Base theweek.in
Domain IPs 23.222.245.77
Redirect IPs 23.52.112.217, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9
Response IP 23.54.56.229
Found Yes
Hash f0b0bcd892df93e59c0721bf1fd2a7ef0b402c756f1d951e82438a3e967b8b18
SimHash 7875f3248113

Groups

*

Rule Path
Disallow /content/week/archival/
Disallow /content/week/public-feed-configurations/
Disallow /cgi-bin/
Disallow /*/print.htm%C3%82
Disallow /*jcr%3Acontent*%C3%82
Disallow /_jcr_content*%C3%82

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /