4.images.theweek.com
robots.txt
Robots Exclusion Standard data for 4.images.theweek.com
Resource Scan
Scan Details
Site Domain | 4.images.theweek.com |
Base Domain | theweek.com |
Scan Status | Ok |
Last Scan | 2024-04-25T05:52:42+00:00 |
Next Scan | 2024-05-25T05:52:42+00:00 |
Last Scan
Scanned | 2024-04-25T05:52:42+00:00 |
URL | https://4.images.theweek.com/robots.txt |
Redirect | https://theweek.com:443/robots.txt |
Redirect Domain | theweek.com |
Redirect Base | theweek.com |
Domain IPs | 3.220.87.34, 3.223.42.158, 35.169.142.225, 35.174.50.69, 52.201.11.20, 52.203.214.110 |
Redirect IPs | 199.232.194.114, 199.232.198.114 |
Response IP | 199.232.194.114 |
Found | Yes |
Hash | 7136b4aa2d97c66f12d45353c6e5f502622b2787f06aab661f410347ebe5757f |
SimHash | 2024f480ad19 |
Groups
*
Rule | Path |
---|---|
Disallow | */deals/compare |
Disallow | */html/ |
Disallow | */p/*/embed/captioned |
Disallow | *searchTerm%3D* |
Disallow | *sortBy%3D* |
Disallow | *productBrand%3D* |
Disallow | *%7B*%7D* |
Disallow | /infinite-scroll-article/* |
Disallow | /infinite-scroll-review/* |
Disallow | /infinite-scroll-recipe/* |
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /359/ |
Disallow | /content/ |
Disallow | /blaize/datalayer |
Disallow | /*?*xhr=* |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
sitemap | https://theweek.com/sitemap.xml |
sitemap | https://theweek.com/uk/sitemap.xml |
Comments