theweekjunior.co.uk
robots.txt
Robots Exclusion Standard data for theweekjunior.co.uk
Resource Scan
Scan Details
| Site Domain | theweekjunior.co.uk |
| Base Domain | theweekjunior.co.uk |
| Scan Status | Ok |
| Last Scan | 2026-01-27T12:02:46+00:00 |
| Next Scan | 2026-02-03T12:02:46+00:00 |
Last Scan
| Scanned | 2026-01-27T12:02:46+00:00 |
| URL | https://theweekjunior.co.uk/robots.txt |
| Domain IPs | 199.232.194.114, 199.232.198.114 |
| Response IP | 199.232.198.114 |
| Found | Yes |
| Hash | 655869f49c40d11c99cb751b22ec41fcac7252a9e233105f8ab5719ab33c5fca |
| SimHash | 6024d4829d31 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | */deals/compare |
| Disallow | */html/ |
| Disallow | */p/*/embed/captioned |
| Disallow | */outlink* |
| Disallow | *searchTerm%3D* |
| Disallow | *sortBy%3D* |
| Disallow | *productBrand%3D* |
| Disallow | *%7B*%7D* |
| Disallow | *seenMatchId%3D* |
| Disallow | /infinite-scroll-article/* |
| Disallow | /infinite-scroll-review/* |
| Disallow | /infinite-scroll-recipe/* |
bytespider
mistralai
cohere
ai2bot
youbot
omgili
diffbot
kangaroo
img2dataset
amazonbot
amazon-qbusiness
| Rule | Path |
|---|---|
| Disallow | / |
*
| Rule | Path |
|---|---|
| Disallow | /search |
| Disallow | /*searchTerm |
| Disallow | /deals/compare |
| Disallow | /shop* |
| Disallow | /*productBrand |
| Disallow | *jwsource%3D* |
*
No rules defined. All paths allowed.
Other Records
| Field | Value |
|---|---|
| sitemap | https://theweekjunior.co.uk/sitemap.xml |
| sitemap | https://sciencenature.theweekjunior.co.uk/sitemap.xml |
| sitemap | https://theweekjunior.co.uk/sitemap.xml |
| sitemap | https://theweekjunior.co.uk/sitemap-news.xml |
Comments