coop.se
robots.txt

Robots Exclusion Standard data for coop.se

Resource Scan

Scan Details

Site Domain coop.se
Base Domain coop.se
Scan Status Ok
Last Scan2024-11-04T14:06:52+00:00
Next Scan 2024-11-18T14:06:52+00:00

Last Scan

Scanned2024-11-04T14:06:52+00:00
URL https://coop.se/robots.txt
Redirect https://www.coop.se/robots.txt
Redirect Domain www.coop.se
Redirect Base coop.se
Domain IPs 185.195.93.127, 2a0a:56c4::b9c3:5c83
Redirect IPs 104.18.42.172, 172.64.145.84, 2606:4700:4400::6812:2aac, 2606:4700:4400::ac40:9154
Response IP 104.18.42.172
Found Yes
Hash 34d6d0c44804fa6fa68d7fbb0e136e807f214e03d83375af805f8a0235af642d
SimHash 984ed276edf2

Groups

*

Rule Path
Disallow /mitt-coop
Disallow /mitt-coop/
Disallow /handla/sok/*
Disallow /handla/kopklara-recept/*filter%3D
Disallow /handla/kopklara-recept/*cookingSort%3D
Disallow /handla/betala
Disallow /handla/betala/*
Disallow /Recept--mat/mat-for-alla-tillfallen2/
Disallow /recept/*filter%3D
Disallow /recept/*sort%3D
Disallow /handla/search*
Disallow /handla/filter*
Disallow /handla/sort*
Allow /

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.coop.se/sitemap.xml

Comments

  • robots.txt for www.coop.se
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot