business.makro.co.za
robots.txt

Robots Exclusion Standard data for business.makro.co.za

Resource Scan

Scan Details

Site Domain business.makro.co.za
Base Domain makro.co.za
Scan Status Ok
Last Scan2024-11-03T12:48:45+00:00
Next Scan 2024-11-17T12:48:45+00:00

Last Scan

Scanned2024-11-03T12:48:45+00:00
URL https://business.makro.co.za/robots.txt
Domain IPs 23.32.29.89, 23.32.29.90, 2600:1413:b000:1d::17d1:2e90, 2600:1413:b000:1d::17d1:2e96
Response IP 23.215.7.32
Found Yes
Hash 05fb629910efe1a91c3fedb3025b3d22cb78e549c78d007e3b0ad6bbc1d7eacd
SimHash bc57b77bcffc

Groups

*

Rule Path
Allow /commercial
Disallow /cart/
Disallow /checkout/
Disallow /my-account/
Disallow /login/
Disallow /newpassword
Disallow /otp
Disallow /*/c/
Disallow /c/
Disallow /*/p/
Disallow /p/
Disallow /*/search/
Disallow /search/
Disallow /search*
Disallow /*/category/
Disallow /allCategory/
Disallow /commercial-offering
Disallow /blocked
Disallow /*?*q=
Disallow /wmapi/bff*

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://business.makro.co.za/sitemap.xml

Comments

  • For all robots
  • Allow access to specific groups of pages
  • Block access to specific groups of pages
  • Block Search string pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot