belsport.cl
robots.txt

Robots Exclusion Standard data for belsport.cl

Resource Scan

Scan Details

Site Domain belsport.cl
Base Domain belsport.cl
Scan Status Ok
Last Scan2026-02-10T12:51:18+00:00
Next Scan 2026-03-12T12:51:18+00:00

Last Scan

Scanned2026-02-10T12:51:18+00:00
URL https://belsport.cl/robots.txt
Redirect https://www.belsport.cl/robots-belsport.txt
Redirect Domain www.belsport.cl
Redirect Base belsport.cl
Domain IPs 104.26.2.36, 104.26.3.36, 172.67.72.18, 2606:4700:20::681a:224, 2606:4700:20::681a:324, 2606:4700:20::ac43:4812
Redirect IPs 104.26.2.36, 104.26.3.36, 172.67.72.18, 2606:4700:20::681a:224, 2606:4700:20::681a:324, 2606:4700:20::ac43:4812
Response IP 172.67.72.18
Found Yes
Hash 50a1a4fcf4cf54d1ec803b6590e26e6b48c0dfc74ea01b75829ca4e42d7db5e4
SimHash 3c545f1eedea

Groups

*

Rule Path
Disallow /cart
Disallow /checkout
Disallow /my-account
Disallow /login
Disallow /*query%3D
Disallow /*search%3D
Disallow /search/

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://belsport.cl/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot