mgbox.by
robots.txt

Robots Exclusion Standard data for mgbox.by

Resource Scan

Scan Details

Site Domain mgbox.by
Base Domain mgbox.by
Scan Status Ok
Last Scan 2025-10-23T02:26:31+00:00
Next Scan 2025-11-22T02:26:31+00:00

Last Scan

Scanned 2025-10-23T02:26:31+00:00
URL https://mgbox.by/robots.txt
Domain IPs 104.21.63.42, 172.67.143.17, 2606:4700:3035::6815:3f2a, 2606:4700:3036::ac43:8f11
Response IP 172.67.143.17
Found Yes
Hash ab7b193dad32ce90b84698fbb1d28fd82e868474764bf4cc51da72f2a5b2a9d2
SimHash 4e2cbae40731

Groups

*

Rule Path
Disallow
Disallow /acat/*
Disallow /search/number/
Disallow /admintools/
Disallow /adcp/
Disallow /security/
Disallow /login/
Disallow /account/
Disallow /history_orders/
Disallow /*?filter
Disallow /*?grid=table
Disallow /*?limit=
Disallow /*?sort=price_asc
Disallow /*?grid=list
Disallow /*?sort=reset
Disallow /*?page=
Disallow /*?mark=
Disallow /*?close
Disallow /*?disallow
Allow *.css
Allow *.js
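
The rule table above can be evaluated against a request path roughly as follows. This is a minimal sketch, assuming the longest-match precedence described in RFC 9309, where `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration and are not part of the scan report. Real crawlers may differ in details such as percent-encoding and tie-breaking.

```python
import re

# Rules copied from the "*" group above, in file order.
# The first Disallow has an empty path, which matches nothing.
RULES = [
    ("disallow", ""),
    ("disallow", "/acat/*"),
    ("disallow", "/search/number/"),
    ("disallow", "/admintools/"),
    ("disallow", "/adcp/"),
    ("disallow", "/security/"),
    ("disallow", "/login/"),
    ("disallow", "/account/"),
    ("disallow", "/history_orders/"),
    ("disallow", "/*?filter"),
    ("disallow", "/*?grid=table"),
    ("disallow", "/*?limit="),
    ("disallow", "/*?sort=price_asc"),
    ("disallow", "/*?grid=list"),
    ("disallow", "/*?sort=reset"),
    ("disallow", "/*?page="),
    ("disallow", "/*?mark="),
    ("disallow", "/*?close"),
    ("disallow", "/*?disallow"),
    ("allow", "*.css"),
    ("allow", "*.js"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` (including any query string) may be fetched."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, prefer Allow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/catalog/", "/login/", "/catalog/?page=2", "/static/app.css"):
        print(sample, is_allowed(sample))
```

Under this longest-match assumption, the short `Allow *.css` and `Allow *.js` patterns do not override a longer matching Disallow, so a stylesheet under `/acat/` would still be treated as blocked.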

Other Records

Field Value
crawl-delay 1
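
A crawler that chooses to honor this record could space its requests as sketched below. This assumes the value is interpreted as seconds between consecutive requests, which is the common reading of `crawl-delay` but is not defined by RFC 9309. The `Throttle` class and its `clock` and `sleep` parameters are hypothetical names used only for this example.

```python
import time

CRAWL_DELAY_SECONDS = 1.0  # value of the crawl-delay record above


class Throttle:
    """Enforce a minimum gap between successive calls to wait()."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock  # injectable for testing
        self._sleep = sleep  # injectable for testing
        self._last = None    # time of the previous request, if any

    def wait(self):
        """Sleep if needed and return the number of seconds slept."""
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            pause = max(0.0, self.delay - (now - self._last))
            if pause > 0:
                self._sleep(pause)
        self._last = self._clock()
        return pause


if __name__ == "__main__":
    throttle = Throttle(CRAWL_DELAY_SECONDS)
    for n in range(3):
        throttle.wait()  # call before each request to the site
        print("request", n)
```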

Other Records

Field Value
sitemap /sitemap.xml
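
The sitemap value is recorded as a path rather than a full URL. A tolerant client might resolve it against the robots.txt URL from the scan details, as in this small sketch. Treating the value as a relative reference is an assumption; the sitemaps protocol expects a fully qualified URL in this field.

```python
from urllib.parse import urljoin

ROBOTS_URL = "https://mgbox.by/robots.txt"  # URL from the scan details
SITEMAP_VALUE = "/sitemap.xml"              # value of the sitemap record

# Resolve the relative reference against the robots.txt location.
sitemap_url = urljoin(ROBOTS_URL, SITEMAP_VALUE)
print(sitemap_url)  # https://mgbox.by/sitemap.xml
```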

Warnings

  • `host` is not a known field.