mhccorp.com
robots.txt

Robots Exclusion Standard data for mhccorp.com

Resource Scan

Scan Details

Site Domain mhccorp.com
Base Domain mhccorp.com
Scan Status Ok
Last Scan 2024-11-07T16:05:33+00:00
Next Scan 2024-12-07T16:05:33+00:00

Last Scan

Scanned 2024-11-07T16:05:33+00:00
URL https://mhccorp.com/robots.txt
Redirect https://www.mhccorp.com/robots.txt
Redirect Domain www.mhccorp.com
Redirect Base mhccorp.com
Domain IPs 109.169.83.162
Redirect IPs 109.169.83.162
Response IP 109.169.83.162
Found Yes
Hash 2256fa3f5f849f910844dea7c993b42f3ac5df7ce0b2368bf85689af47a15401
SimHash 4a0ba8c64797

Groups

*

Rule Path
Disallow /
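
The catch-all group above applies only to crawlers that are not named in a more specific group. As a rough sketch of that selection step, assuming the case-insensitive product-token matching described in RFC 9309 (the function name and the "named" label are hypothetical, and the agent list is copied from the group listed in this report):

```python
# User agents named in this report's second group.
NAMED_AGENTS = {"googlebot", "slurp", "yandex", "bingbot", "deepcrawl"}

def select_group(product_token: str) -> str:
    """Return "named" if the crawler is listed by name, otherwise "*".

    Hypothetical helper: it only models exact, case-insensitive
    product-token matching, not full User-Agent header parsing.
    """
    token = product_token.strip().lower()
    return "named" if token in NAMED_AGENTS else "*"

if __name__ == "__main__":
    for agent in ("Googlebot", "bingbot", "examplebot"):
        print(agent, "->", select_group(agent))
```

Under this reading, a crawler such as the made-up "examplebot" falls into the * group and is disallowed from the whole site, while the five named crawlers follow their own rule list instead.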

googlebot
slurp
yandex
bingbot
deepcrawl

Rule Path
Allow /
Disallow /*%26tag%3D
Disallow /*?tag=
Disallow /*%26filter_name%3D
Disallow /*?filter_name=
Disallow /*?route=checkout%2F
Disallow /*%26route%3Dcheckout/
Disallow /*?route=account%2F
Disallow /*%26route%3Daccount/
Disallow /*?route=product%2Fsearch
Disallow /*%26route%3Dproduct/search
Disallow /*?route=product%2Fproduct%2Freview
Disallow /*%26route%3Dproduct/product/review
Disallow /*?route=information%2Finformation%2Fagree
Disallow /*%26route%3Dinformation/information/agree
Disallow /*?page=1
Disallow /*%26page%3D1
Disallow /*?route=affiliate%2F
Disallow /*%26route%3Daffiliate/
Disallow /*?keyword=
Disallow /*%26keyword%3D
Disallow /*?order=
Disallow /*%26order%3D
Disallow /*?sort=
Disallow /*%26sort%3D
Disallow /admin/
Disallow /system/
Disallow /catalog/
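
To illustrate how the Allow and Disallow rules above might be evaluated against a URL path, here is a minimal sketch. It assumes the matching model of RFC 9309: "*" matches any run of characters, a trailing "$" anchors the end, the longest matching pattern wins, and Allow wins a tie. The function names are hypothetical, only a subset of the listed rules is included, and paths are compared literally with no percent-decoding, so the %26 and %3D variants are not modelled.

```python
import re

# A subset of the rules listed above, as (kind, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/*?tag="),
    ("disallow", "/*?page=1"),
    ("disallow", "/*?sort="),
    ("disallow", "/admin/"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with ".*" for each "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow (or none match)."""
    best_len, best_kind = -1, "allow"
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"

if __name__ == "__main__":
    for path in ("/products/widget", "/products?tag=sale",
                 "/admin/login", "/list?page=12"):
        print(path, "->", "allowed" if is_allowed(path) else "disallowed")
```

Because the patterns are unanchored prefixes, a rule such as /*?page=1 also matches /list?page=12 in this sketch. Real crawlers may differ in detail, particularly in how they treat percent-encoded characters.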

Other Records

Field Value
sitemap https://www.mhccorp.com/sitemap.xml

Comments

  • User-agent: deepcrawl
  • Disallow: /