mheducation.com
robots.txt

Robots Exclusion Standard data for mheducation.com

Resource Scan

Scan Details

Site Domain mheducation.com
Base Domain mheducation.com
Scan Status Ok
Last Scan2024-10-31T08:59:25+00:00
Next Scan 2024-11-07T08:59:25+00:00

Last Scan

Scanned2024-10-31T08:59:25+00:00
URL https://mheducation.com/robots.txt
Redirect https://www.mheducation.com:443/robots.txt
Redirect Domain www.mheducation.com
Redirect Base mheducation.com
Domain IPs 3.86.21.60, 34.196.87.28, 54.243.70.188
Redirect IPs 18.161.97.101, 18.161.97.107, 18.161.97.31, 18.161.97.95
Response IP 13.226.2.94
Found Yes
Hash 52162c97121dfac928e06cbf52613eedd2b9777a5f11e9c047a7db128fee27fd
SimHash 225cde0041a2

Groups

*

Rule Path
Disallow /sharpen/*?utm_source
Disallow /highered/custom/product/*

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mheducation.com/index-sitemap.xml

Comments

  • Disallowing Google Bard and Vertex AI web crawlers

Warnings

  • `host` is not a known field.