my1.mheducation.com
robots.txt

Robots Exclusion Standard data for my1.mheducation.com

Resource Scan

Scan Details

Site Domain my1.mheducation.com
Base Domain mheducation.com
Scan Status Ok
Last Scan2025-06-07T22:57:19+00:00
Next Scan 2025-06-14T22:57:19+00:00

Last Scan

Scanned2025-06-07T22:57:19+00:00
URL https://my1.mheducation.com/robots.txt
Redirect https://www.mheducation.com:443/robots.txt
Redirect Domain www.mheducation.com
Redirect Base mheducation.com
Domain IPs 35.169.182.133, 52.22.142.137, 54.236.147.247
Redirect IPs 99.86.4.101, 99.86.4.30, 99.86.4.33, 99.86.4.43
Response IP 18.165.72.119
Found Yes
Hash 78e2eef37b06afadad4d4178b9f5700d371351b5b0c504e8caef0ae6e20a0c0f
SimHash 225cde0041a2

Groups

*

Rule Path
Disallow /sharpen/*?utm_source
Disallow /highered/custom/product/*

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.mheducation.com/index-sitemap.xml

Comments

  • Disallowing Google Bard and Vertex AI web crawlers

Warnings

  • `host` is not a known field.