mheducation.com
robots.txt

Robots Exclusion Standard data for mheducation.com

Resource Scan

Scan Details

Site Domain mheducation.com
Base Domain mheducation.com
Scan Status Ok
Last Scan 2024-06-26T23:27:29+00:00
Next Scan 2024-07-03T23:27:29+00:00

Last Scan

Scanned 2024-06-26T23:27:29+00:00
URL https://mheducation.com/robots.txt
Redirect https://www.mheducation.com:443/robots.txt
Redirect Domain www.mheducation.com
Redirect Base mheducation.com
Domain IPs 35.170.224.167, 35.173.176.214, 54.211.240.81
Redirect IPs 18.161.6.11, 18.161.6.13, 18.161.6.31, 18.161.6.62
Response IP 18.165.171.36
Found Yes
Hash 52162c97121dfac928e06cbf52613eedd2b9777a5f11e9c047a7db128fee27fd
SimHash 225cde0041a2

Groups

*

Rule Path
Disallow /sharpen/*?utm_source
Disallow /highered/custom/product/*

google-extended

Rule Path
Disallow /
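
The two groups above can be checked against sample paths with a small matcher. This is a minimal sketch, not the scanner's own logic: it assumes the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path), and it handles only Disallow rules because no Allow rules are listed here. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical, as are the sample paths.

```python
import re

# Rules copied from the Groups tables above: user agent -> disallowed path patterns.
GROUPS = {
    "*": ["/sharpen/*?utm_source", "/highered/custom/product/*"],
    "google-extended": ["/"],
}


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(agent: str, path: str) -> bool:
    """Return True if `path` matches a Disallow rule for `agent`."""
    # Use the agent's own group if one exists, otherwise fall back to '*'.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    # re.match anchors at the start, so each rule acts as a prefix pattern.
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    print(is_disallowed("google-extended", "/"))
    print(is_disallowed("examplebot", "/highered/custom/product/123"))
    print(is_disallowed("examplebot", "/sharpen/page?utm_source=mail"))
    print(is_disallowed("examplebot", "/sharpen/page"))
```

Under these assumptions, the google-extended agent is blocked from every path, while any other agent is blocked only from /highered/custom/product/ paths and from /sharpen/ URLs whose query string begins with utm_source.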

Other Records

Field Value
sitemap https://www.mheducation.com/index-sitemap.xml

Comments

  • Disallowing Google Bard and Vertex AI web crawlers

Warnings

  • `host` is not a known field.