mwcc.edu
robots.txt

Robots Exclusion Standard data for mwcc.edu

Resource Scan

Scan Details

Site Domain mwcc.edu
Base Domain mwcc.edu
Scan Status Ok
Last Scan2024-09-25T12:46:17+00:00
Next Scan 2024-10-25T12:46:17+00:00

Last Scan

Scanned2024-09-25T12:46:17+00:00
URL https://mwcc.edu/robots.txt
Domain IPs 162.159.135.42
Response IP 162.159.135.42
Found Yes
Hash b1dbb83d356ec6638dde594e1c53013d04fe88d4defd2f79abf801f8cdd8d2b0
SimHash 6108d80049b1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /thanks-for-your-request/
Allow /wp-admin/admin-ajax.php
Disallow /*.pdf$

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://mwcc.edu/sitemap_index.xml