www.necc.mass.edu
robots.txt
Robots Exclusion Standard data for www.necc.mass.edu
Resource Scan
Scan Details
Site Domain | www.necc.mass.edu |
Base Domain | mass.edu |
Scan Status | Ok |
Last Scan | 2025-08-21T17:47:21+00:00 |
Next Scan | 2025-09-20T17:47:21+00:00 |
Last Scan
Scanned | 2025-08-21T17:47:21+00:00 |
URL | https://www.necc.mass.edu/robots.txt |
Domain IPs | 167.224.111.74 |
Response IP | 167.224.111.74 |
Found | Yes |
Hash | d42b2907743cb48628e6533e36dc613a8c1d06088149b63a1b17e4b5362ceda4 |
SimHash | cd45c63a5653 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /trackback/ |
Disallow | /cgi-bin/ |
Disallow | /early-childhood-director-certificate/ |
Disallow | /events/category/ |
Disallow | /events/tag/ |
Disallow | /campaigns |
Other Records
Field | Value |
---|---|
sitemap | https://www.necc.mass.edu/sitemap_index.xml |