archive.mpac.org
robots.txt

Robots Exclusion Standard data for archive.mpac.org

Resource Scan

Scan Details

Site Domain archive.mpac.org
Base Domain mpac.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2025-09-20T00:44:18+00:00
Next Scan 2025-12-19T00:44:18+00:00

Last Successful Scan

Scanned2024-08-26T20:53:53+00:00
URL https://archive.mpac.org/robots.txt
Domain IPs 104.21.72.41, 172.67.174.204, 2606:4700:3030::ac43:aecc, 2606:4700:3032::6815:4829
Response IP 104.21.72.41
Found Yes
Hash 97ed3b263e17ecb7926ab15399ccaa7ae91f3c5b6d7fd79af26a0857416b7c64
SimHash 31586880289b

Groups

*

Rule Path
Disallow /assets/cache/
Disallow /assets/docs/
Disallow /assets/export/
Disallow /assets/import/
Disallow /assets/modules/
Disallow /assets/plugins/
Disallow /assets/snippets/
Disallow /install/
Disallow /manager/

Other Records

Field Value
sitemap http://www.mpac.org/sitemap.xml

Comments

  • Default modx exclusions
  • For sitemaps.xml autodiscovery. Uncomment if you have one: