hacc.edu
robots.txt

Robots Exclusion Standard data for hacc.edu

Resource Scan

Scan Details

Site Domain hacc.edu
Base Domain hacc.edu
Scan Status Ok
Last Scan 2026-03-07T13:33:31+00:00
Next Scan 2026-04-06T13:33:31+00:00

Last Scan

Scanned 2026-03-07T13:33:31+00:00
URL https://hacc.edu/robots.txt
Redirect https://www.hacc.edu/robots.txt
Redirect Domain www.hacc.edu
Redirect Base hacc.edu
Domain IPs 54.152.27.216
Redirect IPs 34.192.183.230, 34.206.172.184, 54.85.181.126
Response IP 34.206.172.184
Found Yes
Hash e170e2eb1b9ee216e489d865e5b738e77205567d4566e31d60467bc33892d54c
SimHash 6701d470e7db

Groups

*

Rule Path
Disallow /commonspot

ut-dorkbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

amazonbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.hacc.edu/sitemap.xml
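
As a rough illustration of how a crawler might interpret the groups and records listed above, here is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT text is reconstructed from this report rather than copied from the live file, so its ordering is an assumption and the commented-out lines are left out. The user agent "SomeBot" and the example page paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (an assumption: the live
# file's ordering and its commented-out lines are not reproduced here).
ROBOTS_TXT = """\
User-agent: *
Disallow: /commonspot

User-agent: ut-dorkbot
Crawl-delay: 5

User-agent: amazonbot
Crawl-delay: 5

Sitemap: https://www.hacc.edu/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "SomeBot" is a hypothetical agent with no group of its own, so it falls
# back to the "*" group and its Disallow rule.
print(parser.can_fetch("SomeBot", "https://www.hacc.edu/commonspot/page"))
print(parser.can_fetch("SomeBot", "https://www.hacc.edu/admissions"))

# amazonbot has its own group with no path rules, only a crawl delay.
print(parser.can_fetch("amazonbot", "https://www.hacc.edu/commonspot/page"))
print(parser.crawl_delay("amazonbot"))

# Sitemap records apply to the whole file, not to a single group.
print(parser.site_maps())
```

Because a named group replaces the "*" group for that agent instead of adding to it, this parser treats /commonspot as allowed for amazonbot and ut-dorkbot. Other robots.txt parsers may handle crawl-delay differently, since it is not part of the core standard.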

Comments

  • Sitemap: https://hacc.edu/sitemap.xml
  • User-agent: AhrefsBot
  • Disallow: /Calendar
  • User-agent: Arachni
  • Disallow: /