unt.edu
robots.txt

Robots Exclusion Standard data for unt.edu

Resource Scan

Scan Details

Site Domain unt.edu
Base Domain unt.edu
Scan Status Ok
Last Scan2025-12-31T23:38:50+00:00
Next Scan 2026-01-30T23:38:50+00:00

Last Scan

Scanned2025-12-31T23:38:50+00:00
URL https://unt.edu/robots.txt
Redirect https://www.unt.edu/robots.txt
Redirect Domain www.unt.edu
Redirect Base unt.edu
Domain IPs 172.202.165.49
Redirect IPs 172.202.165.49
Response IP 172.202.165.49
Found Yes
Hash 9c71fbee401afd7ac2c98473594126085c0912fe209aa068f503df0198fd8e25
SimHash fc214176c5d5

Groups

*

Rule Path
Disallow /archive/
Disallow /_archive/
Disallow /_dev/
Disallow /_migration_www/
Disallow /_resources/
Disallow /_showcase/
Disallow /ou-alerts/
Disallow /ubsc_testing_site/
Disallow /admissions/archive/
Disallow /admissions/dev/
Disallow /admissions/dev-1/
Disallow /admissions/contact-us/archive/
Disallow /academics/_archive/
Disallow /success/_archived/
Disallow /news/
Disallow /_enrollment-dev/
Disallow /strategic-plan/_dev/

Other Records

Field Value
sitemap https://www.unt.edu/sitemap.xml