utsa.edu
robots.txt

Robots Exclusion Standard data for utsa.edu

Resource Scan

Scan Details

Site Domain utsa.edu
Base Domain utsa.edu
Scan Status Ok
Last Scan 2024-05-19T12:54:01+00:00
Next Scan 2024-06-18T12:54:01+00:00

Last Scan

Scanned 2024-05-19T12:54:01+00:00
URL https://utsa.edu/robots.txt
Redirect https://www.utsa.edu/robots.txt
Redirect Domain www.utsa.edu
Redirect Base utsa.edu
Domain IPs 129.115.120.39
Redirect IPs 129.115.120.39
Response IP 129.115.120.39
Found Yes
Hash c1975967fc1fc6a718757a6319999ed492ad5059322a83d94c179273485dbb84
SimHash d90e3aeb73db
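
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so that is an assumption. Under that assumption, a minimal sketch of how such a content hash could be computed to detect changes between scans follows; `content_hash` is a hypothetical helper, not part of the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report only shows a 64-character hex string, not the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored hash with the hash of a freshly fetched body."""
    return content_hash(body) != previous_hash
```

Comparing the stored digest with the digest of a new fetch is a cheap way to decide whether the rules need to be re-parsed.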

Groups

webcrawler

Rule Path
Disallow

*

Rule Path
Disallow /advising/internal/
Disallow /cgi
Disallow /cgi-bin
Disallow /Cfdocs
Disallow /Cfide
Disallow /ice
Disallow /internet/iom
Disallow /orientation
Disallow /lead/adm/
Disallow /logs
Disallow /today/admin
Disallow /today/new
Disallow /oracledoc
Disallow /scsstaff
Disallow /tlc/
Disallow /cftemp
Disallow /java
Disallow /_derived
Disallow /_fpclass
Disallow /_private
Disallow /_themes
Disallow /_vti_cnf
Disallow /_vti_log
Disallow /_vti_pvt
Disallow /_vti_script
Disallow /_vti_txt
Disallow /_vit_inf
Disallow /ofponline/ol/ct-training.html
Disallow /concern
Disallow /financialaffairs/ds
Disallow /sombrilla/spring2020/index.html
Disallow /index-reference.html
Disallow /index-emergency.html
Disallow /index-footer.html
Disallow /index-review.html
Disallow /roadmap/do-your-part/toolkit-with-email-signature.html
Disallow /president/confidential/
Disallow /today/preview/*
Disallow /marcomstudio-dev/*
Disallow /safecampus-dev/*
Disallow /preview/*
Disallow /giving1/*
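
The two groups above can be checked with Python's standard `urllib.robotparser`. The sketch below writes a small subset of the listed rules back into robots.txt syntax (the `ROBOTS_TXT` string is a reconstruction for illustration, not the fetched file) and asks whether a given agent may fetch a path. `examplebot` and `is_allowed` are hypothetical names. Note that `urllib.robotparser` matches rules as plain path prefixes and does not expand `*` wildcards inside paths, so rules such as `/preview/*` are left out of this sketch.

```python
from urllib.robotparser import RobotFileParser

# Subset of the rules listed above, rewritten in robots.txt syntax.
# This is a reconstruction for illustration, not the original file.
ROBOTS_TXT = """\
User-agent: webcrawler
Disallow:

User-agent: *
Disallow: /advising/internal/
Disallow: /cgi
Disallow: /logs
Disallow: /president/confidential/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://www.utsa.edu" + path)


if __name__ == "__main__":
    for agent, path in [
        ("webcrawler", "/logs/access.log"),
        ("examplebot", "/logs/access.log"),
        ("examplebot", "/cgi-bin/search"),
        ("examplebot", "/advising/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

An empty `Disallow` value, as in the `webcrawler` group, places no restriction on that agent, while every other agent falls through to the `*` group and is blocked from any path that starts with one of its listed prefixes.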

Comments

  • robots.txt file for http://www.utsa.edu
  • This character signifies a comment tag
  • Dev sites