slu.edu
robots.txt

Robots Exclusion Standard data for slu.edu

Resource Scan

Scan Details

Site Domain slu.edu
Base Domain slu.edu
Scan Status Ok
Last Scan 2024-10-19T12:19:41+00:00
Next Scan 2024-11-18T12:19:41+00:00

Last Scan

Scanned 2024-10-19T12:19:41+00:00
URL https://slu.edu/robots.txt
Redirect https://www.slu.edu/robots.txt
Redirect Domain www.slu.edu
Redirect Base slu.edu
Domain IPs 173.213.236.59
Redirect IPs 173.213.236.59
Response IP 173.213.236.59
Found Yes
Hash 251a7d57f78cbb04bbc14d00674c697900a6bf47f25b0a94df9d60a288b1dac6
SimHash 65185a42c6b8

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /archive/
Disallow /_dev/
Disallow /data/
Disallow /services/
Disallow /peoplefinder/
Disallow /_resources
Disallow /_resources/widgets
Disallow /_resources/xsl
Disallow /madrid/academics/courses/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 9

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 9

Other Records

Field Value
sitemap https://www.slu.edu/sitemap.xml
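To see how a crawler might interpret the groups above, the following is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is an assumed reconstruction from the records listed in this scan (the real file may order or format its lines differently), and `ExampleBot` is a hypothetical crawler name used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the groups listed above;
# the actual robots.txt may differ in ordering, spacing, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /archive/
Disallow: /_dev/
Disallow: /data/
Disallow: /services/
Disallow: /peoplefinder/
Disallow: /_resources
Disallow: /_resources/widgets
Disallow: /_resources/xsl
Disallow: /madrid/academics/courses/

User-agent: bingbot
Crawl-delay: 9

User-agent: mj12bot
Crawl-delay: 9

Sitemap: https://www.slu.edu/sitemap.xml
"""

BASE = "https://www.slu.edu"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a hypothetical user agent that only matches the "*" group.
for agent, path in [
    ("ExampleBot", "/archive/2020/"),
    ("ExampleBot", "/_resources/widgets/x.js"),
    ("ExampleBot", "/madrid/"),
    ("bingbot", "/archive/2020/"),
]:
    print(agent, path, parser.can_fetch(agent, BASE + path))

# Crawl delay is read from the group that matches the user agent.
print("bingbot delay:", parser.crawl_delay("bingbot"))
print("ExampleBot delay:", parser.crawl_delay("ExampleBot"))

# site_maps() requires Python 3.8 or later.
print("sitemaps:", parser.site_maps())
```

Under these assumptions, the hypothetical `ExampleBot` is refused `/archive/2020/` and anything beneath `/_resources`, while `bingbot` is allowed everywhere because its own group defines no rules and a crawler that matches a specific group does not fall back to `*`. Since rules match by path prefix, `Disallow /_resources` already covers `/_resources/widgets` and `/_resources/xsl`.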

Comments

  • Blocks robots from specific folders / directories