slu.edu
robots.txt

Robots Exclusion Standard data for slu.edu

Resource Scan

Scan Details

Site Domain slu.edu
Base Domain slu.edu
Scan Status Ok
Last Scan2024-09-19T12:19:29+00:00
Next Scan 2024-10-19T12:19:29+00:00

Last Scan

Scanned2024-09-19T12:19:29+00:00
URL https://slu.edu/robots.txt
Redirect https://www.slu.edu/robots.txt
Redirect Domain www.slu.edu
Redirect Base slu.edu
Domain IPs 173.213.236.59
Redirect IPs 173.213.236.59
Response IP 173.213.236.59
Found Yes
Hash ec3065b18c72aed04dc170d8d04b9c84af5309c89c4fcc2b6ab7cf9b59e2c827
SimHash 65185a42c6b0

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /archive/
Disallow /_dev/
Disallow /data/
Disallow /services/
Disallow /peoplefinder/
Disallow /_resources
Disallow /_resources/widgets
Disallow /_resources/xsl
Disallow /madrid/academics/courses/

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 9

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.slu.edu/sitemap.xml

Comments

  • Blocks robots from specific folders / directories