dordt.edu
robots.txt

Robots Exclusion Standard data for dordt.edu

Resource Scan

Scan Details

Site Domain dordt.edu
Base Domain dordt.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-15T16:36:52+00:00
Next Scan 2024-11-22T16:36:52+00:00

Last Successful Scan

Scanned2024-11-07T16:36:14+00:00
URL https://dordt.edu/robots.txt
Redirect https://www.dordt.edu/robots.txt
Redirect Domain www.dordt.edu
Redirect Base dordt.edu
Domain IPs 34.223.217.41
Redirect IPs 13.33.88.109, 13.33.88.2, 13.33.88.55, 13.33.88.80, 2600:9000:223b:200:13:b13a:ba00:93a1, 2600:9000:223b:5800:13:b13a:ba00:93a1, 2600:9000:223b:6200:13:b13a:ba00:93a1, 2600:9000:223b:7400:13:b13a:ba00:93a1, 2600:9000:223b:a200:13:b13a:ba00:93a1, 2600:9000:223b:b200:13:b13a:ba00:93a1, 2600:9000:223b:c200:13:b13a:ba00:93a1, 2600:9000:223b:f400:13:b13a:ba00:93a1
Response IP 13.33.88.109
Found Yes
Hash 348ca4cc6822bfc1c312847bbadf001b83c75b4c0a7ae24a69fe2b10148f1236
SimHash 41581b522f93

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env

Other Records

Field Value
sitemap https://www.dordt.edu/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.dordt.edu/
  • live - don't allow web crawlers to index cpresources/ or vendor/