plato.stanford.edu
robots.txt

Robots Exclusion Standard data for plato.stanford.edu

Resource Scan

Scan Details

Site Domain plato.stanford.edu
Base Domain stanford.edu
Scan Status Ok
Last Scan 2024-09-18T09:22:34+00:00
Next Scan 2024-10-18T09:22:34+00:00

Last Scan

Scanned 2024-09-18T09:22:34+00:00
URL https://plato.stanford.edu/robots.txt
Domain IPs 171.67.193.20
Response IP 171.67.193.20
Found Yes
Hash ea8bcf62001a577d8f7190943649040ee095817147b042f2ebe54b0328825f70
SimHash 558c18f3c1cb
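
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a minimal sketch, assuming the value is the SHA-256 of the raw robots.txt response body. The helper names `sha256_hex` and `matches_reported` are hypothetical.

```python
import hashlib

# Hash value copied from the scan record above.
REPORTED_HASH = "ea8bcf62001a577d8f7190943649040ee095817147b042f2ebe54b0328825f70"


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def matches_reported(data: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a body's digest with the reported hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(data) == reported.lower()
```

To compare against the live file, pass the bytes fetched from the URL listed above to `matches_reported`. A mismatch may only mean the file has changed since the scan, or that the assumption about the algorithm is wrong.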

Groups

*

Rule Path
Disallow /MathJax/
Disallow /mathjax/
Disallow /archives/
Disallow /Archives/
Disallow /cgi-bin/
Disallow /ck-editor/
Disallow /css/
Disallow /CSS/
Disallow /diffs/
Disallow /DIFFS/
Disallow /font/
Disallow /FONT/
Disallow /inc/
Disallow /INC/
Disallow /js/
Disallow /JS/
Disallow /referer/
Disallow /rss/
Disallow /subject-editors/
Disallow /tmp/
Disallow /symbols/
Disallow /Symbols/
Disallow /usage/
Disallow /Usage/
Disallow /wwwstat/
Disallow /Wwwstat/
Disallow /search/
Disallow /perl/

ia_archiver

Rule Path
Disallow /

w3c-checklink

Rule Path
Disallow

pinterestbot

Rule Path
Allow /entries/
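
The groups above can be checked with Python's standard `urllib.robotparser`. This is a sketch, not the site's actual file: the robots.txt text is reconstructed from the listing above, and the one invalid line reported by the scan is not shown and is therefore omitted. `ExampleBot` is a hypothetical crawler name used to exercise the `*` group, and `allowed` is a hypothetical helper.

```python
from urllib.robotparser import RobotFileParser

# Paths listed under the "*" group in the scan above.
STAR_DISALLOW = [
    "/MathJax/", "/mathjax/", "/archives/", "/Archives/", "/cgi-bin/",
    "/ck-editor/", "/css/", "/CSS/", "/diffs/", "/DIFFS/", "/font/",
    "/FONT/", "/inc/", "/INC/", "/js/", "/JS/", "/referer/", "/rss/",
    "/subject-editors/", "/tmp/", "/symbols/", "/Symbols/", "/usage/",
    "/Usage/", "/wwwstat/", "/Wwwstat/", "/search/", "/perl/",
]

# Reconstructed file (assumption: the live file may differ in order or detail).
lines = ["User-agent: *"]
lines += [f"Disallow: {path}" for path in STAR_DISALLOW]
lines += [
    "",
    "User-agent: ia_archiver",
    "Disallow: /",
    "",
    "User-agent: w3c-checklink",
    "Disallow:",  # empty path: nothing is disallowed for this agent
    "",
    "User-agent: pinterestbot",
    "Allow: /entries/",
]

parser = RobotFileParser()
parser.parse(lines)


def allowed(agent: str, path: str) -> bool:
    """Return True if agent may fetch path under the reconstructed rules."""
    return parser.can_fetch(agent, "https://plato.stanford.edu" + path)


print(allowed("ExampleBot", "/entries/plato/"))     # True: no rule matches
print(allowed("ExampleBot", "/archives/"))          # False: listed under "*"
print(allowed("ExampleBot", "/Css/site.css"))       # True: matching is case-sensitive
print(allowed("ia_archiver", "/entries/plato/"))    # False: "Disallow /"
print(allowed("w3c-checklink", "/css/site.css"))    # True: empty Disallow
print(allowed("pinterestbot", "/css/site.css"))     # True: own group replaces "*"
```

Two effects are worth noting. Path matching is case-sensitive, which is presumably why the file lists both lower-case and upper-case variants of several directories. A crawler that has its own group follows only that group, so the `*` rules do not apply to `w3c-checklink` or `pinterestbot` in this reconstruction.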

Warnings

  • 1 invalid line.