algs4.cs.princeton.edu
robots.txt

Robots Exclusion Standard data for algs4.cs.princeton.edu

Resource Scan

Scan Details

Site Domain algs4.cs.princeton.edu
Base Domain princeton.edu
Scan Status Ok
Last Scan 2025-09-30T02:23:12+00:00
Next Scan 2025-10-30T02:23:12+00:00

Last Scan

Scanned 2025-09-30T02:23:12+00:00
URL https://algs4.cs.princeton.edu/robots.txt
Domain IPs 128.112.136.67
Response IP 128.112.136.67
Found Yes
Hash 69adc3bb4017c470128d9d63bcfd550fcfdffffc01cc92fdd8ec0a5c715913dc
SimHash 3b4d490f6b52
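
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treat that as an assumption. Under it, a minimal sketch of comparing a locally fetched copy against the reported value could look like this; the helper names sha256_hex and matches_reported are hypothetical and not part of the scan service.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported_hex: str) -> bool:
    """Compare a body's digest with a reported hash.

    Assumption: the reported hash is SHA-256 over the unmodified bytes.
    """
    return sha256_hex(body) == reported_hex.strip().lower()
```

The comparison only holds if the bytes are hashed exactly as served; any newline normalization or re-encoding before hashing would produce a different digest.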

Groups

*

Rule Path
Disallow /references/papers/
Disallow /31elementary/leipzig1M.txt
Disallow /31elementary/leipzig100K.txt
Disallow /31elementary/leipzig300K.txt
Disallow /35applications/movies.txt
Disallow /41undirected/movies.txt
Disallow /14analysis/mktsymbols.txt
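
To illustrate how a crawler might apply the rules listed above, here is a minimal sketch using Python's standard urllib.robotparser module. The ROBOTS_TXT text is reconstructed from the rule table for the "*" group and may differ in whitespace or comments from the file actually served; the agent name example-bot and the helpers build_parser and is_allowed are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table for group "*" (an assumption:
# the served file may contain extra comments or different spacing).
ROBOTS_TXT = """\
User-agent: *
Disallow: /references/papers/
Disallow: /31elementary/leipzig1M.txt
Disallow: /31elementary/leipzig100K.txt
Disallow: /31elementary/leipzig300K.txt
Disallow: /35applications/movies.txt
Disallow: /41undirected/movies.txt
Disallow: /14analysis/mktsymbols.txt
"""

BASE_URL = "https://algs4.cs.princeton.edu"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "example-bot") -> bool:
    """Return True if the given agent may fetch the path on the site."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    rules = build_parser(ROBOTS_TXT)
    for path in ("/41undirected/", "/41undirected/movies.txt", "/references/papers/a.pdf"):
        print(path, is_allowed(rules, path))
```

Because the standard-library parser matches each Disallow path as a prefix, the directory rule /references/papers/ covers every file beneath it, while the remaining rules block only the individual data files named.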