introcs.cs.princeton.edu
robots.txt

Robots Exclusion Standard data for introcs.cs.princeton.edu

Resource Scan

Scan Details

Site Domain introcs.cs.princeton.edu
Base Domain princeton.edu
Scan Status Ok
Last Scan2025-09-30T08:35:09+00:00
Next Scan 2025-10-30T08:35:09+00:00

Last Scan

Scanned2025-09-30T08:35:09+00:00
URL https://introcs.cs.princeton.edu/robots.txt
Domain IPs 128.112.136.67
Response IP 128.112.136.67
Found Yes
Hash b6bb4db406f2dd47304173e49277d90bd52c8b53ea136ae31caa6c4ea3ae4f5e
SimHash 10f2c0058dd1

Groups

*

Rule Path
Disallow /data/
Disallow /45graph/cast.all.txt
Disallow /45graph/cast.rated.txt
Disallow /45graph/cast.00-06.txt
Disallow /45graph/cast.06.txt
Disallow /45graph/cast.action.txt
Disallow /45graph/cast.PG.txt
Disallow /45graph/cast.PG13.txt
Disallow /45graph/movies.txt
Disallow /java/data/
Disallow /java/data/mktsymbols.txt
Disallow /java/45graph/cast.all.txt
Disallow /java/45graph/cast.rated.txt
Disallow /java/45graph/cast.00-06.txt
Disallow /java/45graph/cast.06.txt
Disallow /java/45graph/cast.action.txt
Disallow /java/45graph/cast.PG.txt
Disallow /java/45graph/cast.PG13.txt
Disallow /java/45graph/movies.txt

Comments

  • Disallow: