icpsr.umich.edu
robots.txt

Robots Exclusion Standard data for icpsr.umich.edu

Resource Scan

Scan Details

Site Domain icpsr.umich.edu
Base Domain umich.edu
Scan Status Ok
Last Scan2025-07-03T22:28:06+00:00
Next Scan 2025-08-02T22:28:06+00:00

Last Scan

Scanned2025-07-03T22:28:06+00:00
URL https://icpsr.umich.edu/robots.txt
Redirect https://www.icpsr.umich.edu/robots.txt
Redirect Domain www.icpsr.umich.edu
Redirect Base umich.edu
Domain IPs 44.193.123.222
Redirect IPs 35.169.63.230
Response IP 35.169.63.230
Found Yes
Hash b51bebd169ab71c7574649f15773b03c5d2046a463c85ce5cd54c83f91df8c58
SimHash f8c9dd80135b

Groups

tapdance

Rule Path
Disallow /

*

Rule Path
Disallow /bibliofake/
Disallow /cgi-bin/
Disallow /cgi-bin/CITATIONS/
Disallow /cgi/CITATIONS/
Disallow /dannotest/boof/*
Disallow /DDI/threads/
Disallow /files/
Disallow /GEM/
Disallow /web/*/dara/
Disallow /web/*/studies/*/sdafm
Disallow /web/*/studies/*/solr
Disallow /web/*/studies/*/utilization
Disallow /web/*/studies/22940
Disallow /web/*/studies/34800
Disallow /web/*/studies/36861
Disallow /web/*/studies/3417
Disallow /web/*/*.mkup
Disallow /web/files/cfda/*
Disallow /web/instructors/series/*
Disallow /web/instructors/studies/*
Disallow /images/
Disallow /mkup/
Disallow /rcs/
Disallow /temp/
Disallow /robots.txt
Disallow /pages/*
Allow /web/*/search/series
Allow /files/static-aaap-sitemap.xml
Disallow /web/pages/ICPSR/styleguide/*
Disallow /web/pages/FAR/*
Disallow /web/pages/odf/*

Comments

  • This is loading from /icpsrweb-pages/robots/prod/icpsr/robots.txt

Warnings

  • 1 invalid line.