proquest.com
robots.txt

Robots Exclusion Standard data for proquest.com

Resource Scan

Scan Details

Site Domain proquest.com
Base Domain proquest.com
Scan Status Ok
Last Scan2024-10-18T05:16:13+00:00
Next Scan 2024-11-17T05:16:13+00:00

Last Scan

Scanned2024-10-18T05:16:13+00:00
URL https://proquest.com/robots.txt
Redirect https://www.proquest.com/robots.txt
Redirect Domain www.proquest.com
Redirect Base proquest.com
Domain IPs 165.215.200.228
Redirect IPs 162.159.152.9, 162.159.153.8
Response IP 162.159.153.8
Found Yes
Hash 5c2766db6bfc7bd2b4a2b144833b5e45414ffb0a4510d7311563034d783f0d35
SimHash 5f40db0372ae

Groups

*

Rule Path
Disallow /url/
Disallow /AppleWebKit/
Disallow /a11y/
Disallow /input/
Disallow /output/
Disallow /indexol.sourcetypessearch.expandedbasicsearchbox.searchterm
Disallow /docview/*PQ/1
Disallow /*.captureproxyhost%3A
Disallow /widget/
Disallow /*.docviewusetools
Disallow /*.docviewanalytics
Disallow /*.pdfdocusetools
Disallow /*.documentimageusetools
Disallow /previewunavailable
Disallow /*.fulltext%3Ahidefulltext
Disallow /honeypot
Disallow /help/
Disallow /*.similardocuments
Disallow /*.pagelayout.popuplocaleswitcher
Disallow /*%3Ainterdocimagesevent
Disallow /*.progressivedisplay
Disallow /*.loginoverlay
Disallow /*%3Aallsaveoptions
Disallow */shibbolethlogin
Disallow */error/
Disallow */errorpage/
Disallow /*%3Ahidebannerevent
Disallow /%
Disallow /c/
Disallow /C/
Disallow */indexinglinkhandler
Disallow */indexingvolumeissuelinkhandler
Disallow /login
Disallow /*.quicksearchbox
Disallow /*.accesstofulltextlinks.*
Disallow /congressional
Disallow /go/
Disallow /about/
Disallow /pubidlinkhandler/
Disallow /products-services/
Allow /products-services/*/se-2
Disallow /openview/
Disallow /blog/
Allow /blog/*/se-2
Disallow /embed/
Allow /embed/*/se-2
Disallow /professional/
Disallow /en-US/
Disallow /products_pq/
Allow /products_pq/*/se-2
Disallow /pdpq/
Allow /pdpq/*/se-2
Disallow /documents/
Allow /documents/*/se-2
Disallow /company/
Allow /company/*/se-2
Disallow /libraries/
Allow /libraries/*/se-2
Disallow /*?accountid=*
Disallow /*%26accountid%3D*
Disallow /*?username=
Disallow /*.quicksearchbox%3A
Disallow /*.pagelayout.pendo
Disallow /APAC-JP/
Disallow /histvault?
Disallow /histvault/
Disallow /*%3Aexternallink
Disallow .html$
Disallow .html?
Disallow .shtml$
Disallow /ebdetailsview
Disallow /*.hitnavigationswitch%3A
Disallow /historyvault
Disallow /customer-care/tools-resources/
Disallow /runSearch
Disallow /*.pagelayout%3A
Disallow /en/
Disallow /Documents/
Disallow /congcomp/getdoc
Disallow /myresearch/
Disallow /docview.accesstofulltextlinks.externallink_0%3A
Disallow /docprintview/