cmh.edu
robots.txt

Robots Exclusion Standard data for cmh.edu

Resource Scan

Scan Details

Site Domain cmh.edu
Base Domain cmh.edu
Scan Status Ok
Last Scan2024-09-30T15:07:46+00:00
Next Scan 2024-10-30T15:07:46+00:00

Last Scan

Scanned2024-09-30T15:07:46+00:00
URL https://cmh.edu/robots.txt
Redirect https://www.cmh.edu/robots.txt
Redirect Domain www.cmh.edu
Redirect Base cmh.edu
Domain IPs 217.114.94.2
Redirect IPs 104.18.38.113, 172.64.149.143, 2606:4700:4400::6812:2671, 2606:4700:4400::ac40:958f
Response IP 172.64.149.143
Found Yes
Hash 323cc14ff82871497c071c36c947f0f1122fdecaac748d30a72279a8ddef9fdf
SimHash bc5dd41de113

Groups

*

Rule Path
Disallow /api/
Disallow /api$
Disallow /episerver/
Disallow /episerver$
Disallow /EPiServer/
Disallow /EPiServer$
Disallow /*.axd

sogou spider

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

jakarta commons-httpclient/3.0.1

Rule Path
Disallow /

mozilla/3.0 (compatible; talwininethttpclient)

Rule Path
Disallow /

anemone/0.7.2

Rule Path
Disallow /

typhoeus

Rule Path
Disallow /

http://www.profound.net/domainappender

Rule Path
Disallow /

mozilla/5.0 [en] (x11, u; openvas 7.0.5)

Rule Path
Disallow /

php-5.2-zs

Rule Path
Disallow /