childrensmercy.org
robots.txt

Robots Exclusion Standard data for childrensmercy.org

Resource Scan

Scan Details

Site Domain childrensmercy.org
Base Domain childrensmercy.org
Scan Status Ok
Last Scan2024-09-03T09:36:16+00:00
Next Scan 2024-10-03T09:36:16+00:00

Last Scan

Scanned2024-09-03T09:36:16+00:00
URL https://childrensmercy.org/robots.txt
Redirect https://www.childrensmercy.org/robots.txt
Redirect Domain www.childrensmercy.org
Redirect Base childrensmercy.org
Domain IPs 217.114.94.2
Redirect IPs 104.18.38.113, 172.64.149.143, 2606:4700:4400::6812:2671, 2606:4700:4400::ac40:958f
Response IP 104.18.38.113
Found Yes
Hash 323cc14ff82871497c071c36c947f0f1122fdecaac748d30a72279a8ddef9fdf
SimHash bc5dd41de113

Groups

*

Rule Path
Disallow /api/
Disallow /api$
Disallow /episerver/
Disallow /episerver$
Disallow /EPiServer/
Disallow /EPiServer$
Disallow /*.axd

sogou spider

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

jakarta commons-httpclient/3.0.1

Rule Path
Disallow /

mozilla/3.0 (compatible; talwininethttpclient)

Rule Path
Disallow /

anemone/0.7.2

Rule Path
Disallow /

typhoeus

Rule Path
Disallow /

http://www.profound.net/domainappender

Rule Path
Disallow /

mozilla/5.0 [en] (x11, u; openvas 7.0.5)

Rule Path
Disallow /

php-5.2-zs

Rule Path
Disallow /