worldbank.org.cn
robots.txt

Robots Exclusion Standard data for worldbank.org.cn

Resource Scan

Scan Details

Site Domain worldbank.org.cn
Base Domain worldbank.org.cn
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-07-19T23:09:27+00:00
Next Scan 2024-10-17T23:09:27+00:00

Last Successful Scan

Scanned 2023-09-01T22:46:26+00:00
URL http://www.worldbank.org.cn/robots.txt
Redirect https://www.shihang.org/robots.txt
Redirect Domain www.shihang.org
Redirect Base shihang.org
Domain IPs 192.86.98.187
Redirect IPs 18.155.68.11, 18.155.68.113, 18.155.68.129, 18.155.68.38, 2600:9000:23d2:1200:16:af4e:ae40:93a1, 2600:9000:23d2:1800:16:af4e:ae40:93a1, 2600:9000:23d2:5600:16:af4e:ae40:93a1, 2600:9000:23d2:5800:16:af4e:ae40:93a1, 2600:9000:23d2:8e00:16:af4e:ae40:93a1, 2600:9000:23d2:cc00:16:af4e:ae40:93a1, 2600:9000:23d2:d600:16:af4e:ae40:93a1, 2600:9000:23d2:e000:16:af4e:ae40:93a1
Response IP 18.161.97.90
Found Yes
Hash 4f32d5a569c8a156735eb443282ae7c6e589abc9e85b6286a29334628fbfe714
SimHash 45148182d711

Groups

*

Rule Path
Allow *
Disallow */apps/*
Disallow */wbg/services/*
Disallow */en/webarchives/archive*
Disallow */misc*
Disallow */conf/*
Disallow */downloadstats/*
Disallow */wbg/aem/*
Disallow */etc/dam*
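
The group above can be read as a small matching problem: a crawler compares a URL path against each pattern and applies the most specific rule. The following is a minimal sketch of that evaluation, assuming the longest-match precedence described in RFC 9309 (with Allow winning ties) and treating `*` as "any run of characters". The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan data; the rule list is copied from the table above.

```python
import re

# Rules from the "*" group above, in listed order (kind, path pattern).
RULES = [
    ("allow", "*"),
    ("disallow", "*/apps/*"),
    ("disallow", "*/wbg/services/*"),
    ("disallow", "*/en/webarchives/archive*"),
    ("disallow", "*/misc*"),
    ("disallow", "*/conf/*"),
    ("disallow", "*/downloadstats/*"),
    ("disallow", "*/wbg/aem/*"),
    ("disallow", "*/etc/dam*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/en/home")` is true because only `Allow *` matches, while `is_allowed("/en/apps/foo")` is false because `*/apps/*` is a longer match than `*`. Real crawlers may differ in details such as percent-encoding and case handling, which this sketch ignores.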