worldbank.org
robots.txt

Robots Exclusion Standard data for worldbank.org

Resource Scan

Scan Details

Site Domain worldbank.org
Base Domain worldbank.org
Scan Status Ok
Last Scan 2024-10-20T03:38:58+00:00
Next Scan 2024-11-19T03:38:58+00:00

Last Scan

Scanned 2024-10-20T03:38:58+00:00
URL https://worldbank.org/robots.txt
Redirect https://www.worldbank.org/robots.txt
Redirect Domain www.worldbank.org
Redirect Base worldbank.org
Domain IPs 35.71.168.154, 52.223.24.49
Redirect IPs 104.18.40.198, 172.64.147.58, 2606:4700:4400::6812:28c6, 2606:4700:4400::ac40:933a
Response IP 104.18.40.198
Found Yes
Hash 49edfa04cad05d2abe98e3ce315ff7d9168d5604b0532fc9bcbd15c3ef4a0855
SimHash 051dc012df93

Groups

*

Rule Path
Allow *
Disallow */apps/*
Disallow */wbg/services/*
Disallow */en/webarchives/archive*
Disallow */misc*
Disallow */conf/*
Disallow */downloadstats/*
Disallow */wbg/aem/*
Disallow */en/search*
Disallow */content/search*
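
How these rules interact can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the longest-match precedence described in RFC 9309 (the matching rule with the longest pattern wins, and Allow wins a tie), and it assumes that a leading `*` matches any path prefix. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "*"),
    ("disallow", "*/apps/*"),
    ("disallow", "*/wbg/services/*"),
    ("disallow", "*/en/webarchives/archive*"),
    ("disallow", "*/misc*"),
    ("disallow", "*/conf/*"),
    ("disallow", "*/downloadstats/*"),
    ("disallow", "*/wbg/aem/*"),
    ("disallow", "*/en/search*"),
    ("disallow", "*/content/search*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/en/home", "/en/search?q=water", "/content/dam/apps/x", "/apps"]:
        print(p, is_allowed(p))
```

Under these assumptions, the blanket `Allow *` is the shortest pattern, so it only applies to paths that match none of the Disallow lines. Note that Python's standard `urllib.robotparser` does not interpret `*` wildcards in paths, which is why the sketch does its own matching.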

Other Records

Field Value
sitemap https://www.worldbank.org/sitemap.xml