cordaid.org
robots.txt

Robots Exclusion Standard data for cordaid.org

Resource Scan

Scan Details

Site Domain cordaid.org
Base Domain cordaid.org
Scan Status Ok
Last Scan2026-01-23T02:59:19+00:00
Next Scan 2026-02-22T02:59:19+00:00

Last Scan

Scanned2026-01-23T02:59:19+00:00
URL https://cordaid.org/robots.txt
Redirect https://www.cordaid.org/robots.txt
Redirect Domain www.cordaid.org
Redirect Base cordaid.org
Domain IPs 104.26.0.193, 104.26.1.193, 172.67.73.42, 2606:4700:20::681a:1c1, 2606:4700:20::681a:c1, 2606:4700:20::ac43:492a
Redirect IPs 104.26.0.193, 104.26.1.193, 172.67.73.42, 2606:4700:20::681a:1c1, 2606:4700:20::681a:c1, 2606:4700:20::ac43:492a
Response IP 104.26.1.193
Found Yes
Hash 72cdf20f21d03713d10d2d5c8f64aff896789aab4c641a7c89816d59072011ec
SimHash 25311647e111

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /nl/wp-admin/
Disallow /en/wp-admin/
Allow /nl/wp-admin/admin-ajax.php
Allow /en/wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.cordaid.org/nl/sitemap_index.xml
sitemap https://www.cordaid.org/en/sitemap_index.xml

Comments

  • Cordaid Robots.txt
  • This file should be symlinked from the webroot.
  • All user agents
  • Sitemaps