ifglobal.org
robots.txt

Robots Exclusion Standard data for ifglobal.org

Resource Scan

Scan Details

Site Domain ifglobal.org
Base Domain ifglobal.org
Scan Status Ok
Last Scan 2025-10-13T03:18:12+00:00
Next Scan 2025-11-12T03:18:12+00:00

Last Scan

Scanned 2025-10-13T03:18:12+00:00
URL https://ifglobal.org/robots.txt
Domain IPs 104.21.75.189, 172.67.180.223, 2606:4700:3031::ac43:b4df, 2606:4700:3037::6815:4bbd
Response IP 172.67.180.223
Found Yes
Hash f8bd0fc19886669ad5a40d4d8f2c842c8c93208f4387954ad4bd23e598065775
SimHash 03100f765711
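
The 64-hex-digit Hash value above is consistent with a SHA-256 digest of the fetched file, though the report does not name the algorithm, so that is an assumption. Under it, a change check between scans could be sketched as follows. The function name content_fingerprint and the sample body are hypothetical, and the sample is not the real file, so its digest will not match the reported Hash.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (assumes SHA-256)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body; the real file served by the site is not reproduced here.
sample = b"User-agent: scrapy\nAllow: /\n"
digest = content_fingerprint(sample)

# A later scan would compare its digest with the stored one to detect changes.
unchanged = digest == content_fingerprint(sample)
```

Comparing digests only detects that the file changed at all; the SimHash field presumably serves near-duplicate detection, which this sketch does not attempt.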

Groups

scrapy

Rule Path
Allow /
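
The group above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the ROBOTS_TXT text is reconstructed from the table (the original file's exact wording is an assumption), and the page URL and the "otherbot" agent are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table above; the real file may differ in wording.
ROBOTS_TXT = """\
User-agent: scrapy
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical page URL, used only to exercise the rule.
page = "https://ifglobal.org/any/page"
scrapy_allowed = parser.can_fetch("scrapy", page)
# No group names this agent and there is no "*" group, so nothing restricts it.
other_allowed = parser.can_fetch("otherbot", page)
```

Since the only rule is Allow /, both checks come back permitted; the group names scrapy explicitly without restricting it.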

Other Records

Field Value
sitemap https://www.ifglobal.org/sitemap.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2018-09.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2018-08.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2018-07.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2018-01.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2017-08.xml
sitemap https://www.ifglobal.org/sitemap-pt-page-2017-06.xml
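
The sitemap records above can be read back with urllib.robotparser as well (site_maps() requires Python 3.8 or later). The sketch below rebuilds the records as robots.txt lines, which is an assumption about the original file's layout, and groups them by host. It shows that every sitemap points at www.ifglobal.org, while the scanned domain is ifglobal.org.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Sitemap URLs copied from the "Other Records" table above.
SITEMAP_URLS = [
    "https://www.ifglobal.org/sitemap.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2018-09.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2018-08.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2018-07.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2018-01.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2017-08.xml",
    "https://www.ifglobal.org/sitemap-pt-page-2017-06.xml",
]

# Rebuild the records as robots.txt lines (assumed layout of the original file).
lines = [f"Sitemap: {url}" for url in SITEMAP_URLS]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns the listed URLs, or None if the file declares none.
sitemaps = parser.site_maps() or []
hosts = {urlsplit(url).hostname for url in sitemaps}
scanned_domain = "ifglobal.org"
differs_from_scanned = {host for host in hosts if host != scanned_domain}
```

A crawler that treats ifglobal.org and www.ifglobal.org as separate hosts may want to handle that difference explicitly.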