csiresources.org
robots.txt

Robots Exclusion Standard data for csiresources.org

Resource Scan

Scan Details

Site Domain csiresources.org
Base Domain csiresources.org
Scan Status Ok
Last Scan2025-08-05T17:51:51+00:00
Next Scan 2025-09-04T17:51:51+00:00

Last Scan

Scanned2025-08-05T17:51:51+00:00
URL https://www.csiresources.org/robots.txt
Domain IPs 184.72.112.29, 184.72.125.39
Response IP 184.72.125.39
Found Yes
Hash c7deb7a74a095b11ca7e81aff84a2948442976849dd3ce08398467520f95ae4f
SimHash 58055e21dfb7

Groups

*

Rule Path
Allow /HigherLogic/System/DownloadDocument.ashx
Allow /HigherLogic/System/DownloadDocumentFile.ashx
Allow /higherlogic/system/downloaddocument.ashx
Allow /higherlogic/system/downloaddocumentfile.ashx
Disallow /ScriptResource.axd
Disallow /WebResource.axd
Disallow /*.axd$
Disallow /*/SearchLibrary
Disallow /*/searchlibrary
Disallow /*/PrintMessage
Disallow /*/printmessage
Disallow /HigherLogic/
Disallow /higherlogic/
Disallow /*/RSS
Disallow /*/rss
Disallow /*postreply*
Disallow /*postamessage*
Disallow /*postmessage*
Disallow /*forwardmessages*
Disallow /apps/group_public/calendar.php
Disallow /csidevelopmentsandbox/
Disallow /*forwardmessages*
Disallow /newchaptermicrositemodel/
Disallow /2019chaptermicrositemodel/
Disallow /cleanup-chaptermicrosite/
Disallow /csi-dev/
Disallow /maincopyjuly2020/
Disallow /masterspecifierretreat2018/
Disallow /masterspecifiersretreatjune2019/

*

Rule Path
Disallow /themelogic-dev/

*

Rule Path
Disallow /crosswalk2/

*

Rule Path
Disallow /crosswalk3/

Warnings

  • 1 invalid line.