siia.org
robots.txt

Robots Exclusion Standard data for siia.org

Resource Scan

Scan Details

Site Domain siia.org
Base Domain siia.org
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2026-01-21T14:05:10+00:00
Next Scan 2026-02-20T14:05:10+00:00

Last Successful Scan

Scanned 2025-11-30T13:58:12+00:00
URL https://siia.org/robots.txt
Domain IPs 203.23.244.83
Response IP 203.23.244.83
Found Yes
Hash ee05c404210a730ceea5d1e84dcb48b3c76208dd4bf7c902830e4ea68cfec318
SimHash dafcd08adb1c

Groups

teleport

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

mercator-2.0

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

polybot

Rule Path
Disallow /

pjspider

Rule Path
Disallow /

wfarc

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

guidebot/5.3

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /i4a/memberDirectory/

proximic

Rule Path
Disallow /i4a/ams/staff/

*

Rule Path
Disallow /custom/

piplbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /i4a/click

*

Rule Path
Disallow /i4a/manage-preferences

*

Rule Path
Disallow /_ai/

*

Rule Path
Disallow /i4a/utilities/

Comments

  • go away
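
The groups listed above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a small subset of the reported groups (dotbot, proximic, and the first `*` group) as robots.txt text, on the assumption that each group corresponds to a `User-agent:` line followed by its `Disallow:` rules, and evaluates it with Python's standard `urllib.robotparser`. The name `ROBOTS_SUBSET`, the helper `build_parser`, the agent `examplebot`, and the sample URLs are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of three groups from the report above.
# The real file's exact wording and ordering are not shown in the report.
ROBOTS_SUBSET = """\
User-agent: dotbot
Disallow: /

User-agent: proximic
Disallow: /i4a/ams/staff/

User-agent: *
Disallow: /custom/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SUBSET)

# dotbot is disallowed everywhere.
print(parser.can_fetch("dotbot", "https://siia.org/"))
# proximic is blocked only under /i4a/ams/staff/.
print(parser.can_fetch("proximic", "https://siia.org/i4a/ams/staff/x"))
print(parser.can_fetch("proximic", "https://siia.org/about"))
# An agent with no group of its own falls back to the * group.
print(parser.can_fetch("examplebot", "https://siia.org/custom/page"))
print(parser.can_fetch("examplebot", "https://siia.org/about"))
```

The sketch deliberately leaves out the agents that appear in more than one group (petalbot, dotbot's second entry, and the later `*` groups), because parsers differ in how they combine repeated groups; `urllib.robotparser`, for instance, uses the first matching group it finds rather than merging them.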