aclweb.org
robots.txt

Robots Exclusion Standard data for aclweb.org

Resource Scan

Scan Details

Site Domain aclweb.org
Base Domain aclweb.org
Scan Status Ok
Last Scan 2026-03-26T19:16:14+00:00
Next Scan 2026-04-25T19:16:14+00:00

Last Scan

Scanned 2026-03-26T19:16:14+00:00
URL https://aclweb.org/robots.txt
Redirect https://www.aclweb.org/robots.txt
Redirect Domain www.aclweb.org
Redirect Base aclweb.org
Domain IPs 50.87.169.12
Redirect IPs 13.35.202.100, 13.35.202.3, 13.35.202.44, 13.35.202.51, 2600:9000:2078:1200:9:7594:79c0:93a1, 2600:9000:2078:2400:9:7594:79c0:93a1, 2600:9000:2078:3600:9:7594:79c0:93a1, 2600:9000:2078:5200:9:7594:79c0:93a1, 2600:9000:2078:5800:9:7594:79c0:93a1, 2600:9000:2078:c400:9:7594:79c0:93a1, 2600:9000:2078:dc00:9:7594:79c0:93a1, 2600:9000:2078:f800:9:7594:79c0:93a1
Response IP 13.35.202.3
Found Yes
Hash b5f12680ae034dcee21c379fa02d7644b0ef38f3fe7fd3cccba8bc7e3cd90853
SimHash 5158dcd0f6d1
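
The Hash value is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below only assumes SHA-256 over the raw response body. The helper names `body_digest` and `matches_report` are hypothetical and used for illustration only.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
REPORTED_HASH = "b5f12680ae034dcee21c379fa02d7644b0ef38f3fe7fd3cccba8bc7e3cd90853"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    SHA-256 is an assumption based on the digest length; the report
    does not state which algorithm produced its Hash field.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched body against the reported digest."""
    return body_digest(body) == reported.lower()
```

If the assumption holds, a freshly fetched copy of the file that still matches the reported digest would indicate that the file has not changed since the scan. A mismatch does not prove a change, since the scanner may hash a normalized form of the file.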

Groups

*

Rule Path
Disallow /aclwiki/index.php?
Disallow /nmlwiki/index.php?
Disallow /execwiki/index.php?
Disallow /adminwiki/index.php?
Disallow /portal/user/
Disallow /portal/admin/
Disallow /portal/node/add/
Disallow /portal/search/

Other Records

Field Value
crawl-delay 10
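
As a rough illustration of how a crawler might evaluate the `*` group above, the sketch below rewrites the listed rules in robots.txt syntax and checks a few paths with Python's standard `urllib.robotparser`. The text is a reconstruction from this report, not the original file, and `ExampleBot` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# The "*" group from the report, rewritten in robots.txt syntax.
# This is a reconstruction; line order in the real file may differ.
WILDCARD_GROUP = """\
User-agent: *
Disallow: /aclwiki/index.php?
Disallow: /nmlwiki/index.php?
Disallow: /execwiki/index.php?
Disallow: /adminwiki/index.php?
Disallow: /portal/user/
Disallow: /portal/admin/
Disallow: /portal/node/add/
Disallow: /portal/search/
Crawl-delay: 10
"""

BASE = "https://www.aclweb.org"


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(WILDCARD_GROUP)

# "ExampleBot" is hypothetical; it falls under the "*" group.
print(rules.can_fetch("ExampleBot", BASE + "/portal/user/login"))
print(rules.can_fetch("ExampleBot", BASE + "/portal/"))
print(rules.crawl_delay("ExampleBot"))
```

Parsers differ in how they treat the trailing `?` in the wiki rules, so the sketch only probes the `/portal/` paths.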

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

chatgpt-user

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

ccbot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

anthropic-ai

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

claudebot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

claude-web

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

cohere-ai

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

omgilibot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

diffbot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

perplexitybot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

youbot

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6

newsai

Rule Path
Disallow /portal/user/
Disallow /portal/admin/

Other Records

Field Value
crawl-delay 6
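
A crawler that finds a group naming its own user agent follows that group and falls back to `*` only when no named group matches. The sketch below shows this with a shortened reconstruction of three groups from this report, and converts each crawl delay into an approximate hourly request ceiling. The `summarize` helper and the `ExampleBot` agent are hypothetical. `Crawl-delay` is a non-standard field that some crawlers ignore.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of three groups listed in the report.
SAMPLE = """\
User-agent: *
Disallow: /portal/user/
Disallow: /portal/admin/
Disallow: /portal/node/add/
Disallow: /portal/search/
Crawl-delay: 10

User-agent: bytespider
Disallow: /

User-agent: gptbot
Disallow: /portal/user/
Disallow: /portal/admin/
Crawl-delay: 6
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())


def summarize(agent: str, path: str):
    """Return (allowed, crawl_delay_seconds, max_requests_per_hour)."""
    url = "https://www.aclweb.org" + path
    delay = parser.crawl_delay(agent)
    # One request every `delay` seconds gives at most 3600 // delay per hour.
    per_hour = 3600 // delay if delay else None
    return parser.can_fetch(agent, url), delay, per_hour


for agent in ("ExampleBot", "GPTBot", "Bytespider"):
    print(agent, summarize(agent, "/portal/search/"))
```

Under these rules alone, the named AI-crawler groups list fewer disallowed paths and a shorter delay than the `*` group, so any stricter limits would have to be enforced outside robots.txt.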

Comments

  • Bad bots - blocked completely (403 at nginx)
  • AI bots - heavily rate limited