hubicl.org
robots.txt

Robots Exclusion Standard data for hubicl.org

Resource Scan

Scan Details

Site Domain hubicl.org
Base Domain hubicl.org
Scan Status Ok
Last Scan2026-01-22T09:10:14+00:00
Next Scan 2026-02-21T09:10:14+00:00

Last Scan

Scanned2026-01-22T09:10:14+00:00
URL https://hubicl.org/robots.txt
Domain IPs 2a02:4780:84:4c94:f7f6:6a38:af1b:156d, 2a02:4780:84:d83f:3514:b98e:ae39:593a, 84.32.84.201, 84.32.84.238
Response IP 179.61.189.160
Found Yes
Hash b8e1af00b321e9041fb61c75c0fa7a4e5a8dbedb2eb4c8d076f0dfc2232d2bed
SimHash e93c99502281

Groups

*

Rule Path
Disallow /administrator/
Disallow /api/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /images/
Disallow /includes/
Disallow /infrastructure/rappture
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /login*
Disallow /logs/
Disallow /media/
Disallow /modules/
Disallow /opt/trac/tools/
Disallow /plugins/
Disallow /publications/browse
Disallow /search/
Disallow /Shibboleth.sso/
Disallow /templates/
Disallow /tmp/
Disallow /tools/

Other Records

Field Value
crawl-delay 1

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /