communitywiki.org
robots.txt

Robots Exclusion Standard data for communitywiki.org

Resource Scan

Scan Details

Site Domain communitywiki.org
Base Domain communitywiki.org
Scan Status Ok
Last Scan 2024-09-20T17:35:01+00:00
Next Scan 2024-10-20T17:35:01+00:00

Last Scan

Scanned 2024-09-20T17:35:01+00:00
URL https://communitywiki.org/robots.txt
Domain IPs 178.209.50.237, 2a02:418:6a04:178:209:50:237:1
Response IP 178.209.50.237
Found Yes
Hash 2dd64e4e5fae2c30410d390bd41c21f8740ecb0a4c93fcad5fc64a4603b69519
SimHash dbd6e09a52a2

Groups

x28-job-bot
velenpublicwebcrawler
omgilitbot
google-extended
chatgpt-user
gptbot
magpie-crawler
seokicks
dataforseobot
adsbot
ccbot
barkrowler
petalbot
buck
femtosearchbot
gigabot
blexbot
mj12bot
ahrefsbot
seznambot
daumoa
sputnikbot
wbsearchbot
yeti
sogou spider
naverbot
sistrix
spbot
wiederfreibot/1.0
dotbot
xovibot
linguee
seoscanners.net
bubing
serpstatbot

Rule Path
Disallow /

*

Rule Path
Disallow /cw-de
Disallow /cw-fr
Disallow /cw-en
Disallow /cw-new
Disallow /cw?
Disallow /cw.pl
Disallow /mark?
Disallow /mark.pl
Disallow /test?
Disallow /test.pl
Disallow /wiki?
Disallow /wiki.pl
Disallow /zen?
Disallow /zen.pl
Disallow /cgit
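
The two groups above can be read as simple path-prefix rules. The following is a minimal sketch, not the scanner's or any crawler's actual logic: `is_allowed` is a hypothetical helper, the user agent is matched by exact lowercase name (real crawlers usually match on a product token), and only the Disallow prefixes listed in this scan are used.

```python
# User agents from the first group in the scan; all are given "Disallow /".
BLOCKED_AGENTS = {
    "x28-job-bot", "velenpublicwebcrawler", "omgilitbot", "google-extended",
    "chatgpt-user", "gptbot", "magpie-crawler", "seokicks", "dataforseobot",
    "adsbot", "ccbot", "barkrowler", "petalbot", "buck", "femtosearchbot",
    "gigabot", "blexbot", "mj12bot", "ahrefsbot", "seznambot", "daumoa",
    "sputnikbot", "wbsearchbot", "yeti", "sogou spider", "naverbot",
    "sistrix", "spbot", "wiederfreibot/1.0", "dotbot", "xovibot", "linguee",
    "seoscanners.net", "bubing", "serpstatbot",
}

# Disallow prefixes from the "*" group in the scan.
WILDCARD_DISALLOW = [
    "/cw-de", "/cw-fr", "/cw-en", "/cw-new",
    "/cw?", "/cw.pl", "/mark?", "/mark.pl",
    "/test?", "/test.pl", "/wiki?", "/wiki.pl",
    "/zen?", "/zen.pl", "/cgit",
]


def is_allowed(user_agent: str, path: str) -> bool:
    """Hypothetical helper: True if no Disallow prefix covers `path`.

    Assumption: exact, case-insensitive agent-name matching and plain
    prefix matching on the path plus query string.
    """
    if user_agent.lower() in BLOCKED_AGENTS:
        rules = ["/"]  # the named group blocks the whole site
    else:
        rules = WILDCARD_DISALLOW  # every other agent falls back to "*"
    return not any(path.startswith(prefix) for prefix in rules)


if __name__ == "__main__":
    # "examplebot" is a made-up agent name used only for illustration.
    for agent, path in [
        ("GPTBot", "/wiki/HomePage"),
        ("examplebot", "/wiki/HomePage"),
        ("examplebot", "/wiki?action=edit"),
        ("examplebot", "/cgit/repo"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a named agent such as gptbot is refused everywhere, while any other agent may fetch plain page paths like /wiki/HomePage but not the script and query-string URLs or /cgit.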

Other Records

Field Value
crawl-delay 20
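
crawl-delay is a non-standard field, and the scan lists it outside any group, so whether a given crawler honors it is not known from this report. As a rough sketch, assuming a crawler does treat the value as a minimum of 20 seconds between requests, a fetch schedule could be planned like this (`schedule` is a hypothetical helper, and the paths are made up):

```python
CRAWL_DELAY_SECONDS = 20  # value of the crawl-delay record in the scan


def schedule(paths, start=0.0):
    """Hypothetical helper: pair each path with its earliest fetch time.

    Times are offsets in seconds from `start`, spaced by the crawl delay.
    """
    return [(start + i * CRAWL_DELAY_SECONDS, path) for i, path in enumerate(paths)]


if __name__ == "__main__":
    for offset, path in schedule(["/wiki/HomePage", "/wiki/About", "/wiki/Help"]):
        print(f"t+{offset:.0f}s  {path}")
```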

Warnings

  • 1 invalid line.