communitywiki.org
robots.txt
Robots Exclusion Standard data for communitywiki.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | communitywiki.org |
Base Domain | communitywiki.org |
Scan Status | Ok |
Last Scan | 2024-09-20T17:35:01+00:00 |
Next Scan | 2024-10-20T17:35:01+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-20T17:35:01+00:00 |
URL | https://communitywiki.org/robots.txt |
Domain IPs | 178.209.50.237, 2a02:418:6a04:178:209:50:237:1 |
Response IP | 178.209.50.237 |
Found | Yes |
Hash | 2dd64e4e5fae2c30410d390bd41c21f8740ecb0a4c93fcad5fc64a4603b69519 |
SimHash | dbd6e09a52a2 |
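The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so both are assumptions in the sketch below. It shows how a locally fetched copy of the file could be compared against the reported value; `sha256_hex` and `matches_report` are illustrative names, not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan table above.
REPORTED_HASH = "2dd64e4e5fae2c30410d390bd41c21f8740ecb0a4c93fcad5fc64a4603b69519"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True if the body hashes to the reported value (assuming SHA-256)."""
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch would not prove the file changed: the scanner may hash a normalized form of the file rather than the raw response body.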
Groups
x28-job-bot
velenpublicwebcrawler
omgilitbot
google-extended
chatgpt-user
gptbot
magpie-crawler
seokicks
dataforseobot
adsbot
ccbot
barkrowler
petalbot
buck
femtosearchbot
gigabot
blexbot
mj12bot
ahrefsbot
seznambot
daumoa
sputnikbot
wbsearchbot
yeti
sogou spider
naverbot
sistrix
spbot
wiederfreibot/1.0
dotbot
xovibot
linguee
seoscanners.net
bubing
serpstatbot
Rule | Path |
---|---|
Disallow | / |
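The effect of this group can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a fragment of the group as robots.txt text. The real file's layout is not shown in this report, so the fragment is an assumed reconstruction, and only three of the listed user agents are included for brevity. `ExampleBot` is a hypothetical crawler name that is not in the list.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the group; the scan report does not
# show the original file's exact layout or capitalization.
ROBOTS_FRAGMENT = """\
User-agent: gptbot
User-agent: ccbot
User-agent: ahrefsbot
Disallow: /
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Ask the standard-library parser whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_FRAGMENT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "CCBot", "ExampleBot"):
        print(agent, is_allowed(agent, "https://communitywiki.org/"))
```

Because the fragment omits the `*` group, an unlisted agent falls through to the parser's default of allowing the request. Against the full file, such an agent would be governed by the `*` group instead.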
*
Rule | Path |
---|---|
Disallow | /cw-de |
Disallow | /cw-fr |
Disallow | /cw-en |
Disallow | /cw-new |
Disallow | /cw? |
Disallow | /cw.pl |
Disallow | /mark? |
Disallow | /mark.pl |
Disallow | /test? |
Disallow | /test.pl |
Disallow | /wiki? |
Disallow | /wiki.pl |
Disallow | /zen? |
Disallow | /zen.pl |
Disallow | /cgit |
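Disallow values are matched as prefixes of the request path plus query string, so a rule such as `/cw?` targets script URLs with a query while leaving paths like `/cw/...` untouched. The sketch below illustrates that with a plain prefix check. It is a simplification: it ignores `Allow` rules, the `*` and `$` wildcards, and percent-encoding normalization, and the example URLs are hypothetical.

```python
from urllib.parse import urlsplit

# Disallow prefixes of the "*" group, copied from the table above.
DISALLOWED_PREFIXES = [
    "/cw-de", "/cw-fr", "/cw-en", "/cw-new",
    "/cw?", "/cw.pl", "/mark?", "/mark.pl",
    "/test?", "/test.pl", "/wiki?", "/wiki.pl",
    "/zen?", "/zen.pl", "/cgit",
]


def request_target(url: str) -> str:
    """Return the path plus '?query', the part that rules are matched against.

    Simplification: a trailing '?' with an empty query is dropped.
    """
    parts = urlsplit(url)
    path = parts.path or "/"
    return f"{path}?{parts.query}" if parts.query else path


def is_disallowed(url: str) -> bool:
    """True if any Disallow prefix of the '*' group matches the URL."""
    target = request_target(url)
    return any(target.startswith(prefix) for prefix in DISALLOWED_PREFIXES)


if __name__ == "__main__":
    for url in (
        "https://communitywiki.org/cw?action=edit",  # hypothetical URL
        "https://communitywiki.org/cw/HomePage",     # hypothetical URL
        "https://communitywiki.org/cgit/repo",       # hypothetical URL
    ):
        print(url, is_disallowed(url))
```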
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
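`crawl-delay` is not defined by the Robots Exclusion Protocol (RFC 9309), and crawlers that honor it interpret it differently. A common reading is a minimum number of seconds between consecutive requests. The sketch below assumes that reading. `Throttle` is a hypothetical helper, and the clock and sleep functions are injectable so the logic can be exercised without actually waiting.

```python
import time

CRAWL_DELAY_SECONDS = 20  # value reported in the table above


class Throttle:
    """Enforce a minimum gap between consecutive requests."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until at least `delay` seconds have passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.delay - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

A crawler would call `Throttle(CRAWL_DELAY_SECONDS).wait()` before each request. At 20 seconds per request, 1,000 pages take roughly five and a half hours.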
Warnings
- 1 invalid line.