projects.gcbbs.net
robots.txt

Robots Exclusion Standard data for projects.gcbbs.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	projects.gcbbs.net
Base Domain	gcbbs.net
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-12-09T04:18:43+00:00
Next Scan	2026-02-07T04:18:43+00:00

Last Successful Scan

Scanned	2025-09-22T07:51:31+00:00
URL	https://projects.gcbbs.net/robots.txt
Domain IPs	142.93.1.73, 2604:a880:400:d0::24b0:1
Response IP	142.93.1.73
Found	Yes
Hash	c26c06ec69629c1be08f312d737d05b20b81c8610a75aafdd926bb5f297155bd
SimHash	245b8b01c2e0

Groups

adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
adidxbot
algolia crawler
applebot
applenewsbot
baiduspider
baiduspider-image
baiduspider-news
baiduspider-video
bingbot
bingpreview
bublupbot
ccbot
cliqzbot
coccoc
coccocbot-image
coccocbot-web
daumoa
dazoobot
deusu
duckduckbot
duckduckgo-favicons-bot
euripbot
exploratodo
facebookcatalog
facebookexternalhit
facebot
feedly
findxbot
gooblog
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
haosouspider
ichiro
istellabot
jikespider
lycos
mail.ru
mediapartners-google
microsoftpreview
mojeekbot
msnbot
msnbot-media
orangebot
pinterest
plukkie
qwantify
rambler
semanticscholarbot
seznambot
sosospider
slurp
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
twitterbot
whatsapp
yacybot
yandex
yandexmobilebot
yepbot
yeti
yioopbot
yoozbot
youdaobot
*
addsearchbot
ai2bot
ai2bot-dolma
aihitbot
amazonbot
andibot
anthropic-ai
applebot-extended
awario
bedrockbot
bigsur.ai
brightbot 1.0
bytespider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
datenbank crawler
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
linerbot
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
thinkbot
tiktokspider
timpibot
velenpublicwebcrawler
wardbot
webzio-extended
wpbot
yak
yandexadditional
yandexadditionalbot
youbot

Rule	Path
Disallow
Disallow	/

Rule

Path

Disallow

/

Back to top

Comments

robots.txt merged from multiple sources
Source 1: https://www.ditig.com/robots.txt
ROBOTS.TXT
Updates and informantion can be found at:
https://www.ditig.com/publications/robots-txt-template
This document is licensed with a CC BY-NC-SA 4.0 license.
Last update: 2025-03-04
so.com chinese search engine
google.com landing page quality checks
google.com app resource fetcher
bing ads bot
algolia.com search
apple.com search engine
baidu.com chinese search engine
bing.com international search engine
bublup.com suggestion/search engine
commoncrawl.org open repository of web crawl data
cliqz.com german in-product search engine
coccoc.com vietnamese search engine
daum.net korean search engine
dazoo.fr french search engine
deusu.de german search engine
duckduckgo.com international privacy search engine
eurip.com european search engine
exploratodo.com latin search engine
facebook.com social network
feedly.com feed fetcher
findx.com european search engine
goo.ne.jp japanese search engine
google.com international search engine
so.com chinese search engine
goo.ne.jp japanese search engine
istella.it italian search engine
jike.com / chinaso.com chinese search engine
lycos.com & hotbot.com international search engine
mail.ru russian search engine
google.com adsense bot
Preview bot for Microsoft products
mojeek.com search engine
bing.com international search engine
orange.com international search engine
pinterest.com social networtk
botje.nl dutch search engine
qwant.com french search engine
rambler.ru russian search engine
semanticscholar.org scientific search engine
seznam.cz czech search engine
soso.com chinese search engine
yahoo.com international search engine
sogou.com chinese search engine
twitter.com social media bot
whatsapp.com preview bot
yacy.net p2p search software
yandex.com russian search engine
yep.com search engine
search.naver.com south korean search engine
yioop.com international search engine
yooz.ir iranian search engine
youdao.com chinese search engine
crawling rule(s) for above bots
disallow all other bots
----
Additional rules from: https://raw.githubusercontent.com/ai-robots-txt/ai.robots.txt/refs/heads/main/robots.txt
----

Back to top

Warnings

3 invalid lines.

Back to top

projects.gcbbs.netrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

Comments

Warnings

projects.gcbbs.net
robots.txt