fedi.ctu.cx
robots.txt
Robots Exclusion Standard data for fedi.ctu.cx
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | fedi.ctu.cx |
Base Domain | ctu.cx |
Scan Status | Ok |
Last Scan | 2025-04-20T08:18:34+00:00 |
Next Scan | 2025-05-20T08:18:34+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-04-20T08:18:34+00:00 |
URL | https://fedi.ctu.cx/robots.txt |
Domain IPs | 194.59.205.194, 2a03:4000:34:23e::1 |
Response IP | 194.59.205.194 |
Found | Yes |
Hash | c5dd7a60e8be96c61817fa8a510d96ebc6d9029c294ce4760250c23aeb640afa |
SimHash | 362e4b58c1e4 |
Groups
ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
cohere-training-data-crawler
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
Rule | Path |
---|---|
Disallow | / |
awariorssbot
awariosmartbot
dataforseobot
magpie-crawler
meltwater
peer39_crawler
peer39_crawler/1.0
piplbot
scoop.it
seekr
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /auth/ |
Disallow | /oauth/ |
Disallow | /check_your_email |
Disallow | /wait_for_approval |
Disallow | /account_disabled |
Disallow | /signup |
Disallow | /fileserver/ |
Disallow | /users/ |
Disallow | /emoji/ |
Disallow | /admin |
Disallow | /user |
Disallow | /settings/ |
Disallow | /about/suspended |
Disallow | /.well-known/webfinger |
Disallow | /.well-known/nodeinfo |
Disallow | /nodeinfo/ |
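The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`, assuming a hand-reconstructed fragment of the file: the `ROBOTS_TXT` text, the `is_allowed` helper, the `ExampleBot` agent name, and the `/api/v1/instance` path are illustrative assumptions, not content taken from the scanned file itself.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the scanned file: a subset of the
# groups and rules listed in the report. The real file's layout and ordering
# are not shown in the report, so this text is an assumption.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /

User-agent: *
Disallow: /api/
Disallow: /users/
Disallow: /admin
Disallow: /about/suspended
"""

BASE_URL = "https://fedi.ctu.cx"

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path under the rules above."""
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    # ExampleBot is a made-up agent that falls through to the "*" group.
    for agent, path in [
        ("GPTBot", "/"),
        ("ExampleBot", "/api/v1/instance"),
        ("ExampleBot", "/about"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Python's parser matches rules by simple path prefix and matches agent names case-insensitively by substring, so its verdicts may differ from those of crawlers that implement other matching behavior.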
Other Records
Field | Value |
---|---|
crawl-delay | 500 |