clarinette.net
robots.txt

Robots Exclusion Standard data for clarinette.net

Resource Scan

Scan Details

Site Domain clarinette.net
Base Domain clarinette.net
Scan Status Ok
Last Scan2025-11-16T02:21:14+00:00
Next Scan 2025-12-16T02:21:14+00:00

Last Scan

Scanned2025-11-16T02:21:14+00:00
URL https://clarinette.net/robots.txt
Domain IPs 2001:8d8:100f:f000::21b, 217.160.0.216
Response IP 217.160.0.216
Found Yes
Hash 94410a74976cd3c44a551ce7d773d3062390f5b04cb5be5ab7f3dd944481cc28
SimHash f01c93e0cd90

Groups

gptbot
chatgpt-user
google-extended
ccbot
perplexitybot
omgilibot
omgili
facebookbot
diffbot
bytespider
imagesiftbot
cohere-ai
amazonbot
anthropic-ai
claude-web
claudebot
applebot
youbot

Rule Path
Disallow /

semrushbot
ahrefsbot
barkrowler
trendictionbot
seekportbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

Comments

  • bots marketing
  • bot huawei (search and "IA" recommendations)