opensanctions.org
robots.txt

Robots Exclusion Standard data for opensanctions.org

Resource Scan

Scan Details

Site Domain opensanctions.org
Base Domain opensanctions.org
Scan Status Ok
Last Scan 2026-01-11T10:47:59+00:00
Next Scan 2026-02-10T10:47:59+00:00

Last Scan

Scanned 2026-01-11T10:47:59+00:00
URL https://opensanctions.org/robots.txt
Redirect https://www.opensanctions.org:443/robots.txt
Redirect Domain www.opensanctions.org
Redirect Base opensanctions.org
Domain IPs 34.54.84.66
Redirect IPs 34.54.84.66
Response IP 34.54.84.66
Found Yes
Hash ec141fcf867507403a74e169a694e72269c00419387294fd19af157f4c6e5178
SimHash 780619119ed7
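
The Hash field is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a scanner could fingerprint the fetched robots.txt body as follows to detect changes between scans. The function name `fingerprint` and the two sample bodies are hypothetical and are not taken from the scanned file.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    SHA-256 is an assumption; the report only shows a 64-character hex value.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies standing in for two successive fetches.
previous = b"User-agent: *\nDisallow: /admin/\n"
current = b"User-agent: *\nDisallow: /admin/\nDisallow: /account/\n"

# A differing digest signals that the file changed since the last scan.
changed = fingerprint(previous) != fingerprint(current)
print(len(fingerprint(current)), changed)
```

An exact digest only says whether the bytes changed at all. The separate SimHash field presumably supports a similarity comparison, but its construction is not described in the report.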

Groups

googlebot
bingbot

Rule Path
Allow /
Disallow /impressum/
Disallow /account/
Disallow /admin/

semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
splitsignalbot
baiduspider
bytespider
imagesiftbot
serpstatbot
blexbot
chatglm-spider
yacybot
dataforseobot
ahrefsbot
addsearchbot
agentic
ai article writer
ai content detector
ai dungeon
ai search engine
ai seo crawler
ai writer
ai21 labs
ai2bot
aibot
aimatrix
aisearchbot
ai training
aitraining
alexa
alpha ai
alphaai
amazon bedrock
amazon-kendra
amazon lex
amazon comprehend
amazon sagemaker
amazon silk
amazon textract
amazonbot
amelia
anderspinkbot
anypicker
anyword
applebot
aria browse
articoolo
automated writer
awariorssbot
awariosmartbot
bardbot
brave leo
bytedance
bytespider
catboost
chatglm
chinchilla
clearscope
cohere
content harmony
content king
content optimizer
content samurai
contentatscale
contentbot
contentedge
conversion ai
copyai
copymatic
copyscape
cotoyogi
crawlq ai
crawlspace
crew ai
crewai
dall-e
dataforseobot
dataprovider
deepai
deepmind
deepseek
diffbot
doubao ai
duckassistbot
firecrawl
flyriver
frase ai
friendlycrawler
gemma
genai
goose
grendizer
grok
gt bot
gtbot
hemingway editor
hugging face
hypotenuse ai
iaskspider
icc-crawler
imagesiftbot
img2dataset
ink editor
inkforall
intelliseek
inferkit
isscyberriskcrawler
jasperai
kafkai
kangaroo
keyword density ai
komobot
llama
magpie-crawler
marketmuse
meltwater
metatagbot
narrative
neevabot
neural text
neuralseo
oai-searchbot
omgili
openbot
opentext ai
outwrite
page analyzer ai
pangubot
paperlibot
paraphraser.io
petalbot
phindbot
piplbot
prowritingaid
quillbot
robotspider
rytr
saplingai
scalenut
scraper
scrapy
scriptbook
seo content machine
seo robot
sentibot
sidetrade
simplified ai
skydancer
slickwrite
spin rewriter
spinbot
stability
stablediffusionbot
sudowrite
surfer ai
text blaze
textcortex
the knowledge ai
timpibot
vidnami ai
webzio
whisper
wordai
wordtune
wormsgtp
wpbot
writecream
writerzen
writescope
writesonic
xai
xbot
youbot
zero gtp
zerochat
zimm
yandex
yandexbot
dotbot
mithril-crawler
aliyunsecbot

Rule Path
Disallow /

facebookexternalhit
facebookcatalog
meta-externalagent
facebookbot
facebookexternalhit
open ai
openai
gpt
grammarly
anthropic
claude
common crawl
commoncrawl
meta ai
meta-ai
meta-external
metaai
copilot
deepl
bingai
bingbot-chat
mistral
google bard ai
google-cloudvertexbot
google-extended
googleother
gemini
perplexitybot

Rule Path
Disallow /advancedsearch/
Disallow /impressum/
Disallow /account/
Disallow /admin/

*

Rule Path
Disallow /impressum/
Disallow /account/
Disallow /admin/
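
The groups above can be read with the longest-match rule from RFC 9309: among all rules whose path is a prefix of the requested path, the longest one wins, an Allow wins a tie, and a path with no matching rule is allowed. The following is a minimal sketch of that evaluation, not a full parser. The agent lists are abbreviated from the report, agent lookup is simplified to an exact lowercase token match, wildcards in paths are not handled, and the names `GROUPS`, `rules_for`, `is_allowed` and the path `/datasets/` are hypothetical.

```python
# Rule groups transcribed from the report (agent lists abbreviated).
COMMON = [("disallow", "/impressum/"), ("disallow", "/account/"), ("disallow", "/admin/")]

GROUPS = {
    ("googlebot", "bingbot"): [("allow", "/")] + COMMON,
    ("semrushbot", "bytespider", "ahrefsbot"): [("disallow", "/")],
    ("google-extended", "perplexitybot", "claude"): [("disallow", "/advancedsearch/")] + COMMON,
    ("*",): COMMON,
}


def rules_for(agent: str):
    """Pick the group naming this agent, else fall back to the '*' group."""
    agent = agent.lower()
    fallback = []
    for agents, rules in GROUPS.items():
        if agent in agents:
            return rules
        if "*" in agents:
            fallback = rules
    return fallback


def is_allowed(agent: str, path: str) -> bool:
    """Apply longest-match precedence; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len, allowed = len(prefix), kind == "allow"
    return allowed


for agent, path in [
    ("googlebot", "/admin/users"),
    ("googlebot", "/datasets/"),
    ("semrushbot", "/datasets/"),
    ("claude", "/advancedsearch/"),
    ("somebot", "/advancedsearch/"),
]:
    print(agent, path, is_allowed(agent, path))
```

Under this reading, the `Allow /` line in the googlebot and bingbot group does not override the three more specific Disallow lines, because each of those paths is longer than `/`. Parsers that apply rules in file order instead of by longest match may resolve that group differently.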

Other Records

Field Value
sitemap https://www.opensanctions.org/sitemap.xml

Warnings

  • 1 invalid line.