openriskmanual.org
robots.txt
Robots Exclusion Standard data for openriskmanual.org
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | openriskmanual.org |
| Base Domain | openriskmanual.org |
| Scan Status | Ok |
| Last Scan | 2025-11-28T21:46:43+00:00 |
| Next Scan | 2025-12-12T21:46:43+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-28T21:46:43+00:00 |
| URL | https://openriskmanual.org/robots.txt |
| Domain IPs | 104.21.93.75, 172.67.206.166, 2606:4700:3032::ac43:cea6, 2606:4700:3034::6815:5d4b |
| Response IP | 172.67.206.166 |
| Found | Yes |
| Hash | b1e1da3db8b3029355a9ed399f09e3ef675be7e40b6849be1f65aa165f272be6 |
| SimHash | 46374913c594 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /wiki/Special%3A |
| Disallow | /wiki/Special%3A |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
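A `crawl-delay` of 10 is conventionally read as a request to wait about 10 seconds between fetches, although the field is not part of RFC 9309 and crawlers differ in whether they honor it. The following is a minimal sketch of how a client might read and apply the value. It assumes the record belongs to the wildcard group (the report does not show the raw file), and `ExampleBot` and `seconds_to_wait` are hypothetical names used only for illustration.

```python
import urllib.robotparser

# Assumed reconstruction: the report does not show the raw file, so this
# attaches the crawl-delay record to a single wildcard group.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Crawl-delay: 10
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a hypothetical crawler name that is not in the blocked list.
delay = parser.crawl_delay("ExampleBot")


def seconds_to_wait(last_request: float, now: float, delay_s: float) -> float:
    """Return how long to sleep so requests are at least delay_s apart."""
    return max(0.0, delay_s - (now - last_request))
```

A crawler loop could call `time.sleep(seconds_to_wait(last, time.monotonic(), delay))` before each request.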
ai2bot
ai2bot-dolma
amazonbot
ahrefsbot
applebot
applebot-extended
bytespider
brightbot
barkrowler
ccbot
chatgpt-user
claude-web
claudebot
chatglm-spider
diffbot
dotbot
facebookbot
friendlycrawler
gptbot
icc-crawler
imagesiftbot
iboubot
meta-externalagent
meta-externalfetcher
mj12bot
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
thinkbot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
iaskspider/2.0
img2dataset
omgili
omgilibot
semrushbot
| Rule | Path |
|---|---|
| Disallow | / |
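The groups above can be checked programmatically. The sketch below uses Python's standard `urllib.robotparser` on an assumed reconstruction of the file: only three of the listed agents are included for brevity, and the page path `/wiki/Main_Page`, the agent name `ExampleBot`, and the helper `is_allowed` are hypothetical.

```python
import urllib.robotparser

# Assumed reconstruction from the report; only a few of the listed agents
# are included to keep the sketch short.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
Disallow: /
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

PAGE = "https://openriskmanual.org/wiki/Main_Page"  # hypothetical page path


def is_allowed(user_agent: str, url: str = PAGE) -> bool:
    """Return True if the reconstructed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


for agent in ("GPTBot", "CCBot", "ExampleBot"):
    print(agent, is_allowed(agent))
```

Named agents fall under the `Disallow: /` group, while any other agent falls back to the wildcard group. Note that `urllib.robotparser` keeps only the first `*` group it sees and applies rules in file order, so a file with two wildcard groups, as in this report, may not be evaluated the way the group merging and longest-match rules of RFC 9309 would.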
Warnings
- `content-signal` is not a known field.
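A warning of this kind can be produced by comparing each field name in the file against a list of recognized fields. The sketch below illustrates the idea; the `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not shown), and the `Content-Signal` value in `SAMPLE` is a placeholder because the report does not include the original line.

```python
# Assumed set of recognized fields; the scanner's actual list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return lowercased field names, in first-seen order, not in KNOWN_FIELDS."""
    seen: list[str] = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if ":" not in line:
            continue  # blank or malformed line, nothing to check
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Placeholder sample; the real Content-Signal value is not in the report.
SAMPLE = """\
User-agent: *
Content-Signal: placeholder-value
Allow: /
"""

warnings = [f"`{name}` is not a known field." for name in unknown_fields(SAMPLE)]
```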
Comments