messari.io
robots.txt

Robots Exclusion Standard data for messari.io

Resource Scan

Scan Details

Site Domain messari.io
Base Domain messari.io
Scan Status Ok
Last Scan2024-09-17T02:29:36+00:00
Next Scan 2024-10-17T02:29:36+00:00

Last Scan

Scanned2024-09-17T02:29:36+00:00
URL https://messari.io/robots.txt
Domain IPs 104.18.6.70, 104.18.7.70, 2606:4700::6812:646, 2606:4700::6812:746
Response IP 104.18.7.70
Found Yes
Hash 714ae175abc5d484c4e94dd0ab2d3096166295a47fbb9b28b32bd38a02ad059f
SimHash d0194840c092

Groups

*

Rule Path
Disallow /pdf/*
Disallow /report-pdf/*

gptbot
google-extended
ccbot
chatgpt-user
anthropic-ai
omgilibot
omgili
facebookbot
diffbot
bytespider
imagesiftbot
cohere-ai
claude-web
perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://messari.io/sitemaps/sitemap.xml