marinetraffic.org
robots.txt

Robots Exclusion Standard data for marinetraffic.org

Resource Scan

Scan Details

Site Domain marinetraffic.org
Base Domain marinetraffic.org
Scan Status Ok
Last Scan2026-02-07T09:40:43+00:00
Next Scan 2026-02-14T09:40:43+00:00

Last Scan

Scanned2026-02-07T09:40:43+00:00
URL https://marinetraffic.org/robots.txt
Redirect https://www.marinetraffic.org/robots.txt
Redirect Domain www.marinetraffic.org
Redirect Base marinetraffic.org
Domain IPs 104.26.14.67, 104.26.15.67, 172.67.72.96, 2606:4700:20::681a:e43, 2606:4700:20::681a:f43, 2606:4700:20::ac43:4860
Redirect IPs 104.26.14.67, 104.26.15.67, 172.67.72.96, 2606:4700:20::681a:e43, 2606:4700:20::681a:f43, 2606:4700:20::ac43:4860
Response IP 104.26.15.67
Found Yes
Hash 8dcc8c2332fd9663da6b5c01b72bb6329e103ccf143d741f6765436290c236c3
SimHash 6858d940e314

Groups

*

Rule Path
Allow

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

seekr

Rule Path
Disallow /

Comments

  • Block Open AI
  • Block Google (Gemini)
  • Block Claude
  • Block CommonCrawl
  • Block Diffbot
  • Block Meta (Facebook)
  • Block ByteDance
  • Block Webz.io
  • Block ImagesiftBot
  • Block Meltwater
  • Block Seekr