
Robots Exclusion Standard data for fishbase.org

Resource Scan

Scan Details

Site Domain fishbase.org
Base Domain fishbase.org
Scan Status Ok
Last Scan 2025-06-15T14:46:24+00:00
Next Scan 2025-07-15T14:46:24+00:00

Last Scan

Scanned 2025-06-15T14:46:24+00:00
URL https://www.fishbase.org/robots.txt
Domain IPs 192.134.151.83, 212.201.155.242
Response IP 192.134.151.83
Found Yes
Hash 9b1c0d4f7620506008772a75bf118f44bb0c9833bb82bbda21287de4b7b84fe9
SimHash ee744d5bc686

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/

Other Records

Field Value
crawl-delay 10

gptbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
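
To see how the groups above would apply to a given crawler, the following is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is rebuilt from this report, so its exact layout (including the placement of `Crawl-delay` inside the `*` group) is an assumption, not the verbatim file. The agent name `examplebot` and the path `/some/page` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agents the report lists with a blanket "Disallow /" rule.
BLOCKED_AGENTS = [
    "gptbot", "bytespider", "ccbot", "ahrefsbot", "semrushbot",
    "mj12bot", "dotbot", "mauibot", "dataforseobot", "petalbot",
]


def reconstruct_robots_txt():
    """Rebuild robots.txt text from the scan report (layout is assumed)."""
    lines = [
        "User-agent: *",
        "Disallow: /cgi-bin/",
        "Disallow: /tmp/",
        "Crawl-delay: 10",
        "",
    ]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


def load_rules(text):
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(reconstruct_robots_txt())

BASE = "https://www.fishbase.org"
# "examplebot" is a hypothetical agent that falls under the "*" group.
generic_cgi = rules.can_fetch("examplebot", BASE + "/cgi-bin/")
generic_page = rules.can_fetch("examplebot", BASE + "/some/page")  # hypothetical path
gptbot_root = rules.can_fetch("gptbot", BASE + "/")
generic_delay = rules.crawl_delay("examplebot")
```

Under these assumptions, an agent matched only by `*` is refused `/cgi-bin/` and `/tmp/`, allowed elsewhere, and asked to wait 10 seconds between requests, while each named agent is refused every path.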

Comments

  • Please note: There are a lot of pages on this site, and there are some misbehaved spiders out there that go _way_ too fast. If you're irresponsible, your access to the site may be blocked.
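
The comment above, together with the `crawl-delay 10` record, asks crawlers to pace their requests. One way a client might honor that is a small throttle that enforces a minimum gap between consecutive requests. This is a sketch only: the `Throttle` class and its parameters are hypothetical names, and the 10-second value is taken from the record in this report. The clock and sleep functions are injectable so the logic can be exercised without actually waiting.

```python
import time


class Throttle:
    """Enforce a minimum gap, in seconds, between consecutive requests."""

    def __init__(self, delay_seconds, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Sleep until the gap has elapsed; return the seconds slept."""
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            # Only the remaining part of the gap needs to be waited out.
            pause = max(0.0, self.delay - (now - self._last))
            if pause > 0:
                self._sleep(pause)
        self._last = now + pause
        return pause
```

A crawler would create `Throttle(10)` once and call `wait()` immediately before each request to the site. The first call returns at once, and later calls sleep only for whatever part of the 10 seconds has not already passed.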