Robots Exclusion Standard data for ipni.org

Resource Scan

Scan Details

Site Domain ipni.org
Base Domain ipni.org
Scan Status Ok
Last Scan 2025-10-28T12:12:34+00:00
Next Scan 2025-11-27T12:12:34+00:00

Last Scan

Scanned 2025-10-28T12:12:34+00:00
URL https://ipni.org/robots.txt
Domain IPs 35.189.213.105
Response IP 35.189.213.105
Found Yes
Hash c8a666fb158a0e9602a040e0762f6488ebc083a76c3169a72dcd3b4774143f7d
SimHash d654dd71ea0b

Groups

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

ru_bot

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

rytebot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

keys-so-bot

Rule Path
Disallow /

yak

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

aliyunsecbot

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

farmerduckybot

Rule Path
Disallow /
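
Every group listed above carries the same single rule, `Disallow /`, which asks the named crawler to stay away from the whole site. The sketch below illustrates how such groups behave when evaluated with Python's standard `urllib.robotparser`. It is an illustration under assumptions, not a copy of the real file: the robots.txt text is rebuilt from a handful of the agent names in the scan, the helper names `build_robots_txt` and `is_allowed` are hypothetical, and `SomeOtherBot` is a made-up agent. The scan lists no `*` group, so an unlisted agent is treated as allowed here; the live file may differ, since the scan also reports one invalid line.

```python
from urllib.robotparser import RobotFileParser

# A few of the user-agent groups from the scan; each one has "Disallow: /".
BLOCKED_AGENTS = ["claudebot", "gptbot", "ahrefsbot", "bytespider"]


def build_robots_txt(agents):
    """Rebuild a robots.txt body with one 'Disallow: /' group per agent."""
    lines = []
    for agent in agents:
        # A blank line terminates each group.
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


robots_txt = build_robots_txt(BLOCKED_AGENTS)

# A listed agent is refused everywhere (agent matching is case-insensitive).
print(is_allowed(robots_txt, "GPTBot", "https://ipni.org/"))

# An agent with no matching group and no '*' fallback is allowed.
print(is_allowed(robots_txt, "SomeOtherBot", "https://ipni.org/"))
```

To check the live rules instead of a reconstruction, `RobotFileParser` can be pointed at the scanned URL with `set_url("https://ipni.org/robots.txt")` followed by `read()`.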

Warnings

  • 1 invalid line.