infili.de
robots.txt

Robots Exclusion Standard data for infili.de

Resource Scan

Scan Details

Site Domain infili.de
Base Domain infili.de
Scan Status Ok
Last Scan2025-10-25T20:36:13+00:00
Next Scan 2025-11-01T20:36:13+00:00

Last Scan

Scanned2025-10-25T20:36:13+00:00
URL https://infili.de/robots.txt
Domain IPs 104.21.72.187, 172.67.153.221, 2606:4700:3031::6815:48bb, 2606:4700:3034::ac43:99dd
Response IP 104.21.72.187
Found Yes
Hash 8b7e360644877ae60be14afd9e8a0a9ea5d46157890005ebc3222c4fc7cc365f
SimHash 4018d1b1e991

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

ccbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

applebot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

amazonbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

youbot

Rule Path
Allow /

diffbot

Rule Path
Allow /

facebookbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

slurp

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

whatsapp

Rule Path
Allow /

baiduspider

Rule Path
Allow /

yandexbot

Rule Path
Allow /

semrushbot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

mj12bot

Rule Path
Allow /

Comments

  • AI and ML crawlers - Explicitly allowed