mila-learn.com
robots.txt

Robots Exclusion Standard data for mila-learn.com

Resource Scan

Scan Details

Site Domain mila-learn.com
Base Domain mila-learn.com
Scan Status Ok
Last Scan2025-07-25T00:10:57+00:00
Next Scan 2025-08-24T00:10:57+00:00

Last Scan

Scanned2025-07-25T00:10:57+00:00
URL https://mila-learn.com/robots.txt
Redirect https://cdn.prod.website-files.com/6511391b1b938de715a35f8a/686e4c72f8dad5deccd2d7fc_robots.txt
Redirect Domain cdn.prod.website-files.com
Redirect Base website-files.com
Domain IPs 13.33.88.110, 13.33.88.6, 13.33.88.62, 13.33.88.8
Redirect IPs 104.18.160.117, 104.18.161.117, 2606:4700::6812:a075, 2606:4700::6812:a175
Response IP 104.18.160.117
Found Yes
Hash 7114a3f9db5e227d28fcc627f724aa34c7c378d6d19bc87090fbe7142b82b5f1
SimHash 6f12b963b737

Groups

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

ccbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

*

Rule Path
Allow /
Allow /llm.txt
Allow /IndexNow.json

Other Records

Field Value
sitemap https://poppins.io/sitemap.xml

Comments

  • Robots.txt pour poppins.io
  • Optimisé pour le GEO (Generative Engine Optimization)
  • Crawlers IA et LLMs bienvenus
  • Google et Bing
  • Autres crawlers autorisés
  • Fichiers importants pour le GEO
  • Sitemap