digidom.pro
robots.txt

Robots Exclusion Standard data for digidom.pro

Resource Scan

Scan Details

Site Domain digidom.pro
Base Domain digidom.pro
Scan Status Ok
Last Scan2025-11-22T21:59:58+00:00
Next Scan 2025-12-22T21:59:58+00:00

Last Scan

Scanned2025-11-22T21:59:58+00:00
URL https://digidom.pro/robots.txt
Redirect https://www.digidom.pro/robots.txt
Redirect Domain www.digidom.pro
Redirect Base digidom.pro
Domain IPs 35.246.248.138
Redirect IPs 35.242.229.239
Response IP 35.246.184.45
Found Yes
Hash 9e33fdbe2e4d1be2f4b5674392d2a383b1519ca2c69911c9c8ab5c6a1b6958cd
SimHash 711e9944ac34

Groups

*
gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

ccbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-web

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.digidom.pro/sitemap/sitemap.all.xml

Comments

  • LLM / IA Crawlers — autorisations explicites
  • OpenAI (ChatGPT / GPTBot)
  • Google-Extended (contrôle l’usage data par les produits IA de Google)
  • Common Crawl (utilisé par plusieurs modèles)
  • Anthropic (Claude)
  • Perplexity