nicolasgrenie.com
robots.txt

Robots Exclusion Standard data for nicolasgrenie.com

Resource Scan

Scan Details

Site Domain nicolasgrenie.com
Base Domain nicolasgrenie.com
Scan Status Ok
Last Scan 2025-11-20T19:11:45+00:00
Next Scan 2025-12-04T19:11:45+00:00

Last Scan

Scanned 2025-11-20T19:11:45+00:00
URL https://nicolasgrenie.com/robots.txt
Domain IPs 185.158.133.1
Response IP 185.158.133.1
Found Yes
Hash dbb4f499ad865cc6f80528efcc58f77581a4f54d4b4a20d9c389c96240452bd9
SimHash 221c99cd2771
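
A fetched copy of the file could be compared against the recorded Hash value along these lines. This is a minimal sketch that assumes the hash is a SHA-256 digest of the raw response body; the report does not name the algorithm, and the 64-hex-character length is merely consistent with SHA-256. The helper names `sha256_hex` and `matches_recorded_hash` are hypothetical.

```python
import hashlib

# Hash value as recorded in the scan above.
RECORDED_HASH = "dbb4f499ad865cc6f80528efcc58f77581a4f54d4b4a20d9c389c96240452bd9"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a fetched body against the recorded hash.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    If it normalizes the text first, this comparison will not match.
    """
    return sha256_hex(body) == recorded.strip().lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the scanner hashes a normalized form of the text.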

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

ccbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /
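
The groups above can be sketched as a robots.txt file and checked with Python's standard-library `urllib.robotparser`. The rendered text is an assumption: the scan lists parsed groups, not the original file, so the real file's ordering, casing, and comment lines may differ. The sitemap URL is taken from the sitemap record in this report; the helper names `render_robots_txt` and `build_parser` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents as listed in the scan's Groups section.
USER_AGENTS = [
    "*", "googlebot", "bingbot", "twitterbot", "facebookexternalhit",
    "gptbot", "google-extended", "ccbot", "anthropic-ai", "claude-web",
]
# Sitemap record reported by the scan.
SITEMAP_URL = "https://nicolasgrenie.com/sitemap.xml"


def render_robots_txt(agents, sitemap_url):
    """Render one 'Allow: /' group per agent, followed by the sitemap record."""
    groups = [f"User-agent: {agent}\nAllow: /\n" for agent in agents]
    return "\n".join(groups) + f"\nSitemap: {sitemap_url}\n"


def build_parser(robots_txt):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(render_robots_txt(USER_AGENTS, SITEMAP_URL))

# Every listed group allows "/", so any path should be fetchable.
print(parser.can_fetch("GPTBot", "https://nicolasgrenie.com/llms.txt"))
# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Because every group carries the same single `Allow /` rule, the per-agent groups behave identically to the `*` group under this parser; listing them separately mainly documents which crawlers the site owner had in mind.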

Other Records

Field Value
sitemap https://nicolasgrenie.com/sitemap.xml

Comments

  • Important crawlers
  • AI Crawlers
  • LLMs file for AI training data
  • See: https://nicolasgrenie.com/llms.txt