romaricpascal.com
robots.txt

Robots Exclusion Standard data for romaricpascal.com

Resource Scan

Scan Details

Site Domain romaricpascal.com
Base Domain romaricpascal.com
Scan Status Ok
Last Scan 2025-10-27T06:33:56+00:00
Next Scan 2025-11-26T06:33:56+00:00

Last Scan

Scanned 2025-10-27T06:33:56+00:00
URL http://romaricpascal.com/robots.txt
Redirect https://romaricpascal.is/robots.txt
Redirect Domain romaricpascal.is
Redirect Base romaricpascal.is
Domain IPs 213.186.33.5
Redirect IPs 2001:41d0:301::20, 46.105.57.169
Response IP 46.105.57.169
Found Yes
Hash 97eb9767901aea14e444324f978fbebeefab87aa31c67319de1eb39a46f69d2d
SimHash b2640ac04503

Groups

*

Rule Path
Disallow /creations/
Disallow /fr/creations/
Disallow /playing/
Disallow /fr/playing/

google-extended

Rule Path
Disallow /

Comments

  • Private pages
  • Google's AI training, which work only through robots.txt
  • https://neil-clarke.com/block-the-bots-that-feed-ai-models-by-scraping-your-website/
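
As a rough illustration of how the groups listed above would be applied by a crawler, the sketch below rebuilds a robots.txt from the reported rules and queries it with Python's standard `urllib.robotparser`. The rebuilt text is an assumption: the report lists only the parsed rules and comments, so the original file's exact layout (and therefore its hash) is not reproduced here. `SomeBot` is a hypothetical user agent standing in for any crawler that has no group of its own.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Groups" tables in this report. The real file's
# exact wording, ordering, and comment placement are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /creations/
Disallow: /fr/creations/
Disallow: /playing/
Disallow: /fr/playing/

User-agent: google-extended
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "SomeBot" (hypothetical) has no dedicated group, so the "*" group applies.
print(parser.can_fetch("SomeBot", "/creations/"))      # False
print(parser.can_fetch("SomeBot", "/about/"))          # True

# The google-extended group disallows the whole site.
print(parser.can_fetch("Google-Extended", "/about/"))  # False
```

Parsing from a string keeps the check offline; pointing the parser at the live file with `set_url()` and `read()` would instead evaluate whatever the server currently returns, which may differ from this scan.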