r4re.art
robots.txt

Robots Exclusion Standard data for r4re.art

Resource Scan

Scan Details

Site Domain r4re.art
Base Domain r4re.art
Scan Status Ok
Last Scan2025-09-22T18:04:54+00:00
Next Scan 2025-10-22T18:04:54+00:00

Last Scan

Scanned2025-09-22T18:04:54+00:00
URL https://r4re.art/robots.txt
Domain IPs 104.21.39.218, 172.67.148.246, 2606:4700:3033::ac43:94f6, 2606:4700:3034::6815:27da
Response IP 104.21.39.218
Found Yes
Hash 5a7129277f59c4cb68ed3087f74d05c41aa0eeb8f15a49bc694bd796906efa8c
SimHash b052b91a0737

Groups

*

Rule Path
Disallow /cache/*
Disallow /components/*
Disallow /core/*
Disallow /static/*

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • https://developers.google.com/search/reference/robots_txt?csw=1#url-matching-based-on-path-values
  • https://support.google.com/webmasters/answer/6080548?hl=en
  • https://www.searchenginejournal.com/technical-seo/url-parameter-handling/?amp
  • Used for many other (non-commercial) purposes as well
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis only?
  • Multi-purpose, commercial uses; including LLMs