opalesurfcasting.net
robots.txt

Robots Exclusion Standard data for opalesurfcasting.net

Resource Scan

Scan Details

Site Domain opalesurfcasting.net
Base Domain opalesurfcasting.net
Scan Status Ok
Last Scan2026-01-22T20:39:05+00:00
Next Scan 2026-01-29T20:39:05+00:00

Last Scan

Scanned2026-01-22T20:39:05+00:00
URL https://opalesurfcasting.net/robots.txt
Redirect https://www.opalesurfcasting.net/robots.txt
Redirect Domain www.opalesurfcasting.net
Redirect Base opalesurfcasting.net
Domain IPs 109.238.10.175
Redirect IPs 109.238.10.175
Response IP 109.238.10.175
Found Yes
Hash e4d67707b0dac621e1a75f0822a784f6f1f7281cd65c48af5086be8879c48a81
SimHash 4c16df5d2653

Groups

*

Rule Path
Allow /edupict/www/images/
Allow /local/cache-css/
Allow /local/cache-js/
Allow /local/cache-vignettes/
Allow /local/cache-TeX/
Allow /local/cache-gd2/
Allow /plugins-dist/medias/prive/vignettes/
Allow /squelettes/css/
Allow /squelettes/js/
Allow /squelettes/tarteaucitron/
Allow /squelettes/inclure/lamaillebanner/
Allow /squelettes/NAVPICS/
Allow /squelettes/epnjs/dist/epn.js
Allow /squelettes/puce.gif
Allow /geoportail/
Disallow /local/
Disallow /plugins-dist/
Disallow /lib/
Disallow /plugins/
Disallow /prive/
Disallow /squelettes-dist/
Disallow /squelettes/
Allow /prive/javascript/SearchHighlight.js
Allow /plugins/auto/saisies/v3.56.6/css/saisies.css
Allow /plugins/auto/saisies/v3.56.6/javascript/saisies.js

googlebot-image

Rule Path
Disallow /IMG/jpg/la_rochelle_pallice_pont_de_re.jpg
Disallow /IMG/merlan.jpg
Disallow /IMG/jpg/merlan-oye-plage.jpg
Disallow /local/cache-vignettes/L300xH225/xmerlan-oye-plage-8ab70.jpg.pagespeed.ic.YAamfNHyvXrXDBFSUtII.jpg

baiduspider-image

Rule Path
Allow /edupict/www/images/
Allow /local/cache-css/
Allow /local/cache-js/
Allow /local/cache-vignettes/
Allow /local/cache-TeX/
Allow /plugins-dist/medias/prive/vignettes/
Allow /squelettes/css/
Allow /squelettes/mes_fonctions_geo_carto.js
Allow /IMG/logo_opale_surfcasting.jpg
Disallow /local/
Disallow /plugins-dist/
Disallow /lib/
Disallow /plugins/
Disallow /prive/
Disallow /squelettes-dist/
Disallow /squelettes/
Disallow /IMG/jpg/
Disallow /IMG/*.jpg$
Disallow /wetterzentrale/

websiteoutlook
cutestat
velenpublicwebcrawler
mixnodecache
ltx71
turnitinbot
seobility
the knowledge ai
yisouspider
serpstatbot
dataforseobot

Rule Path
Disallow /

gptbot
ccbot
oai-searchbot
claudebot
claude-web
claude-searchbot
claude-user
perplexitybot
pplcrawl
pplscout
mistralbot
google-extended
google-ai
googleother
googleother-image
googleother-video
amazonbot
bytespider
img2dataset
cohere-ai
diffbot
awariorssbot
awariosmartbot
friendlycrawler
meltwater
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
piplbot
scoop.it
seekr
youbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

perplexityuser

Rule Path
Allow /

perplexity-webview

Rule Path
Allow /

perplexityapp

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.opalesurfcasting.net/sitemap.xml

Comments

  • robots.txt
  • @url: https://www.opalesurfcasting.net
  • @generator: SPIP 3.2.19
  • @template: squelettes/robots.txt.html
  • ------------------------------------------------------------
  • 1) Règles SPIP standards (inchangées)
  • ------------------------------------------------------------
  • Exceptions SPIP
  • ------------------------------------------------------------
  • 2) Règles images pour Googlebot-Image (inchangé)
  • ------------------------------------------------------------
  • ------------------------------------------------------------
  • 3) Baiduspider-image (inchangé)
  • ------------------------------------------------------------
  • ------------------------------------------------------------
  • 4) Crawlers indésirables classiques (scrapers, SEO spammers)
  • ------------------------------------------------------------
  • ------------------------------------------------------------
  • 5) IA – BLOQUÉES (entraînement, index massifs, scrapers IA)
  • ------------------------------------------------------------
  • OpenAI (entraînement + search)
  • Anthropic / Claude
  • Perplexity – crawlers
  • Mistral
  • Google AI (hors indexation classique)
  • Amazon / Meta / autres scrapers IA
  • Commercial NLP scrapers
  • ------------------------------------------------------------
  • 6) IA – AUTORISÉES (trafic humain réel)
  • ------------------------------------------------------------
  • ChatGPT – accès déclenché par un utilisateur (Browser Mode)
  • Perplexity – trafic humain via WebView / App
  • ------------------------------------------------------------
  • 7) Sitemap
  • ------------------------------------------------------------