crociere.com
robots.txt

Robots Exclusion Standard data for crociere.com

Resource Scan

Scan Details

Site Domain crociere.com
Base Domain crociere.com
Scan Status Ok
Last Scan2024-09-17T04:21:43+00:00
Next Scan 2024-10-17T04:21:43+00:00

Last Scan

Scanned2024-09-17T04:21:43+00:00
URL https://crociere.com/robots.txt
Redirect https://www.crociere.com/robots.txt
Redirect Domain www.crociere.com
Redirect Base crociere.com
Domain IPs 18.172.170.36, 18.172.170.57, 18.172.170.7, 18.172.170.8
Redirect IPs 18.155.68.105, 18.155.68.127, 18.155.68.27, 18.155.68.31
Response IP 18.155.68.127
Found Yes
Hash 016b44d0a5bdf15733be72a764efeedf242bc4e82e281ab70889d3a21caf0ae3
SimHash 0843df92e780

Groups

*

Rule Path
Disallow /

googlebot
googlebot-image
mediapartners-google
googlebot-news
googlebot-video
google-extended
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
bingbot
adidxbot
duckduckbot
slurp
oncrawl
qwantify
pinterest
ahrefsbot
ahrefssiteaudit
semrushbot
ahrefssiteaudit
barkrowler
chatgpt-user
gptbot

Rule Path
Allow /
Disallow /ajax/affinage.php
Disallow /ajax/dateJackpot.php
Disallow /ajax/fiche.php
Disallow /ajax/jourferie.php
Disallow /ajax/moteur.php
Disallow /ajax/similaires.php
Disallow /ajax_jourferie.php
Disallow /ajax_moteur.php
Disallow /ajax_moteur_affinage.php
Disallow /ajax_moteurBateau.php
Disallow /ajax_moteurCompagnie.php
Disallow */action?
Disallow /search?
Disallow /search_resa?
Disallow */package-*
Disallow */Mj*
Disallow /fp-*

adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps

Rule Path
Allow /********************

feedfetcher-google
googleproducer

Rule Path
Allow /********************
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow *package2.html
Disallow /comp_101/
Disallow /ajax/
Disallow /cron/
Disallow /listeV4.php
Disallow /liste.php
Disallow /liste-produits/
Allow /*.jpg************
Allow /*.jpeg************
Allow /*.png************
Allow /*.gif************
Allow /*.webp************
Allow /*.svg************
Allow /*.tif************
Allow /*.mp4***********
Allow /*.css******************
Allow /*.js****************
Allow /*.pdf***************

Other Records

Field Value
sitemap https://www.crociere.com/sitemaps/it/sitemap-index.xml

Comments

  • Block access to all UA by default
  • Allow access to specific UA
  • Disallow miscellaneous page pattern
  • Allow search to Ads-bot
  • Allow Discover
  • Disallow Blog URL pattern
  • Allow images and special medias