crocieraonline.com
robots.txt

Robots Exclusion Standard data for crocieraonline.com

Resource Scan

Scan Details

Site Domain crocieraonline.com
Base Domain crocieraonline.com
Scan Status Ok
Last Scan 2024-09-22T16:40:55+00:00
Next Scan 2024-10-22T16:40:55+00:00

Last Scan

Scanned 2024-09-22T16:40:55+00:00
URL https://crocieraonline.com/robots.txt
Redirect https://www.crocieraonline.com/robots.txt
Redirect Domain www.crocieraonline.com
Redirect Base crocieraonline.com
Domain IPs 13.35.18.109, 13.35.18.5, 13.35.18.83, 13.35.18.96
Redirect IPs 108.157.254.100, 108.157.254.20, 108.157.254.66, 108.157.254.82
Response IP 108.157.254.82
Found Yes
Hash 507bb6085927837ad3b6b8358910718a18efcdab524132dc77679a0103798678
SimHash 4e43dd98e700
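
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not say which algorithm was used or exactly which bytes were hashed, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw response body. The function name body_digest and the sample bytes are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by the site.
sample = b"User-agent: *\nDisallow: /\n"
digest = body_digest(sample)
print(len(digest), digest)
```

Comparing a digest like this between scans is one way to detect that the file changed. The SimHash field presumably serves to detect near-duplicate content, but its construction is not described in the report.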

Groups

*

Rule Path
Disallow /
Disallow /ajax/affinage.php
Disallow /ajax/dateJackpot.php
Disallow /ajax/fiche.php
Disallow /ajax/jourferie.php
Disallow /ajax/moteur.php
Disallow /ajax/similaires.php
Disallow /ajax_jourferie.php
Disallow /ajax_moteur.php
Disallow /ajax_moteur_affinage.php
Disallow /ajax_moteurBateau.php
Disallow /ajax_moteurCompagnie.php
Disallow */action?
Disallow /search?
Disallow /search_resa?
Disallow */package-*
Disallow /fp-*
Disallow */tag/*

googlebot
googlebot-image
mediapartners-google
googlebot-news
googlebot-video
google-extended
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
bingbot
adidxbot
duckduckbot
slurp
oncrawl
qwantify
pinterest
ahrefsbot
ahrefssiteaudit
semrushbot
barkrowler
chatgpt-user
gptbot

Rule Path
Allow /
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow *package2.html
Disallow /comp_101/
Disallow /ajax/
Disallow /cron/
Disallow /listeV4.php
Disallow /liste-produits/
Disallow */tag/*
Disallow /fp-*
Disallow */blog-viaggio/?s=*
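
The group above mixes Allow and Disallow rules, so the outcome for a given path depends on precedence. A minimal sketch of the longest-match evaluation described in RFC 9309 follows: "*" matches any run of characters, a trailing "$" anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are hypothetical, the test paths are invented, and percent-encoding normalization is omitted. Individual crawlers may behave differently.

```python
import re

# Rules copied from the named-crawler group listed above.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "*package2.html"),
    ("disallow", "/comp_101/"),
    ("disallow", "/ajax/"),
    ("disallow", "/cron/"),
    ("disallow", "/listeV4.php"),
    ("disallow", "/liste-produits/"),
    ("disallow", "*/tag/*"),
    ("disallow", "/fp-*"),
    ("disallow", "*/blog-viaggio/?s=*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any character run.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; no matching rule means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as the rules require.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical paths for illustration only.
for path in ["/", "/wp-admin/", "/wp-admin/admin-ajax.php", "/ajax/fiche.php"]:
    print(path, is_allowed(path))
```

Under this evaluation /wp-admin/admin-ajax.php stays reachable even though /wp-admin/ is disallowed, because the Allow pattern is the longer match.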

feedfetcher-google
googleproducer

Rule Path
Allow /********************
Allow /*.jpg************
Allow /*.jpeg************
Allow /*.png************
Allow /*.gif************
Allow /*.webp************
Allow /*.svg************
Allow /*.tif************
Allow /*.mp4***********
Allow /*.css******************
Allow /*.js****************
Allow /*.pdf***************
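
The report lists three groups: a default "*" group, a group for named crawlers, and a group for feedfetcher-google and googleproducer. A minimal sketch of how a crawler might pick its group follows, assuming the case-insensitive product-token match with fallback to "*" that RFC 9309 describes. The labels and the names GROUP_OF and select_group are hypothetical, and real crawlers may use looser matching (for example, substring matching on the full User-Agent header).

```python
# User-agent tokens as listed in the report, mapped to hypothetical labels.
NAMED = [
    "googlebot", "googlebot-image", "mediapartners-google", "googlebot-news",
    "googlebot-video", "google-extended", "adsbot-google",
    "adsbot-google-mobile", "adsbot-google-mobile-apps", "bingbot",
    "adidxbot", "duckduckbot", "slurp", "oncrawl", "qwantify", "pinterest",
    "ahrefsbot", "ahrefssiteaudit", "semrushbot", "barkrowler",
    "chatgpt-user", "gptbot",
]
DISCOVER = ["feedfetcher-google", "googleproducer"]

GROUP_OF = {token: "named crawlers" for token in NAMED}
GROUP_OF.update({token: "discover" for token in DISCOVER})


def select_group(product_token: str) -> str:
    """Return the label of the group that applies to a crawler.

    Matching is case-insensitive; unknown crawlers fall back to "*".
    """
    return GROUP_OF.get(product_token.lower(), "default (*)")


print(select_group("Googlebot"))
print(select_group("ExampleBot"))  # hypothetical crawler not in the file
```

A crawler that lands in the default group is subject to "Disallow /", so only the crawlers named in the other two groups are given access.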

Other Records

Field Value
sitemap https://www.crocieraonline.com/sitemaps/it/sitemap-index.xml

Comments

  • Block access to all UA by default
  • Disallow miscellaneous page pattern
  • Allow access to specific UA
  • Disallow Blog URL pattern
  • Allow Discover
  • Allow images and special medias