cruiseturtle.com
robots.txt

Robots Exclusion Standard data for cruiseturtle.com

Resource Scan

Scan Details

Site Domain cruiseturtle.com
Base Domain cruiseturtle.com
Scan Status Ok
Last Scan2026-01-14T00:34:13+00:00
Next Scan 2026-01-21T00:34:13+00:00

Last Scan

Scanned2026-01-14T00:34:13+00:00
URL https://cruiseturtle.com/robots.txt
Redirect https://www.cruiseturtle.com/robots.txt
Redirect Domain www.cruiseturtle.com
Redirect Base cruiseturtle.com
Domain IPs 51.77.240.240
Redirect IPs 51.77.240.240
Response IP 51.77.240.240
Found Yes
Hash b6429885f83827e88fae1418cc48e01fbf9520b321bca6ea8d87d69c77c6f367
SimHash 7014c35049f5

Groups

*

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Allow /*.webp*
Disallow /administrator/
Disallow /cli/
Disallow /installation/
Disallow /language/
Disallow /logs/
Disallow /tmp/
Allow /components/com_bagallery/
Disallow /*84-news
Disallow /*77-kreuzfahrt-ziele
Disallow /*78-kreuzfahrt-schiffe
Disallow /*104-reisetagebuch
Disallow /*91-reisetagebuch
Disallow /*93-reisetagebuch
Disallow /*87-kreuzfahrt-angebote
Disallow /*77-news
Disallow /*?
Allow /kreuzfahrt-news*?
Allow /index.php?option*

Other Records

Field Value
sitemap https://www.cruiseturtle.com/index.php?option=com_jmap&view=sitemap&format=xml
sitemap https://www.cruiseturtle.com/index.php?option=com_jmap&view=sitemap&format=images
sitemap https://www.cruiseturtle.com/index.php?option=com_jmap&view=sitemap&format=gnews
sitemap https://www.cruiseturtle.com/index.php?option=com_jmap&view=sitemap&format=hreflang

Comments

  • Disallow: /cache/
  • Disallow: /includes/
  • Disallow: /libraries/
  • Bot aussperren
  • User-agent: Amazonbot
  • User-agent: Anthropic-ai
  • User-agent: Applebot-Extended
  • User-agent: AwarioRssBot
  • User-agent: AwarioSmartBot
  • User-agent: Bytespider
  • User-agent: CCBot
  • User-agent: ChatGPT-User
  • User-agent: ClaudeBot
  • User-agent: Claude-Web
  • User-agent: Cohere-ai
  • User-agent: DataForSeoBot
  • User-agent: FacebookBot
  • User-agent: Google-Extended
  • User-agent: GPTBot
  • User-agent: ImagesiftBot
  • User-agent: Magpie-crawler
  • User-agent: Omgili
  • User-agent: Omgilibot
  • User-agent: Peer39_crawler
  • User-agent: Peer39_crawler/1.0
  • User-agent: PerplexityBot
  • User-agent: YouBot
  • Disallow: /
  • JSitemap entries
  • alle Urls mit ? nicht indexieren, ausser alle kreuzfahrt-news?