griechenland.net
robots.txt

Robots Exclusion Standard data for griechenland.net

Resource Scan

Scan Details

Site Domain griechenland.net
Base Domain griechenland.net
Scan Status Ok
Last Scan 2024-11-14T22:54:44+00:00
Next Scan 2024-11-21T22:54:44+00:00

Last Scan

Scanned 2024-11-14T22:54:44+00:00
URL https://griechenland.net/robots.txt
Redirect https://www.griechenland.net/robots.txt
Redirect Domain www.griechenland.net
Redirect Base griechenland.net
Domain IPs 2406:da18:9d0:143f:2124:4e9c:36a9:d9de, 52.221.42.138
Redirect IPs 104.21.37.219, 172.67.213.156, 2606:4700:3032::ac43:d59c, 2606:4700:3034::6815:25db
Response IP 104.21.37.219
Found Yes
Hash 23e7ffc63f7b9d7d97bfb24fd9d4725371c2271bea6b79977b998d84712b7831
SimHash 83169459c2c4
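
The Hash field lets a scanner tell whether the file changed between two scans without storing the whole body. The report does not name the algorithm; the 64 hexadecimal digits are consistent with SHA-256, which is what this minimal sketch assumes. The helper names `content_hash` and `has_changed` are hypothetical and do not come from the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_hash(body) != previous_hash


# Hypothetical usage with an in-memory body instead of a real fetch.
old_body = b"User-agent: *\nDisallow: /tmp/\n"
stored = content_hash(old_body)
unchanged = has_changed(stored, old_body)
edited = has_changed(stored, old_body + b"Disallow: /logs/\n")
```

An exact digest only answers "identical or not". The separate SimHash field presumably serves near-duplicate detection, which this sketch does not attempt.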

Groups

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

adidxbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingpreview

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

duckduckbot-https

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

trendictionbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

baiduspider
yisouspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

sentibot

Rule Path
Disallow /

petalbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
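
The per-crawler groups above fall into two kinds: groups with only a crawl-delay (all paths allowed) and groups with a blanket Disallow. As a rough illustration, the sketch below feeds a small reconstruction of three of those groups to Python's standard `urllib.robotparser`. The excerpt text is an assumption rebuilt from this report; the exact wording and letter case of the live file may differ.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): three of the groups listed in the report.
ROBOTS_EXCERPT = """\
User-agent: bingbot
Crawl-delay: 10

User-agent: semrushbot
Disallow: /

User-agent: baiduspider
User-agent: yisouspider
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# A group with no rules allows every path but still carries a delay.
bing_allowed = parser.can_fetch("bingbot", "/any/page")
bing_delay = parser.crawl_delay("bingbot")

# "Disallow: /" blocks the whole site for that crawler.
semrush_allowed = parser.can_fetch("semrushbot", "/any/page")

# Two User-agent lines in a row share the rules that follow them.
yisou_allowed = parser.can_fetch("yisouspider", "/")
```

Under these assumptions `bing_allowed` is true with a delay of 10 seconds, while the other two crawlers are refused. Crawl-delay is a non-standard extension, so each crawler decides whether to honor it.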

*

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Allow /images/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /logs/
Allow /media/
Disallow /tmp/
Disallow /index.php?option=com_jmap&view=sitemap&format=gnews
Disallow /index.php?option=com_jmap&view=sitemap
Allow /plugins/system/lazyloadforjoomla/assets/images/blank.gif
Allow /plugins/system/maximenuckmobile/assets/maximenuckmobile.js
Allow /plugins/system/maximenuckmobile/themes/default/maximenuckmobile.css
Allow /modules/mod_maximenuck/themes/default/css/maximenuck.php?monid=maximenunews
Allow /modules/mod_maximenuck/assets/maximenuresponsiveck.css
Allow /modules/mod_maximenuck/themes/default/css/maximenuck.php?monid=maximenutop
Allow /modules/mod_responsivebannerslider/assets/modernizr.min.js
Allow /modules/mod_responsivebannerslider/assets/adaptor/box-slider-all.jquery.min.js
Allow /components/com_k2/js/
Allow /modules/mod_news_pro_gk5/cache/
Allow /modules/mod_b2j_k2_calendar/tmpl/images/
Allow /modules/mod_pagepeel_banner/assets/
Allow /modules/mod_weather_gk4/icons/meteocons_light/
Allow /modules/mod_randompoll/assets/
Allow /modules/mod_news_pro_gk5/portal_modes/news_gallery/images/

Other Records

Field Value
sitemap https://www.griechenland.net/component/jmap/sitemap/xml
sitemap https://www.griechenland.net/component/jmap/sitemap/gnews
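
The `*` group mixes wildcard Allow rules such as `/*.css*` with plain prefix Disallow rules. How they interact depends on the matcher. The sketch below assumes the longest-match rule described in RFC 9309 (the matching pattern with the most characters wins, and Allow wins a tie); individual crawlers may behave differently. The function names and the shortened rule list are illustrative, not taken from the site.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern: '*' matches anything, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Apply the longest matching rule; a path that matches no rule is allowed."""
    best = None  # (pattern length, rule is an Allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# Shortened subset of the '*' group from the report.
RULES = [
    ("allow", "/*.js*"),
    ("allow", "/*.css*"),
    ("disallow", "/administrator/"),
    ("allow", "/images/"),
    ("disallow", "/tmp/"),
]

css_ok = is_allowed(RULES, "/templates/site/style.css")
admin_ok = is_allowed(RULES, "/administrator/index.php")
admin_css_ok = is_allowed(RULES, "/administrator/templates/admin.css")
article_ok = is_allowed(RULES, "/some/article")
```

One consequence of longest-match: `/administrator/templates/admin.css` stays blocked, because `/administrator/` (15 characters) is longer than `/*.css*` (7 characters). The short wildcard Allow rules therefore do not open up stylesheets inside disallowed folders.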

Comments

  • If the Joomla site is installed within a folder, e.g. at www.example.com/joomla/, the robots.txt file MUST be moved to the site root, e.g. www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to the disallowed path; e.g. the Disallow rule for the /administrator/ folder MUST be changed to read Disallow: /joomla/administrator/
  • Disallow: /
  • Crawl-delay: 10
  • Disallow: /
  • test if delay works
  • JSitemap entries
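
The first comment describes a mechanical rewrite: when Joomla lives in a subfolder, every rule path gets the folder name prefixed. A minimal sketch of that step follows. The function `prefix_rules` is hypothetical, and applying the prefix to Allow rules as well as Disallow rules is an assumption; the comment itself only mentions disallowed paths.

```python
def prefix_rules(lines, folder):
    """Prefix Allow/Disallow paths with a subfolder name; leave other lines untouched."""
    prefix = "/" + folder.strip("/")
    rewritten = []
    for line in lines:
        field, sep, value = line.partition(":")
        name, path = field.strip(), value.strip()
        # Only rewrite rule lines whose value is a site-relative path.
        if sep and name.lower() in ("allow", "disallow") and path.startswith("/"):
            rewritten.append(f"{name}: {prefix}{path}")
        else:
            rewritten.append(line)
    return rewritten


original = [
    "User-agent: *",
    "Disallow: /administrator/",
    "Allow: /images/",
    "Crawl-delay: 10",
]
moved = prefix_rules(original, "joomla")
```

With the folder name `joomla` from the comment's own example, the `/administrator/` rule becomes `Disallow: /joomla/administrator/`, while the User-agent and Crawl-delay lines pass through unchanged.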

Warnings

  • 1 invalid line.