zesox.de
robots.txt

Robots Exclusion Standard data for zesox.de

Resource Scan

Scan Details

Site Domain zesox.de
Base Domain zesox.de
Scan Status Ok
Last Scan5/15/2025, 10:16:54 PM
Next Scan 5/22/2025, 10:16:54 PM

Last Scan

Scanned5/15/2025, 10:16:54 PM
URL https://zesox.de/robots.txt
Redirect https://www.zesox.de/robots.txt
Redirect Domain www.zesox.de
Redirect Base zesox.de
Domain IPs 104.21.34.211, 172.67.165.146, 2606:4700:3032::ac43:a592, 2606:4700:3034::6815:22d3
Redirect IPs 104.21.34.211, 172.67.165.146, 2606:4700:3032::ac43:a592, 2606:4700:3034::6815:22d3
Response IP 104.21.34.211
Found Yes
Hash a716dc50e0615ff665b1e9121102d22eacdc7f7e5508862efbfb0dc5e34660e5
SimHash 201a8d5067ad

Groups

*

Rule Path
Disallow /news/cgi-bin/
Disallow /news/wp-admin/
Disallow /news/wp-includes/
Disallow /news/wp-content/themes/
Disallow /news/wp-content/plugins/
Disallow /news/trackback/
Disallow /news/*?*
Disallow /news/*/trackback/

googlebot

Rule Path
Disallow /news/*.php$
Disallow /news/*.js$
Disallow /news/*.inc$
Disallow /news/*.css$
Disallow /news/*.gz$
Disallow /news/*.cgi$
Disallow /news/*.wmv$
Disallow /news/*.cgi$
Disallow /news/*.xhtml$
Disallow /news/*.php*
Disallow /news*/trackback*
Disallow /news/*?*
Disallow /news/feed/
Disallow /news/wp-*
Allow /news/wp-content/uploads/
Allow /news/wp-content/angelbilder/

googlebot-image

Rule Path
Allow /news/*

mediapartners-google*

Rule Path
Disallow /news/*?*
Allow /news/wp-content/
Allow /news/tag/
Allow /news/category/
Allow /news/*.php$
Allow /news/*.js$
Allow /news/*.inc$
Allow /news/*.css$
Allow /news/*.gz$
Allow /news/*.cgi$
Allow /news/*.wmv$
Allow /news/*.cgi$
Allow /news/*.xhtml$
Allow /news/*.php*
Allow /news/*.gif$
Allow /news/*.jpg$
Allow /news/*.png$

*

Rule Path
Disallow /admin/
Disallow /core/
Disallow /tmp/
Disallow /views/
Disallow /setup/
Disallow /log/
Disallow /newsletter/
Disallow /en/newsletter/
Disallow /index.php?cl=newsletter
Disallow /agb/
Disallow /en/terms/
Disallow /warenkorb/
Disallow /en/cart/
Disallow /index.php?cl=basket
Disallow /mein-konto/
Disallow /en/my-account/
Disallow /index.php?cl=account
Disallow /mein-merkzettel/
Disallow /en/my-wishlist/
Disallow /index.php?cl=account_noticelist
Disallow /mein-wunschzettel/
Disallow /en/my-gift-registry/
Disallow /index.php?cl=account_wishlist
Disallow /konto-eroeffnen/
Disallow /en/open-account/
Disallow /index.php?cl=register
Disallow /passwort-vergessen/
Disallow /en/forgot-password/
Disallow /index.php?cl=forgotpwd
Disallow /index.php?cl=moredetails
Disallow /index.php?cl=review
Disallow /index.php?cl=search
Disallow /EXCEPTION_LOG.txt
Disallow /*?cl=newsletter
Disallow /*%26cl%3Dnewsletter
Disallow /*?cl=basket
Disallow /*%26cl%3Dbasket
Disallow /*?cl=account
Disallow /*%26cl%3Daccount
Disallow /*?cl=account_noticelist
Disallow /*%26cl%3Daccount_noticelist
Disallow /*?cl=account_wishlist
Disallow /*%26cl%3Daccount_wishlist
Disallow /*?cl=register
Disallow /*%26cl%3Dregister
Disallow /*?cl=forgotpwd
Disallow /*%26cl%3Dforgotpwd
Disallow /*?cl=moredetails
Disallow /*%26cl%3Dmoredetails
Disallow /*?cl=review
Disallow /*%26cl%3Dreview
Disallow /*?cl=search
Disallow /*%26cl%3Dsearch
Disallow /*%26fnc%3Dtobasket
Disallow /*%26fnc%3Dtocomparelist
Disallow /*%26addcompare%3D
Disallow /*/sid/
Disallow /*?sid=
Disallow /*%26sid%3D
Disallow /*?cur=
Disallow /*%26cur

waybackmachine

Rule Path
Disallow /

iaarchiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.zesox.de/news/sitemap_index.xml
sitemap http://www.zesox.de/sitemaps/sitemap-index.xml

Comments

  • robots.txt - zesox
  • Monday, 10.09.2012
  • access allow
  • ---------------------------------------------------------
  • N E W S
  • ---------------------------------------------------------
  • disallow all files in these directories
  • disallow all files ending with these extensions
  • allow google image bot to search all images
  • allow adsense bot on entire site
  • ---------------------------------------------------------
  • S H O P
  • ---------------------------------------------------------
  • wildcards at the end, because of some crawlers see it as errors