voelkl.com
robots.txt

Robots Exclusion Standard data for voelkl.com

Resource Scan

Scan Details

Site Domain voelkl.com
Base Domain voelkl.com
Scan Status Ok
Last Scan2026-01-03T23:59:00+00:00
Next Scan 2026-02-02T23:59:00+00:00

Last Scan

Scanned2026-01-03T23:59:00+00:00
URL https://voelkl.com/robots.txt
Domain IPs 216.24.57.251, 216.24.57.7
Response IP 216.24.57.7
Found Yes
Hash d1ee0edec13079cc1f0e6c69eebac58c98a6b6e012d7ae0db06af15af7b52a78
SimHash 666e9d7004f1

Groups

*

Rule Path
Disallow *.php$
Disallow *.php?*
Disallow /*/video
Disallow /video
Disallow /*/wp-content
Disallow /wp-content
Disallow /*/api
Disallow /api
Disallow /en-ru
Disallow /ru-ru
Disallow /en-sa
Disallow /en-ue
Disallow /en-kg
Disallow /en-mx

Other Records

Field Value
crawl-delay 20

dotbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

springbot

Rule Path
Disallow /

mozilla/5.0 (compatible; mj12bot/v1.4.8; http://mj12bot.com/)

Rule Path
Disallow /

Other Records

Field Value
sitemap https://volkl.com/sitemap.xml

Comments

  • Volkl - robots.txt
  • Rules below should apply to all user-agents
  • Don't allow not existing files
  • Don't allow unused directories
  • Don't allow unused langs
  • Crawl Delay - <num> sec delay
  • Sitemaps
  • Block Bots