hypercipol.com
robots.txt

Robots Exclusion Standard data for hypercipol.com

Resource Scan

Scan Details

Site Domain hypercipol.com
Base Domain hypercipol.com
Scan Status Ok
Last Scan2025-07-03T19:58:29+00:00
Next Scan 2025-08-02T19:58:29+00:00

Last Scan

Scanned2025-07-03T19:58:29+00:00
URL https://hypercipol.com/robots.txt
Domain IPs 104.21.12.26, 172.67.151.83, 2606:4700:3035::ac43:9753, 2606:4700:3036::6815:c1a
Response IP 104.21.12.26
Found Yes
Hash bd15d51a1578935cbb9b39f8b55f0d7044c4ad2236367300d7887476fafe68f2
SimHash 24165ad2c1a1

Groups

*

Rule Path
Allow /
Disallow /danke.html
Allow /index.html
Allow /unternehmen.html
Allow /routen.html
Allow /tipps.html
Allow /geschichte.html
Allow /kontakt.html
Allow /datenschutz.html
Allow /agb.html
Allow /cookie-richtlinie.html
Allow /styles.css
Allow /script.js
Allow /*.svg
Allow /favicon.svg

Other Records

Field Value
crawl-delay 1

*

Rule Path
Allow /*.css$
Allow /*.js$

*

Rule Path
Allow /*.svg$
Allow /*.png$
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.gif$
Allow /*.webp$
Disallow /cgi-bin/
Disallow /tmp/
Disallow /private/
Disallow /admin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /*.php$
Allow /.well-known/

Other Records

Field Value
sitemap https://hypercipol.com/sitemap.xml

Comments

  • Sitemap
  • Block access to certain files
  • Allow search engines to crawl main content
  • Allow legal pages
  • Allow static resources
  • Crawl delay (be respectful)
  • Cache policy suggestions
  • CSS and JS files
  • Images
  • Block common bot traps and unnecessary paths
  • Allow well-known paths
  • Host directive

Warnings

  • `host` is not a known field.