v-h.media
robots.txt

Robots Exclusion Standard data for v-h.media

Resource Scan

Scan Details

Site Domain v-h.media
Base Domain v-h.media
Scan Status Ok
Last Scan2026-02-08T22:04:52+00:00
Next Scan 2026-02-22T22:04:52+00:00

Last Scan

Scanned2026-02-08T22:04:52+00:00
URL https://v-h.media/robots.txt
Domain IPs 104.21.76.182, 172.67.198.105, 2606:4700:3034::6815:4cb6, 2606:4700:3036::ac43:c669
Response IP 104.21.76.182
Found Yes
Hash a75b90ae8f66ca4dd54872e199a592bd15f66ed8f486d1dfb6b3c7a8d5fc245f
SimHash ca1a495f0bd0

Groups

*

Rule Path
Disallow /filestore

Other Records

Field Value
crawl-delay 10

Comments

  • Sample robots.txt file - ensures that a Google Appliance can still access the spider page (if configured)
  • and assumes an installation in the site root. For sites in a subfolder you must move the robots.txt file
  • to the site root and alter the paths accordingly.