media.empik.com
robots.txt

Robots Exclusion Standard data for media.empik.com

Resource Scan

Scan Details

Site Domain media.empik.com
Base Domain empik.com
Scan Status Ok
Last Scan 2024-05-14T14:58:05+00:00
Next Scan 2024-06-13T14:58:05+00:00

Last Scan

Scanned 2024-05-14T14:58:05+00:00
URL https://media.empik.com/robots.txt
Domain IPs 104.17.218.109, 104.17.219.109, 2606:4700::6811:da6d, 2606:4700::6811:db6d
Response IP 104.17.218.109
Found Yes
Hash ab5d6e44552f49bfde0187c679efd603b271d0ea2aa9b3756b53cbd22b2e8891
SimHash b83f431103d2

Groups

*

Rule Path
Disallow /
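
To illustrate what this group means in practice, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the group above (it is not the verbatim file), and the user-agent name "ExampleBot", the sample path, and the helper is_allowed are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scanned group: one wildcard group that
# disallows every path. This is an assumption, not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# "ExampleBot" and the path are hypothetical placeholders.
print(is_allowed(ROBOTS_TXT, "ExampleBot", "https://media.empik.com/any/path"))
# prints: False
```

Because the only rule is Disallow / under the * group, a compliant crawler should treat every URL on the host as off limits, whatever its user-agent string.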

Comments

  • This robots.txt file requests that search engines and other
    automated web-agents don't try to index the files in this
    directory (/www/). This file is required in the event that
    you use OpenX without virtual domains (i.e. you use a single
    URL to run both the admin interface and the delivery engine),
    and have the web root set to this directory.