/.well-known/

Log In Sign Up

media.empik.com
robots.txt

Robots Exclusion Standard data for media.empik.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	media.empik.com
Base Domain	empik.com
Scan Status	Ok
Last Scan	2024-10-11T15:56:58+00:00
Next Scan	2024-11-10T15:56:58+00:00

Last Scan

Scanned	2024-10-11T15:56:58+00:00
URL	https://media.empik.com/robots.txt
Domain IPs	104.17.218.109, 104.17.219.109, 2606:4700::6811:da6d, 2606:4700::6811:db6d
Response IP	104.17.219.109
Found	Yes
Hash	ab5d6e44552f49bfde0187c679efd603b271d0ea2aa9b3756b53cbd22b2e8891
SimHash	b83f431103d2

Groups

*

Rule

Path

Disallow

/

Back to top

Comments

This robots.txt file requests that search engines and other
automated web-agents don't try to index the files in this
directory (/www/). This file is required in the event that
you use OpenX without virtual domains (i.e. you use a single
URL to run both the admin interface and the delivery engine),
and have the web root set to this directory.

Back to top