/.well-known/

Log In Sign Up

media.economist.com
robots.txt

Robots Exclusion Standard data for media.economist.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	media.economist.com
Base Domain	economist.com
Scan Status	Ok
Last Scan	2024-05-08T09:19:21+00:00
Next Scan	2024-05-22T09:19:21+00:00

Last Scan

Scanned	2024-05-08T09:19:21+00:00
URL	https://media.economist.com/robots.txt
Domain IPs	54.192.18.10, 54.192.18.25, 54.192.18.7, 54.192.18.75
Response IP	13.33.88.94
Found	Yes
Hash	ee32285f64438ea2b739a7e8376f8142e2e5e0f70ac967e92c6a6ccdc8c6f35b
SimHash	381cb91a4774

Groups

*

Rule

Path

Allow

/sites/default/files

Disallow

/

Back to top

Comments

robots.txt
This file is to limit crawling to the files directory for varnish and cdn
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Directories

Back to top