media.guim.co.uk
robots.txt

Robots Exclusion Standard data for media.guim.co.uk

Resource Scan

Scan Details

Site Domain media.guim.co.uk
Base Domain guim.co.uk
Scan Status Ok
Last Scan 2024-05-22T05:15:10+00:00
Next Scan 2024-06-21T05:15:10+00:00

Last Scan

Scanned 2024-05-22T05:15:10+00:00
URL https://media.guim.co.uk/robots.txt
Domain IPs 151.101.1.111, 151.101.129.111, 151.101.193.111, 151.101.65.111, 2a04:4e42:200::367, 2a04:4e42:400::367, 2a04:4e42:600::367, 2a04:4e42::367
Response IP 199.232.45.111
Found Yes
Hash e8d6c28a704329b75541640fd629c10383bc5c77eefcd2256bad8c16d37e18da
SimHash 8804d800cbf3

Groups

twitterbot

Rule Path
Disallow (empty path)

*

Rule Path
Disallow /
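
The two groups above can be read as follows: an empty Disallow places no restriction on twitterbot, while every other crawler is blocked from the whole site. The sketch below illustrates this with Python's standard urllib.robotparser. The robots.txt text is an assumed reconstruction from the listed groups (the scan report does not include the raw file), and the sample URL and the "ExampleCrawler" agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the two groups listed in the report;
# the raw robots.txt body is not part of the scan data.
ROBOTS_TXT = """\
User-agent: twitterbot
Disallow:

User-agent: *
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URL, used only to exercise the rules.
SAMPLE_URL = "https://media.guim.co.uk/example.jpg"

# The twitterbot group has an empty Disallow, so nothing is blocked for it.
twitterbot_allowed = parser.can_fetch("Twitterbot", SAMPLE_URL)

# Any other agent falls through to the "*" group, which disallows "/".
other_allowed = parser.can_fetch("ExampleCrawler", SAMPLE_URL)

print(twitterbot_allowed, other_allowed)
```

Under these assumptions, the check should report that twitterbot may fetch the sample URL and that the other agent may not.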