static-cdn.ubuntu-de.org
robots.txt

Robots Exclusion Standard data for static-cdn.ubuntu-de.org

Resource Scan

Scan Details

Site Domain static-cdn.ubuntu-de.org
Base Domain ubuntu-de.org
Scan Status Ok
Last Scan 2024-10-27T19:50:38+00:00
Next Scan 2024-11-26T19:50:38+00:00

Last Scan

Scanned 2024-10-27T19:50:38+00:00
URL https://static-cdn.ubuntu-de.org/robots.txt
Domain IPs 2001:4dd0:f100:0:dead:beef:cafe:1, 87.79.26.37
Response IP 87.79.26.37
Found Yes
Hash 51d517a274bf4d00f29cd448a37c5688edcc7259091789064a15adee899dcdc5
SimHash 6c04de5086db
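
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch of checking a fetched copy against it, assuming the hash is SHA-256 over the raw response body (an assumption, as is the helper name `matches_recorded_hash`):

```python
import hashlib

# Hash value as recorded in the scan details above.
RECORDED_HASH = "51d517a274bf4d00f29cd448a37c5688edcc7259091789064a15adee899dcdc5"


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 hex digest of `body` equals `recorded`.

    Assumption: the scanner hashed the unmodified response bytes with
    SHA-256. The report itself does not state the algorithm.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

If the comparison fails for a freshly fetched file, the file may have changed since the scan, or the assumption about the algorithm or the hashed bytes may not hold.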

Groups

*

Rule Path
Disallow /search
Disallow /_image?
Disallow /Benutzer/
Disallow /users/
Disallow /*?flavour=mobile

Other Records

Field Value
crawl-delay 10

becomebot

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /
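
The groups above can be written back out as a robots.txt file and queried with Python's standard `urllib.robotparser`. This is a sketch only: the directive casing, ordering, and blank-line layout are reconstructed from the listing and are assumptions rather than the file as served, and `ExampleBot` is a hypothetical crawler name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above. Casing, ordering, and layout
# are assumptions; the served file may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /_image?
Disallow: /Benutzer/
Disallow: /users/
Disallow: /*?flavour=mobile
Crawl-delay: 10

User-agent: becomebot
Disallow: /

User-agent: yahoo! slurp
Disallow: /

User-agent: baiduspider
Disallow: /
"""

BASE = "https://static-cdn.ubuntu-de.org"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()

# "ExampleBot" is hypothetical and falls under the "*" group.
checks = [
    ("ExampleBot", "/index.html"),
    ("ExampleBot", "/search"),
    ("ExampleBot", "/users/alice"),
    ("baiduspider", "/index.html"),
]
for agent, path in checks:
    print(agent, path, parser.can_fetch(agent, BASE + path))

print("crawl delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
```

Note that `urllib.robotparser` matches rules by path prefix and does not interpret `*` as a wildcard, so the `/*?flavour=mobile` rule is not evaluated as a pattern here. A crawler that needs wildcard semantics would have to use a parser that supports them.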