aeg.de
robots.txt

Robots Exclusion Standard data for aeg.de

Resource Scan

Scan Details

Site Domain aeg.de
Base Domain aeg.de
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-09-04T09:42:18+00:00
Next Scan 2024-12-03T09:42:18+00:00

Last Successful Scan

Scanned2024-04-15T05:18:40+00:00
URL https://aeg.de/robots.txt
Redirect https://www.aeg.de/robots.txt
Redirect Domain www.aeg.de
Redirect Base aeg.de
Domain IPs 104.76.134.128, 2600:1413:b000:382::395d, 2600:1413:b000:384::395d
Redirect IPs 23.199.150.169
Response IP 104.76.129.42
Found Yes
Hash 0992247ea8a8e537b40a00ea250bbf50e64956ce78725116584bedbd0bb49040
SimHash a11d405440ba

Groups

gsa-crawler

Rule Path
Disallow

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /recycle-bin/
Disallow /util/
Disallow /secui/
Disallow /additions/
Disallow /templates/
Disallow /my-pages-container.*
Disallow /global/images/icons/
Disallow /localfiles/
Disallow /documents/
Disallow /pagefiles/
Disallow /global-pages/global-menu/store-locator/
Disallow /webresource.*
Disallow /node*.*
Disallow /find/
Disallow /compare/
Disallow /link/
Disallow /overlays/online-retailer/
Disallow /downloadpdf/

Other Records

Field Value
sitemap https://www.aeg.de/globalassets/sitemaps/www.aeg.de/sitemapindex.xml