europages.de
robots.txt

Robots Exclusion Standard data for europages.de

Resource Scan

Scan Details

Site Domain europages.de
Base Domain europages.de
Scan Status Ok
Last Scan2024-09-19T23:41:28+00:00
Next Scan 2024-09-26T23:41:28+00:00

Last Scan

Scanned2024-09-19T23:41:28+00:00
URL https://europages.de/robots.txt
Redirect https://www.europages.de:443/robots.txt
Redirect Domain www.europages.de
Redirect Base europages.de
Domain IPs 18.159.30.51, 3.67.217.179, 52.58.21.186
Redirect IPs 65.9.112.126, 65.9.112.86, 65.9.112.88, 65.9.112.92
Response IP 52.85.49.42
Found Yes
Hash 9c73935eb07d595f74af9a1b3abb22244d870071ab53702105704b2b0178ad04
SimHash a7762c70e292

Groups

*

Rule Path
Disallow */captcha/*
Disallow */myEuropages/*
Disallow */dataviz/*
Disallow */pdf/*
Disallow */1-10/*
Disallow */11-50/*
Disallow */51-100/*
Disallow */101-200/*
Disallow */201-500/*
Disallow */%3E500/*
Disallow */favicon.ico
Disallow *Vcard.html
Disallow */taxonomy/*
Disallow */ss-*
Disallow */karte.html
Disallow */cartographie*
Disallow */Dokumentation.html
Disallow */undefined
Disallow */bch-*
Disallow */bcg-*
Disallow */bcc-*
Disallow */bcca-*
Disallow */bci-*
Disallow */psrw/*
Disallow */did-*
Disallow *businesscard/pages/snippets/*
Disallow */c/*.html
Disallow */h/*.html
Disallow */b/*.html
Disallow /account/*
Disallow /contact/*
Disallow /help*
Disallow /shortlist*
Disallow *?optimize=*
Disallow *?qs=*
Disallow */cat-1-*
Disallow */unverified/*
Disallow */verified/*
Disallow */map/*
Disallow */bs/*/map
Disallow */pg-*

mediapartners-google

Rule Path
Disallow

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europages.de/businessSectorsSitemap.xml
sitemap https://www.europages.de/serpCompaniesSitemap.xml