europages.lt
robots.txt

Robots Exclusion Standard data for europages.lt

Resource Scan

Scan Details

Site Domain europages.lt
Base Domain europages.lt
Scan Status Ok
Last Scan2024-06-24T00:49:38+00:00
Next Scan 2024-07-01T00:49:38+00:00

Last Scan

Scanned2024-06-24T00:49:38+00:00
URL https://europages.lt/robots.txt
Redirect https://www.europages.lt:443/robots.txt
Redirect Domain www.europages.lt
Redirect Base europages.lt
Domain IPs 18.184.188.62, 18.194.143.203, 18.194.168.27
Redirect IPs 18.173.121.124, 18.173.121.52, 18.173.121.64, 18.173.121.71
Response IP 18.165.171.93
Found Yes
Hash 1f31a94c3270f61ec459c6fe14293150eb955376bff3465cd7609820190fc8a4
SimHash a3760e70e2b2

Groups

*

Rule Path
Disallow */captcha/*
Disallow */myEuropages/*
Disallow */dataviz/*
Disallow */pdf/*
Disallow */1-10/*
Disallow */11-50/*
Disallow */51-100/*
Disallow */101-200/*
Disallow */201-500/*
Disallow */%3E500/*
Disallow */favicon.ico
Disallow *Vcard.html
Disallow */taxonomy/*
Disallow */ss-*
Disallow */zemelapis.html
Disallow */cartographie*
Disallow */dokumentai.html
Disallow */undefined
Disallow */bch-*
Disallow */bcg-*
Disallow */bcc-*
Disallow */bcca-*
Disallow */bci-*
Disallow */psrw/*
Disallow */did-*
Disallow *businesscard/pages/snippets/*
Disallow */c/*.html
Disallow */h/*.html
Disallow */b/*.html
Disallow /account/*
Disallow /contact/*
Disallow /help*
Disallow /shortlist*
Disallow *?optimize=*
Disallow *?qs=*
Disallow */cat-1-*
Disallow */unverified/*
Disallow */verified/*
Disallow */map/*
Disallow */bs/*/map
Disallow */pg-*

mediapartners-google

Rule Path
Disallow

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europages.lt/businessSectorsSitemap.xml
sitemap https://www.europages.lt/serpCompaniesSitemap.xml