europages.info
robots.txt

Robots Exclusion Standard data for europages.info

Resource Scan

Scan Details

Site Domain europages.info
Base Domain europages.info
Scan Status Ok
Last Scan2024-06-24T02:09:10+00:00
Next Scan 2024-07-24T02:09:10+00:00

Last Scan

Scanned2024-06-24T02:09:10+00:00
URL https://europages.info/robots.txt
Redirect https://www.europages.info:443/robots.txt
Redirect Domain www.europages.info
Redirect Base europages.info
Domain IPs 18.184.188.62, 18.194.143.203, 18.194.168.27
Redirect IPs 18.173.121.124, 18.173.121.52, 18.173.121.64, 18.173.121.71
Response IP 18.165.171.22
Found Yes
Hash 244cfc3f7d079f88c385790269cea8848ed8e591cc7e0cec6f7f97775913d2c5
SimHash a3766c70e2b2

Groups

*

Rule Path
Disallow */captcha/*
Disallow */myEuropages/*
Disallow */dataviz/*
Disallow */pdf/*
Disallow */1-10/*
Disallow */11-50/*
Disallow */51-100/*
Disallow */101-200/*
Disallow */201-500/*
Disallow */%3E500/*
Disallow */favicon.ico
Disallow *Vcard.html
Disallow */taxonomy/*
Disallow */ss-*
Disallow */karta.html
Disallow */cartographie*
Disallow */undefined
Disallow */bch-*
Disallow */bcg-*
Disallow */bcc-*
Disallow */bcca-*
Disallow */bci-*
Disallow */psrw/*
Disallow */did-*
Disallow *businesscard/pages/snippets/*
Disallow */c/*.html
Disallow */h/*.html
Disallow */b/*.html
Disallow /account/*
Disallow /contact/*
Disallow /help*
Disallow /shortlist*
Disallow *?optimize=*
Disallow *?qs=*
Disallow */cat-1-*
Disallow */unverified/*
Disallow */verified/*
Disallow */map/*
Disallow */bs/*/map
Disallow */pg-*

mediapartners-google

Rule Path
Disallow

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europages.info/businessSectorsSitemap.xml
sitemap https://www.europages.info/serpCompaniesSitemap.xml

Warnings

  • 1 invalid line.