marc-cain.com
robots.txt

Robots Exclusion Standard data for marc-cain.com

Resource Scan

Scan Details

Site Domain marc-cain.com
Base Domain marc-cain.com
Scan Status Ok
Last Scan 2024-09-19T21:09:42+00:00
Next Scan 2024-10-19T21:09:42+00:00

Last Scan

Scanned 2024-09-19T21:09:42+00:00
URL https://marc-cain.com/robots.txt
Redirect https://www.marc-cain.com/robots.txt
Redirect Domain www.marc-cain.com
Redirect Base marc-cain.com
Domain IPs 104.24.40.8, 159.69.77.193, 159.69.96.138
Redirect IPs 104.27.194.88, 104.27.195.88
Response IP 104.27.194.88
Found Yes
Hash e6c96785e3eb5898e71935adb38114c21f4cbc8c133c7ecd2b5aee6f99b181ad
SimHash ac928d506bfd
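
The Hash field above can be used to detect whether robots.txt changed between scans. The report does not say which algorithm produced it; the 64 hexadecimal characters are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function names are hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Hex digest of the raw robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored fingerprint with the fingerprint of a fresh fetch."""
    return content_fingerprint(body) != previous_hash.lower()


# Illustrative body only, not the real file content.
sample = b"User-agent: *\nDisallow: /admin/\n"
stored = content_fingerprint(sample)
print(has_changed(stored, sample))                    # False
print(has_changed(stored, sample + b"Disallow: /x/\n"))  # True
```

Whether the scanner hashes the body before or after any normalization is not stated, so a locally computed digest may not reproduce the value listed above.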

Groups

*

Rule Path
Disallow /admin/
Disallow /Core/
Disallow /tmp/
Disallow /views/
Disallow /Setup/
Disallow /log/
Disallow /newsletter/
Disallow /en/newsletter/
Disallow /index.php?cl=newsletter
Disallow /AGB/
Disallow /en/Terms-and-Conditions/
Disallow /warenkorb/
Disallow /en/cart/
Disallow /index.php?cl=basket
Disallow /mein-konto/
Disallow /en/my-account/
Disallow /index.php?cl=account
Disallow /mein-merkzettel/
Disallow /en/my-wishlist/
Disallow /index.php?cl=account_noticelist
Disallow /mein-wunschzettel/
Disallow /en/my-gift-registry/
Disallow /index.php?cl=account_wishlist
Disallow /konto-eroeffnen/
Disallow /en/open-account/
Disallow /index.php?cl=register
Disallow /passwort-vergessen/
Disallow /en/forgot-password/
Disallow /index.php?cl=forgotpwd
Disallow /index.php?cl=moredetails
Disallow /index.php?cl=review
Disallow /index.php?cl=search
Disallow /EXCEPTION_LOG.txt
Disallow /*?cl=newsletter
Disallow /*%26cl%3Dnewsletter
Disallow /*?cl=basket
Disallow /*%26cl%3Dbasket
Disallow /*?cl=account
Disallow /*%26cl%3Daccount
Disallow /*?cl=account_noticelist
Disallow /*%26cl%3Daccount_noticelist
Disallow /*?cl=account_wishlist
Disallow /*%26cl%3Daccount_wishlist
Disallow /*?cl=register
Disallow /*%26cl%3Dregister
Disallow /*?cl=forgotpwd
Disallow /*%26cl%3Dforgotpwd
Disallow /*?cl=moredetails
Disallow /*%26cl%3Dmoredetails
Disallow /*?cl=review
Disallow /*%26cl%3Dreview
Disallow /*?cl=search
Disallow /*%26cl%3Dsearch
Disallow /*%26fnc%3Dtobasket
Disallow /*%26fnc%3Dtocomparelist
Disallow /*%26addcompare%3D
Disallow /*/sid/
Disallow /*?sid=
Disallow /*%26sid%3D
Disallow /*?cur=
Disallow /*%26cur
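
Many of the rules above rely on the `*` wildcard, which matches any sequence of characters. A minimal sketch of how a crawler might evaluate such patterns follows, assuming the common interpretation in which `*` matches anything and a trailing `$` anchors the end of the URL. The helper names are hypothetical, and the comparison is done on raw strings without percent-encoding normalization.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, disallow_rules: list[str]) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in disallow_rules)


# A few rules copied from the '*' group above.
RULES = [
    "/admin/",
    "/index.php?cl=basket",
    "/*?cl=basket",
    "/*%26cl%3Dbasket",
    "/*/sid/",
    "/*?cur=",
]

print(is_disallowed("/admin/login", RULES))                     # True
print(is_disallowed("/en/shop/?cl=basket", RULES))              # True
print(is_disallowed("/index.php?lang=1%26cl%3Dbasket", RULES))  # True
print(is_disallowed("/en/dresses/sid/abc123/", RULES))          # True
print(is_disallowed("/en/cart-help/", RULES))                   # False
```

The group contains no Allow rules, so the sketch does not need to resolve conflicts between Allow and Disallow; a fuller matcher would pick the longest matching rule.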

pinterestbot

No rules defined. All paths allowed.
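
Because pinterestbot has its own group with no rules, a crawler identifying as pinterestbot ignores the `*` group entirely and may fetch every path. A rough sketch of that group selection, assuming case-insensitive substring matching on the user-agent string; `select_group` and `ExampleBot` are hypothetical names, and the `*` rule list is abbreviated.

```python
def select_group(user_agent: str, groups: dict[str, list[str]]) -> list[str]:
    """Return the Disallow rules of the most specific matching group."""
    ua = user_agent.lower()
    # A named group wins over '*' when its token appears in the user agent.
    matches = [t for t in groups if t != "*" and t.lower() in ua]
    if matches:
        return groups[max(matches, key=len)]
    return groups.get("*", [])


GROUPS = {
    "*": ["/admin/", "/tmp/", "/log/"],  # abbreviated from the '*' group
    "pinterestbot": [],                   # no rules: all paths allowed
}

print(select_group("Pinterestbot/1.0", GROUPS))  # []
print(select_group("ExampleBot/2.0", GROUPS))    # ['/admin/', '/tmp/', '/log/']
```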

Other Records

Field Value
crawl-delay 0.2
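
The crawl-delay value asks crawlers to pause between requests; it is a widely used extension rather than part of the core standard, and it is usually read as seconds. A small sketch of honoring a fractional value such as 0.2 follows, assuming the unit is seconds. `parse_crawl_delay` and `Throttle` are hypothetical helpers.

```python
import math
import time
from typing import Optional


def parse_crawl_delay(value: str, default: float = 0.0) -> float:
    """Parse a crawl-delay value, falling back to default if unusable."""
    try:
        delay = float(value)
    except ValueError:
        return default
    return delay if math.isfinite(delay) and delay >= 0 else default


class Throttle:
    """Ensure at least `delay_seconds` pass between successive wait() calls."""

    def __init__(self, delay_seconds: float) -> None:
        self.delay = delay_seconds
        self._last: Optional[float] = None

    def wait(self) -> None:
        now = time.monotonic()
        if self._last is not None:
            remaining = self.delay - (now - self._last)
            if remaining > 0:
                time.sleep(remaining)
        self._last = time.monotonic()


throttle = Throttle(parse_crawl_delay("0.2"))
for path in ["/", "/en/", "/en/dresses/"]:  # illustrative paths only
    throttle.wait()  # blocks until 0.2 s have passed since the last request
    print("would fetch", path)
```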

Comments

  • Wildcard rules are placed at the end because some crawlers treat them as errors.