officeday.lt
robots.txt

Robots Exclusion Standard data for officeday.lt

Resource Scan

Scan Details

Site Domain officeday.lt
Base Domain officeday.lt
Scan Status Ok
Last Scan2024-06-24T02:03:08+00:00
Next Scan 2024-07-24T02:03:08+00:00

Last Scan

Scanned2024-06-24T02:03:08+00:00
URL https://officeday.lt/robots.txt
Redirect https://www.officeday.lt/robots.txt
Redirect Domain www.officeday.lt
Redirect Base officeday.lt
Domain IPs 104.26.8.207, 104.26.9.207, 172.67.69.75, 2606:4700:20::681a:8cf, 2606:4700:20::681a:9cf, 2606:4700:20::ac43:454b
Redirect IPs 104.26.8.207, 104.26.9.207, 172.67.69.75, 2606:4700:20::681a:8cf, 2606:4700:20::681a:9cf, 2606:4700:20::ac43:454b
Response IP 172.67.69.75
Found Yes
Hash 8d13dcdcc7e771c28e6c01a6b45ea90855ddd5c39b58d4f32f4abb481a45aaec
SimHash 2a9289f04bfc

Groups

*

Rule Path
Disallow /admin/
Disallow /core/
Disallow /tmp/
Disallow /views/
Disallow /setup/
Disallow /log/
Disallow /newsletter/
Disallow /en/newsletter/
Disallow /index.php?cl=newsletter
Disallow /agb/
Disallow /en/terms/
Disallow /warenkorb/
Disallow /en/cart/
Disallow /index.php?cl=basket
Disallow /mein-konto/
Disallow /en/my-account/
Disallow /index.php?cl=account
Disallow /mein-merkzettel/
Disallow /en/my-wishlist/
Disallow /index.php?cl=account_noticelist
Disallow /mein-wunschzettel/
Disallow /en/my-gift-registry/
Disallow /index.php?cl=account_wishlist
Disallow /konto-eroeffnen/
Disallow /en/open-account/
Disallow /index.php?cl=register
Disallow /passwort-vergessen/
Disallow /en/forgot-password/
Disallow /index.php?cl=forgotpwd
Disallow /index.php?cl=moredetails
Disallow /index.php?cl=review
Disallow /index.php?cl=search
Disallow /EXCEPTION_LOG.txt
Disallow /*?new_sorting=
Disallow /*%26new_sorting%3D
Disallow /*?sorting=
Disallow /*%26sorting%3D
Disallow /*?cl=newsletter
Disallow /*%26cl%3Dnewsletter
Disallow /*?cl=basket
Disallow /*%26cl%3Dbasket
Disallow /*?cl=account
Disallow /*%26cl%3Daccount
Disallow /*?cl=account_noticelist
Disallow /*%26cl%3Daccount_noticelist
Disallow /*?cl=account_wishlist
Disallow /*%26cl%3Daccount_wishlist
Disallow /*?cl=register
Disallow /*%26cl%3Dregister
Disallow /*?cl=forgotpwd
Disallow /*%26cl%3Dforgotpwd
Disallow /*?cl=moredetails
Disallow /*%26cl%3Dmoredetails
Disallow /*?cl=review
Disallow /*%26cl%3Dreview
Disallow /*?cl=search
Disallow /*%26cl%3Dsearch
Disallow /*%26fnc%3Dtobasket
Disallow /*%26fnc%3Dtocomparelist
Disallow /*%26addcompare%3D
Disallow /*/sid/
Disallow /*?sid=
Disallow /*%26sid%3D
Disallow /*?cur=
Disallow /*%26cur

Other Records

Field Value
crawl-delay 20

semrushbot

Rule Path
Disallow /

applebot
bingbot
msnbot
msnbot-media
adidxbot
bingpreview

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Comments

  • wildcards at the end, because of some crawlers see it as errors