marieclaire.kz
robots.txt

Robots Exclusion Standard data for marieclaire.kz

Resource Scan

Scan Details

Site Domain marieclaire.kz
Base Domain marieclaire.kz
Scan Status Ok
Last Scan 2024-11-03T15:56:46+00:00
Next Scan 2024-11-10T15:56:46+00:00

Last Scan

Scanned 2024-11-03T15:56:46+00:00
URL https://marieclaire.kz/robots.txt
Response IP 195.226.222.198
Found Yes
Hash 2837fd17b5ce161b26bd76a208e52a19eee3ff1394cfffa53a5d6854184c6a7d
SimHash 7919504783e7

Groups

mail.ru

Rule Path
Allow */yanews/
Allow /rss-feeds/
Allow /rss-feeds/yanews.xml

vkshare
vkrobot/1.0
facebookexternalhit
meta-externalagent

Rule Path
Allow /s/
Allow /rss-feeds/
Allow /rss-feeds/novapress-facebook.xml

yandexnews

Rule Path
Allow /rss-feeds/yanews.xml
Allow /rss/yandex/
Allow */?from=yanews
Allow */yanews/
Allow /rss-feeds/yanews-webmaster.xml

yandex

Rule Path
Allow /rss-feeds/yanews.xml
Allow /rss-feeds/yanews-webmaster.xml
Disallow *?
Disallow /useragreement/
Disallow /unsubscribe/
Disallow */yanews/
Disallow /pages/
Disallow /gm-api/
Disallow /81006599/
Disallow /22729373807/
Disallow /s/

*

Rule Path
Disallow *?
Disallow /useragreement/
Disallow /unsubscribe/
Disallow */yanews/
Disallow /pages/
Disallow /gm-api/
Disallow /81006599/
Disallow /22729373807/
Disallow /s/
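
The groups above mix plain path prefixes with wildcard patterns such as `*?` and `*/yanews/`. As a rough illustration of how such rules could be evaluated, here is a minimal Python sketch that assumes the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). Individual crawlers may resolve conflicts differently. The names `YANDEX_RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scanned file or of any library.

```python
import re

# Rules copied from the "yandex" group listed above: (rule, path pattern).
YANDEX_RULES = [
    ("allow", "/rss-feeds/yanews.xml"),
    ("allow", "/rss-feeds/yanews-webmaster.xml"),
    ("disallow", "*?"),
    ("disallow", "/useragreement/"),
    ("disallow", "/unsubscribe/"),
    ("disallow", "*/yanews/"),
    ("disallow", "/pages/"),
    ("disallow", "/gm-api/"),
    ("disallow", "/81006599/"),
    ("disallow", "/22729373807/"),
    ("disallow", "/s/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, best_rule = -1, "allow"
    for rule, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and rule == "allow"):
                best_len, best_rule = length, rule
    return best_rule == "allow"
```

Under these assumptions, `is_allowed("/rss-feeds/yanews.xml", YANDEX_RULES)` is true because only the Allow rule matches, while a path with a query string such as `/fashion/?page=2` is blocked by `*?`. Python's built-in `urllib.robotparser` does not interpret `*` wildcards, which is why the sketch does its own matching.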

Other Records

Field Value
sitemap https://marieclaire.kz/sitemaps/index.xml.gz
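
The sitemap record points to a gzip-compressed file. Assuming it is a sitemap index in the sitemaps.org 0.9 format (the file name suggests this, but the scan does not confirm it), the child sitemap URLs could be extracted from the downloaded bytes roughly as follows. The function name `sitemap_locations` is hypothetical, and fetching the file is left to the caller.

```python
import gzip
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org 0.9 schema.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_locations(gz_bytes):
    """Return the <loc> URLs from a gzip-compressed sitemap index.

    Assumption: `gz_bytes` is the raw content of a .xml.gz file whose
    root element is <sitemapindex> in the sitemaps.org namespace.
    """
    xml_bytes = gzip.decompress(gz_bytes)
    root = ET.fromstring(xml_bytes)
    return [
        loc.text.strip()
        for loc in root.findall("sm:sitemap/sm:loc", SITEMAP_NS)
        if loc.text
    ]
```

If the file turns out to be a plain URL set rather than an index, the same approach applies with `sm:url/sm:loc` as the search path.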