magazine.marieclaire.com.tw
robots.txt

Robots Exclusion Standard data for magazine.marieclaire.com.tw

Resource Scan

Scan Details

Site Domain magazine.marieclaire.com.tw
Base Domain marieclaire.com.tw
Scan Status Ok
Last Scan 2024-09-22T17:02:14+00:00
Next Scan 2024-10-22T17:02:14+00:00

Last Scan

Scanned 2024-09-22T17:02:14+00:00
URL https://magazine.marieclaire.com.tw/robots.txt
Domain IPs 35.241.47.28
Response IP 35.241.47.28
Found Yes
Hash 8890bee59d11840092c633989d7b1157da6ccdd0c170b2b444ee06723d2a990c
SimHash a8948c104493

Groups

*

Rule Path
Disallow /preview/
Disallow /admin/
Disallow /ap/
Disallow /etc/
Disallow /tmp/
Disallow /ADdemo/
Disallow /channel/demo_view/
Disallow /mobile/
Disallow /talk/view/
Disallow /slide_content/
Disallow /slide/
Disallow /slice/
Disallow /share/fb/
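
The rules above can be checked against a path with Python's standard `urllib.robotparser`. This is a minimal sketch: the robots.txt text is reconstructed from the rule table in this report (the original file's exact formatting is not shown here), and the `is_allowed` helper and the sample paths are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table in this report; the live file's
# exact layout and comments are an assumption.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /preview/",
    "Disallow: /admin/",
    "Disallow: /ap/",
    "Disallow: /etc/",
    "Disallow: /tmp/",
    "Disallow: /ADdemo/",
    "Disallow: /channel/demo_view/",
    "Disallow: /mobile/",
    "Disallow: /talk/view/",
    "Disallow: /slide_content/",
    "Disallow: /slide/",
    "Disallow: /slice/",
    "Disallow: /share/fb/",
]

BASE_URL = "https://magazine.marieclaire.com.tw"


def build_parser(lines):
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="*"):
    """Hypothetical helper: True if `agent` may fetch `path` on this site."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_LINES)
    # Sample paths are made up for illustration.
    for path in ("/admin/login", "/slide/1", "/fashion/"):
        print(path, is_allowed(parser, path))
```

Matching is by path prefix, so `/slide/1` falls under `Disallow /slide/`, while a path that matches no listed prefix is allowed by default.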

Other Records

Field Value
sitemap https://www.marieclaire.com.tw/sitemap.xml
sitemap https://www.marieclaire.com.tw/google-news.xml
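
The sitemap records listed above can be pulled out of a robots.txt body with a few lines of Python. This is a sketch under assumptions: `extract_sitemaps` is a hypothetical helper, the sample text is rebuilt from this report rather than copied from the live file, and the simple `#` comment stripping would truncate a URL that itself contains `#`.

```python
def extract_sitemaps(robots_text):
    """Return the values of all `Sitemap:` records, in file order."""
    urls = []
    for raw in robots_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Sample body rebuilt from this report (an assumption, not the live file).
SAMPLE = """\
User-agent: *
Disallow: /admin/
Sitemap: https://www.marieclaire.com.tw/sitemap.xml
Sitemap: https://www.marieclaire.com.tw/google-news.xml
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```

Sitemap records are not tied to any user-agent group, which is why the report lists them separately from the `*` group. Both sitemaps here point at `www.marieclaire.com.tw` rather than the scanned `magazine` subdomain.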