archi.net.tw
robots.txt

Robots Exclusion Standard data for archi.net.tw

Resource Scan

Scan Details

Site Domain archi.net.tw
Base Domain archi.net.tw
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-23T07:19:51+00:00
Next Scan 2025-12-22T07:19:51+00:00

Last Successful Scan

Scanned2024-03-02T23:55:43+00:00
URL https://archi.net.tw/robots.txt
Domain IPs 104.21.96.144, 172.67.182.94, 2606:4700:3031::ac43:b65e, 2606:4700:3032::6815:6090
Response IP 104.21.96.144
Found Yes
Hash a942201f5ffb1871c1d492388b4fab35a348859e3aaa4c284052dcf36cdbce84
SimHash ca3c68540f91

Groups

*

Rule Path
Disallow /vipweb_domain/cusid/*
Disallow /decomap/*
Disallow /pages/*
Disallow /sitemapxml/*
Disallow /tw/news/special-visit.asp$
Disallow /*/*.pdf$
Disallow /tw/inquiry/*
Disallow /tw/admincust/*
Disallow /tw/member/*
Disallow /tw/company/*/faq-list-1.html
Disallow /tw/company/*/faqadd.html

Other Records

Field Value
sitemap https://www.archi.net.tw/Sitemap.xml

Comments

  • 20230226
  • 20221004
  • 202212170721 Disallow: /.well-known/
  • 20221010 Disallow: /pages/keyword2/
  • 202211270823
  • 202212170721 Disallow: /tw/news/special-comp-*
  • 202212170721 Disallow: /tw/news/plot*
  • 202212091640
  • 202212170721 Disallow: /pages/faq/*
  • 202402262034O 20221004C Disallow: /*/*.pdf$