hacc.ctbc.edu.tw
robots.txt

Robots Exclusion Standard data for hacc.ctbc.edu.tw

Resource Scan

Scan Details

Site Domain hacc.ctbc.edu.tw
Base Domain ctbc.edu.tw
Scan Status Ok
Last Scan2026-02-22T08:21:39+00:00
Next Scan 2026-03-24T08:21:39+00:00

Last Scan

Scanned2026-02-22T08:21:39+00:00
URL https://hacc.ctbc.edu.tw/robots.txt
Domain IPs 43.254.17.40
Response IP 43.254.17.40
Found Yes
Hash d1249da1d498fca3ed109d830a0722312b0b901cfcecf8b35952fa3e7a112a0f
SimHash 5e245848e657

Groups

*

Rule Path
Disallow
Allow /wp-json/wp/v2/
Disallow /?s=
Disallow /*?replytocom=
Disallow /trackback/
Disallow /feed/
Disallow /*/feed/
Disallow /xmlrpc.php
Disallow /wp-content/uploads/wp-import-export-lite/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Allow /

Other Records

Field Value
sitemap https://hacc.ctbc.edu.tw/sitemap_index.xml

Comments

  • ================
  • Main Allow Rules
  • ================
  • 不需要被索引的 WordPress 系統頁
  • WP Import Export
  • ================
  • AI Crawlers Control
  • ================