hacc.ctbc.edu.tw
robots.txt

Robots Exclusion Standard data for hacc.ctbc.edu.tw

Resource Scan

Scan Details

Site Domain	hacc.ctbc.edu.tw
Base Domain	ctbc.edu.tw
Scan Status	Ok
Last Scan	2026-02-22T08:21:39+00:00
Next Scan	2026-03-24T08:21:39+00:00

Last Scan

Scanned	2026-02-22T08:21:39+00:00
URL	https://hacc.ctbc.edu.tw/robots.txt
Domain IPs	43.254.17.40
Response IP	43.254.17.40
Found	Yes
Hash	d1249da1d498fca3ed109d830a0722312b0b901cfcecf8b35952fa3e7a112a0f
SimHash	5e245848e657

Groups

*

Rule	Path
Disallow	(empty)
Allow	/wp-json/wp/v2/
Disallow	/?s=
Disallow	/*?replytocom=
Disallow	/trackback/
Disallow	/feed/
Disallow	/*/feed/
Disallow	/xmlrpc.php
Disallow	/wp-content/uploads/wp-import-export-lite/
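The rules in the `*` group mix plain prefixes with `*` wildcards, so it may help to see how a crawler could evaluate them. The following is a minimal sketch, assuming the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration, not part of the scanned site or of any library, and the empty Disallow is treated as matching nothing.

```python
import re

# Rules of the "*" group, in the order listed in the table above.
RULES = [
    ("disallow", ""),  # empty path: treated here as matching nothing
    ("allow", "/wp-json/wp/v2/"),
    ("disallow", "/?s="),
    ("disallow", "/*?replytocom="),
    ("disallow", "/trackback/"),
    ("disallow", "/feed/"),
    ("disallow", "/*/feed/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-content/uploads/wp-import-export-lite/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue  # skip the empty Disallow
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for sample in ["/about/", "/feed/", "/news/feed/", "/?s=admissions",
               "/post/?replytocom=5", "/wp-json/wp/v2/posts"]:
    print(sample, "allowed" if is_allowed(sample) else "blocked")
```

Real crawlers differ in how they handle edge cases such as percent-encoding and the empty Disallow, so this sketch only illustrates how the listed patterns behave, not how any particular crawler behaves.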

gptbot

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

google-extended

Rule	Path
Allow	/
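The per-agent groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is a reconstruction from the tables in this report, not the file as served: the `*` group is left out, and the agent names are written in lowercase as the scan lists them. `allowed_agents` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the per-agent groups in this report (assumption:
# the original file may differ in casing, ordering, and comments).
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: google-extended
Allow: /
"""

AGENTS = ["gptbot", "ccbot", "anthropic-ai", "chatgpt-user", "google-extended"]


def allowed_agents(robots_txt, agents, url):
    """Map each user agent to whether it may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


result = allowed_agents(ROBOTS_TXT, AGENTS, "https://hacc.ctbc.edu.tw/")
for agent, ok in result.items():
    print(f"{agent}: {'allowed' if ok else 'blocked'}")
```

To check the live file instead, `RobotFileParser` also offers `set_url()` followed by `read()`. Its matching is a simple first-match prefix check, so results for the wildcard rules in the `*` group may differ from crawlers that implement longest-match semantics.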

Other Records

Field	Value
sitemap	https://hacc.ctbc.edu.tw/sitemap_index.xml

Comments

================
Main Allow Rules
================
WordPress system pages that do not need to be indexed
WP Import Export
================
AI Crawlers Control
================
