cmchk.org.hk
robots.txt

Robots Exclusion Standard data for cmchk.org.hk

Resource Scan

Scan Details

Site Domain cmchk.org.hk
Base Domain cmchk.org.hk
Scan Status Ok
Last Scan2025-11-22T21:21:36+00:00
Next Scan 2025-12-22T21:21:36+00:00

Last Scan

Scanned2025-11-22T21:21:36+00:00
URL https://www.cmchk.org.hk/robots.txt
Domain IPs 216.6.5.49, 23.251.120.93, 2405:2000:2600:100::36, 2602:ffe4:401:1b::27
Response IP 23.251.120.93
Found Yes
Hash 20688201d7c0753f65466718e4c18621d6628cbf56d057c4050c43e712abdbe3
SimHash c85d571e8483

Groups

adsbot-google

Rule Path
Disallow /

webgains-bot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

python-requests

Rule Path
Disallow /pcm/chs/pcmtradeshow_search_data_2018.php

*

Rule Path
Disallow /*.htm$
Disallow /pcm/chi/*.htm$
Disallow /pcm/chs/*.htm$
Disallow /pcm/eng/*.htm$
Disallow /cmp/chi/*.htm$
Disallow /cmp/chs/*.htm$
Disallow /cmp/eng/*.htm$
Allow /pdf/*.pdf$

Other Records

Field Value
sitemap https://www.cmchk.org.hk/sitemaps.xml