hsbc.co.id
robots.txt

Robots Exclusion Standard data for hsbc.co.id

Resource Scan

Scan Details

Site Domain hsbc.co.id
Base Domain hsbc.co.id
Scan Status Ok
Last Scan2026-03-26T17:15:41+00:00
Next Scan 2026-04-25T17:15:41+00:00

Last Scan

Scanned2026-03-26T17:15:41+00:00
URL https://www.hsbc.co.id/robots.txt
Domain IPs 3.165.102.103, 3.165.102.47, 3.165.102.59, 3.165.102.83
Response IP 3.165.102.83
Found Yes
Hash 5108a245b51637000ed27449a93f53ec032a6540709d483285170f1834a1b2f8
SimHash 1bc418552493

Groups

*

Rule Path
Disallow /1/3/*
Disallow /clp/*
Disallow /messages/
Disallow /*.pdf$
Disallow /*ep_testing
Disallow /*ep_login
Disallow /ep_docs/*.pdf
Disallow /ep_internal/*.pdf
Disallow /*hsbc-token-testing*
Disallow /cms-dashboard*
Disallow /cms-admin*
Disallow /content/dam/hsbc/ar/promos*
Disallow /content/dam/hsbc/gr/promos*
Disallow /content/dam/hsbc/ca/promos*
Disallow /content/dam/hsbc/om/promos*
Disallow /content/dam/hsbc/am/promos*
Disallow /branch-login/*
Disallow /staff-emergency-comms/*.html
Disallow /remote-access-instructions/*.html
Disallow /remote-access/logon
Disallow /remote-access/*.html
Disallow /cgi-bin

Other Records

Field Value
sitemap https://www.hsbc.co.id/sitemaps.xml

Comments

  • Introduce Sitemaps