it-square.hk
robots.txt

Robots Exclusion Standard data for it-square.hk

Resource Scan

Scan Details

Site Domain it-square.hk
Base Domain it-square.hk
Scan Status Ok
Last Scan 2025-11-21T07:54:35+00:00
Next Scan 2025-12-21T07:54:35+00:00

Last Scan

Scanned 2025-11-21T07:54:35+00:00
URL https://it-square.hk/robots.txt
Domain IPs 13.33.45.2, 13.33.45.51, 13.33.45.54, 13.33.45.59
Response IP 13.33.45.51
Found Yes
Hash 1b9d4aaa0723cdf68a71f77f4a136ec4f240edcf7a88a6f6687a2beb9c0a74b8
SimHash 401b98f3e7b1
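
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a content fingerprint of this kind could be computed roughly as follows. The function name `robots_digest` is hypothetical, and the sample body is illustrative, not the file that was actually scanned.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative body only; the real file content is not reproduced here.
sample = b"User-agent: *\nAllow: /\n"
digest = robots_digest(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing digests between scans is a cheap way to tell whether the file changed at all. The separate SimHash field presumably serves to measure how similar two versions are.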

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

slurp

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /private/
Disallow /_next/
Disallow /test-data/
Allow /api/sitemap.xml
Allow /favicon.ico
Allow /*.css
Allow /*.js

Other Records

Field Value
crawl-delay 2
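
The slurp group mixes overlapping Allow and Disallow paths, so the result for a given URL depends on how a crawler resolves conflicts. The sketch below assumes the longest-match convention described in RFC 9309 (the most specific matching pattern wins, with ties going to Allow) and supports the `*` wildcard and a trailing `$`. Individual crawlers may behave differently, and the test paths are hypothetical examples, not URLs known to exist on the site.

```python
import re

# Rules for the slurp group, in the order listed in the report.
SLURP_RULES = [
    ("allow", "/"),
    ("disallow", "/admin/"),
    ("disallow", "/api/"),
    ("disallow", "/private/"),
    ("disallow", "/_next/"),
    ("disallow", "/test-data/"),
    ("allow", "/api/sitemap.xml"),
    ("allow", "/favicon.ico"),
    ("allow", "/*.css"),
    ("allow", "/*.js"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=SLURP_RULES):
    """Longest-match evaluation: the longest matching pattern decides."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Prefer longer patterns; on equal length, prefer allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical paths, used only to exercise the rules.
for p in ["/api/sitemap.xml", "/api/users", "/styles/main.css", "/_next/static/app.js"]:
    print(p, is_allowed(p))
```

Under this convention `/api/sitemap.xml` stays reachable despite `Disallow /api/`. A script under `/_next/` would still be blocked, because `/_next/` is a longer pattern than `/*.js`.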

gptbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

chatgpt-user

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

ccbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

google-extended

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

anthropic-ai

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

claude-web

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://it-square.hk/sitemap.xml
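
As a rough illustration of how a client might consume these groups, the sketch below feeds a partial reconstruction of the file to Python's standard `urllib.robotparser`. The reconstructed text is an assumption based on the groups listed above, not the original file, and covers only a few of them. Note that `urllib.robotparser` applies rules in file order and does not interpret wildcards, so it is used here only for the simple groups.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report (assumed layout, not the real file).
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Allow: /
Crawl-delay: 5

User-agent: SemrushBot
Disallow: /

Sitemap: https://it-square.hk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

site_root = "https://it-square.hk/"

# GPTBot may fetch the site but is asked to wait 5 seconds between requests.
print(parser.can_fetch("GPTBot", site_root), parser.crawl_delay("GPTBot"))

# SemrushBot is blocked from the whole site.
print(parser.can_fetch("SemrushBot", site_root))

# The sitemap record is not tied to any user-agent group (Python 3.8+).
print(parser.site_maps())
```

A polite crawler would check `can_fetch` before each request and sleep for the `crawl_delay` value, if any, between requests.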

Comments

  • Robots.txt for IT Square - Hong Kong Technology News
  • https://it-square.hk
  • Specific rules for major search engines
  • Block access to admin and private areas
  • Allow access to important files
  • Sitemap location
  • Additional directives for AI crawlers and LLMs
  • Block aggressive crawlers