hkapps.ai
robots.txt

Robots Exclusion Standard data for hkapps.ai

Resource Scan

Scan Details

Site Domain hkapps.ai
Base Domain hkapps.ai
Scan Status Ok
Last Scan2025-10-13T00:55:38+00:00
Next Scan 2025-10-20T00:55:38+00:00

Last Scan

Scanned2025-10-13T00:55:38+00:00
URL https://hkapps.ai/robots.txt
Domain IPs 104.21.86.11, 172.67.213.188, 2606:4700:3031::6815:560b, 2606:4700:3031::ac43:d5bc
Response IP 172.67.213.188
Found Yes
Hash 8a36fe51e0edb33c8dd02296202a1b5ea465c76cf71803c653b76c009f82c305
SimHash ef0b3a416c20

Groups

*

Rule Path
Allow /
Allow /about-hk-apps
Allow /hk-apps-services
Allow /hk-apps-products
Allow /hk-apps-blog
Allow /contact-hk-apps
Allow /careers-hk-apps
Allow /products/
Allow /ai-media-buying
Allow /marketing-automation
Allow /website-developement
Allow /mobile-app-development
Allow /programmatic-advertising
Allow /blog/
Disallow /admin/
Disallow /private/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /tmp/
Disallow /temp/
Disallow /search
Disallow /?s=
Disallow /*?search=
Disallow /*?filter=
Disallow /*?sort=
Disallow /thank-you
Disallow /confirmation
Disallow /success
Disallow /dev/
Disallow /staging/
Disallow /test/
Disallow /_dev/
Allow /assets/
Allow /css/
Allow /js/
Allow /images/
Allow *.css
Allow *.js
Allow *.png
Allow *.jpg
Allow *.jpeg
Allow *.gif
Allow *.webp
Allow *.svg
Allow *.ico

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

yandexbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

*

Rule Path
Disallow /api/
Disallow /.env
Disallow /config/
Disallow /database/

Other Records

Field Value
sitemap https://hkapps.ai/sitemap.xml

Comments

  • Robots.txt for HK Apps - Optimized for hkapps.ai
  • Updated: June 2, 2025
  • Allow all search engines to crawl
  • Specifically allow important pages for HK Apps ranking
  • Allow product pages
  • Allow blog for HK Apps content
  • Disallow admin and private areas
  • Disallow search and filter pages that could create duplicate content
  • Disallow thank you and confirmation pages
  • Disallow development and staging areas
  • Allow CSS, JS, and image files for proper rendering
  • Google-specific directives
  • Bing-specific directives
  • Yandex-specific directives
  • Prevent scraping of sensitive data
  • Sitemap location for Google Search Console
  • Host directive (preferred domain)

Warnings

  • `host` is not a known field.