healthcomp.com
robots.txt

Robots Exclusion Standard data for healthcomp.com

Resource Scan

Scan Details

Site Domain healthcomp.com
Base Domain healthcomp.com
Scan Status Ok
Last Scan 2025-10-24T05:21:12+00:00
Next Scan 2025-11-23T05:21:12+00:00

Last Scan

Scanned 2025-10-24T05:21:12+00:00
URL https://healthcomp.com/robots.txt
Domain IPs 35.215.101.48
Response IP 35.215.101.48
Found Yes
Hash a9fe8083e28669dae646d452ab5c741f4c63fc62a695f7ae06753c8847258030
SimHash 295a5840eef9

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/cache/
Disallow /wp-content/uploads/
Disallow /wp-json/
Disallow /cgi-bin/
Disallow /readme.html
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-login/
Disallow /wp-register.php
Disallow /wp-register/
Disallow /wp-activate.php
Disallow /wp-activate/
Disallow /author/
Disallow /comments/
Disallow /trackback/
Disallow /comment-page-
Disallow /category/
Disallow /tag/
Disallow */feed/
Disallow */comments/
Disallow */trackback/
Disallow */wp-
Disallow *?
Disallow /*?
Disallow /page/
Disallow /index.php
Disallow /search/
Disallow /archives/
Disallow /date/
Disallow /tag/
Disallow /author/
Disallow /calendar/
Disallow /attachment/
Disallow /gallery/
Disallow /wp-includes/
Allow /wp-content/uploads/
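
The group above lists /wp-content/uploads/ under both Disallow and Allow. As a rough sketch of how a crawler following RFC 9309 might resolve that conflict: the rule with the longest matching path wins, and Allow wins a tie. The function name is_allowed and the trimmed RULES list are illustrative assumptions, not part of the scan, and only plain prefix rules are handled here (no wildcards).

```python
# Illustrative subset of the rules in the group above (prefix rules only).
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/uploads/"),
    ("disallow", "/category/"),
    ("allow", "/wp-content/uploads/"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching rule wins; on equal length, allow beats disallow."""
    best_len = -1
    best_allow = True  # no matching rule means the path may be crawled
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


print(is_allowed("/wp-content/uploads/2024/logo.png"))  # True
print(is_allowed("/wp-admin/options.php"))               # False
print(is_allowed("/about/"))                             # True
```

Under this reading, the uploads directory would remain crawlable despite the earlier Disallow line. Crawlers that do not follow RFC 9309 may behave differently.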

*

Rule Path
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.xls$
Disallow /*.xlsx$
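
The rules in this group rely on two pattern extensions: * matches any run of characters and a trailing $ anchors the match at the end of the URL path. A minimal sketch of that matching, assuming RFC 9309 semantics, is shown here. The helper names rule_to_regex and matches, and the sample paths, are hypothetical and not taken from the scan.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a regular expression."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then join the pieces with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def matches(rule: str, path: str) -> bool:
    # re.match anchors at the start of the path, as robots.txt rules do.
    return rule_to_regex(rule).match(path) is not None


print(matches("/*.pdf$", "/docs/plan.pdf"))        # True
print(matches("/*.pdf$", "/docs/plan.pdf?dl=1"))   # False
print(matches("/*?", "/page/?s=claims"))           # True
print(matches("/*.docx$", "/forms/enroll.doc"))    # False
```

Because of the $ anchor, a rule such as /*.pdf$ would not cover a PDF URL that carries a query string. In this file such URLs would instead fall under the separate /*? rule in the first group.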

Other Records

Field Value
sitemap http://www.healthcomp.com/sitemap.xml

Comments

  • Block bots that do not support robots.txt extensions
  • Block certain bots from crawling
  • User-agent: BadBot1
  • User-agent: BadBot2
  • Disallow: /