helpmebreath.com
robots.txt

Robots Exclusion Standard data for helpmebreath.com

Resource Scan

Scan Details

Site Domain helpmebreath.com
Base Domain helpmebreath.com
Scan Status Ok
Last Scan2025-11-04T21:18:10+00:00
Next Scan 2025-11-11T21:18:10+00:00

Last Scan

Scanned2025-11-04T21:18:10+00:00
URL https://helpmebreath.com/robots.txt
Domain IPs 13.215.239.219, 52.74.6.109
Response IP 52.74.6.109
Found Yes
Hash c52b84f58b3e21628b17a778481c842b7cd46674ff44b068f100f1c4cfca87c9
SimHash 2d0168c56775

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /cgi-bin/
Disallow /tmp/
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://helpmebreath.com/sitemap.xml

Comments

  • robots.txt for https://helpmebreath.com/
  • -------------------------------------------------
  • 1. Default rules for *every* crawler
  • Block typical non-public areas (edit the paths to match your CMS)
  • If you use URL parameters for searches or filters you don’t want indexed,
  • uncomment the next line and adjust the pattern:
  • Disallow: /*?*sessionid=
  • 2. Allow everything else
  • 3. Point crawlers to your XML sitemap
  • 4. (Optional) Explicit directives for major bots
  • — Google
  • — Bing
  • -------------------------------------------------