pulsepoint.org
robots.txt

Robots Exclusion Standard data for pulsepoint.org

Resource Scan

Scan Details

Site Domain pulsepoint.org
Base Domain pulsepoint.org
Scan Status Ok
Last Scan 2025-07-02T19:49:17+00:00
Next Scan 2025-07-09T19:49:17+00:00

Last Scan

Scanned 2025-07-02T19:49:17+00:00
URL https://pulsepoint.org/robots.txt
Redirect https://www.pulsepoint.org/robots.txt
Redirect Domain www.pulsepoint.org
Redirect Base pulsepoint.org
Domain IPs 198.185.159.144
Redirect IPs 94.247.142.1
Response IP 94.247.142.1
Found Yes
Hash 4bf1f776790c48aed06ae878c046083bffd8843e5da062e4876caeeee738975e
SimHash c17099563f93

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
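
As a rough illustration of how a crawler might apply this group, the sketch below feeds the four rules to Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the records listed above, so the real file's exact wording may differ; the agent name `ExampleBot`, the helper `is_allowed`, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report (assumption: the
# live file may contain different spacing, ordering, or comments).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above.

    `ExampleBot` is a hypothetical user agent; it falls under the "*" group.
    """
    return parser.can_fetch(agent, "https://www.pulsepoint.org" + path)


if __name__ == "__main__":
    # Sample paths are illustrative, not taken from the scanned site.
    for path in ["/", "/about", "/vendor/autoload.php", "/.env", "/cache/page"]:
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

`urllib.robotparser` matches rules by path prefix, so any URL whose path begins with one of the four listed values is reported as disallowed.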

Other Records

Field Value
sitemap https://www.pulsepoint.org/sitemaps-1-sitemap.xml
sitemap https://www.near-registry.org/sitemaps-1-sitemap.xml
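
The two sitemap records point at different hosts. A minimal sketch, assuming Python 3.8 or later (for `RobotFileParser.site_maps()`), of how a consumer might read these records and flag any sitemap that lives outside the redirect domain; the variable names and the `Sitemap:` line formatting are illustrative assumptions.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in this report, written as robots.txt lines
# (assumption: exact capitalisation in the live file is unknown).
SITEMAP_LINES = [
    "Sitemap: https://www.pulsepoint.org/sitemaps-1-sitemap.xml",
    "Sitemap: https://www.near-registry.org/sitemaps-1-sitemap.xml",
]

# Host that served the file, taken from the "Redirect Domain" field.
SERVING_HOST = "www.pulsepoint.org"

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no sitemap records were found.
sitemaps = parser.site_maps() or []

# Sitemaps hosted somewhere other than the host that served robots.txt.
off_site = [url for url in sitemaps if urlsplit(url).hostname != SERVING_HOST]

if __name__ == "__main__":
    for url in sitemaps:
        label = "off-site" if url in off_site else "same host"
        print(f"{url} ({label})")
```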

Comments

  • robots.txt for https://www.pulsepoint.org/
  • live - don't allow web crawlers to index cpresources/ or vendor/