cancergeek.squarespace.com
robots.txt
Robots Exclusion Standard data for cancergeek.squarespace.com
Resource Scan
Scan Details
| Site Domain | cancergeek.squarespace.com |
| Base Domain | squarespace.com |
| Scan Status | Ok |
| Last Scan | 2025-10-24T18:22:18+00:00 |
| Next Scan | 2025-11-23T18:22:18+00:00 |
Last Scan
| Scanned | 2025-10-24T18:22:18+00:00 |
| URL | https://cancergeek.squarespace.com/robots.txt |
| Domain IPs | 198.185.159.176, 198.185.159.177, 198.49.23.176, 198.49.23.177 |
| Response IP | 198.49.23.177 |
| Found | Yes |
| Hash | fcacfd5aabe289e976bc0970a97a3bab4e421012ec7296e64c056017842b2914 |
| SimHash | 3c127760adda |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /display/admin/ |
| Disallow | /display/Search |
| Disallow | /display/Login |
| Disallow | /display/RecoverPassword |
| Disallow | /login |
| Disallow | /contributor |
| Disallow | /journal/category |
| Disallow | /journal/week |
| Disallow | /journal/month |
| Disallow | /journal/recommend |
| Disallow | /journal/author |
| Disallow | /cancergeek/category |
| Disallow | /cancergeek/week |
| Disallow | /cancergeek/month |
| Disallow | /cancergeek/recommend |
| Disallow | /cancergeek/author |
Comments