cancergeek.squarespace.com
robots.txt

Robots Exclusion Standard data for cancergeek.squarespace.com

Resource Scan

Scan Details

Site Domain cancergeek.squarespace.com
Base Domain squarespace.com
Scan Status Ok
Last Scan 2025-10-24T18:22:18+00:00
Next Scan 2025-11-23T18:22:18+00:00

Last Scan

Scanned 2025-10-24T18:22:18+00:00
URL https://cancergeek.squarespace.com/robots.txt
Domain IPs 198.185.159.176, 198.185.159.177, 198.49.23.176, 198.49.23.177
Response IP 198.49.23.177
Found Yes
Hash fcacfd5aabe289e976bc0970a97a3bab4e421012ec7296e64c056017842b2914
SimHash 3c127760adda

Groups

*

Rule Path
Disallow /display/admin/
Disallow /display/Search
Disallow /display/Login
Disallow /display/RecoverPassword
Disallow /login
Disallow /contributor
Disallow /journal/category
Disallow /journal/week
Disallow /journal/month
Disallow /journal/recommend
Disallow /journal/author
Disallow /cancergeek/category
Disallow /cancergeek/week
Disallow /cancergeek/month
Disallow /cancergeek/recommend
Disallow /cancergeek/author

Comments

  • Squarespace Standard Robot Exclusion
  • Access is disallowed to functional / filtering URLs
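
As a rough illustration of how the `*` group listed above would be applied by a crawler, the following sketch rebuilds the rules in robots.txt syntax and checks a few paths with Python's standard `urllib.robotparser`. The helper names (`build_parser`, `is_allowed`), the `ExampleBot` user agent, and the sample paths are assumptions made for illustration; they are not part of the scan data. The parser matches by simple path prefix, so real crawlers may interpret edge cases differently.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the "*" group in the scan above.
DISALLOWED = [
    "/display/admin/",
    "/display/Search",
    "/display/Login",
    "/display/RecoverPassword",
    "/login",
    "/contributor",
    "/journal/category",
    "/journal/week",
    "/journal/month",
    "/journal/recommend",
    "/journal/author",
    "/cancergeek/category",
    "/cancergeek/week",
    "/cancergeek/month",
    "/cancergeek/recommend",
    "/cancergeek/author",
]

BASE = "https://cancergeek.squarespace.com"


def build_parser(paths):
    """Rebuild a robots.txt body from the listed paths and parse it (hypothetical helper)."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    parser = RobotFileParser()
    parser.parse(lines)  # parse() accepts an iterable of robots.txt lines
    return parser


parser = build_parser(DISALLOWED)


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the rules above (hypothetical helper)."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Sample paths are illustrative only; they were not observed in the scan.
    for path in ["/", "/login", "/journal/category/news", "/display/admin/"]:
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

Because matching is prefix-based, a rule such as `/journal/category` also covers every URL beneath it, which is consistent with the comment that access is disallowed to functional and filtering URLs.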