astridterrazas.com
robots.txt

Robots Exclusion Standard data for astridterrazas.com

Resource Scan

Scan Details

Site Domain astridterrazas.com
Base Domain astridterrazas.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-03T23:47:34+00:00
Next Scan 2025-10-10T23:47:34+00:00

Last Successful Scan

Scanned2025-09-02T12:13:17+00:00
URL https://astridterrazas.com/robots.txt
Domain IPs 198.185.159.144, 198.185.159.145, 198.49.23.144, 198.49.23.145
Response IP 198.185.159.144
Found Yes
Hash 9709d2a4b80a54e2b9a2f6cfc2499ebc2e1771916937baf18a33d5e6bc8be6d7
SimHash 11901d42e081

Groups

amazonbot
anthropic-ai
applebot-extended
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
duckassistbot
facebookbot
google-cloudvertexbot
google-extended
gptbot
meta-externalagent
meta-externalagent
perplexitybot
quora-bot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
*

Rule Path
Disallow /config
Disallow /search
Disallow /account$
Disallow /account/
Disallow /commerce/digital-download/
Disallow /api/
Allow /api/ui-extensions/
Disallow /static/
Disallow /*?author=*
Disallow /*%26author%3D*
Disallow /*?tag=*
Disallow /*%26tag%3D*
Disallow /*?month=*
Disallow /*%26month%3D*
Disallow /*?view=*
Disallow /*%26view%3D*
Disallow /*?format=json
Disallow /*%26format%3Djson
Disallow /*?format=page-context
Disallow /*%26format%3Dpage-context
Disallow /*?format=main-content
Disallow /*%26format%3Dmain-content
Disallow /*?format=json-pretty
Disallow /*%26format%3Djson-pretty
Disallow /*?format=ical
Disallow /*%26format%3Dical
Disallow /*?reversePaginate=*
Disallow /*%26reversePaginate%3D*

Other Records

Field Value
sitemap http://astridterrazas.com/sitemap.xml

Comments

  • Squarespace Robots Txt