gabrielneuman.com
robots.txt

Robots Exclusion Standard data for gabrielneuman.com

Resource Scan

Scan Details

Site Domain gabrielneuman.com
Base Domain gabrielneuman.com
Scan Status Ok
Last Scan2025-10-04T09:10:11+00:00
Next Scan 2025-11-03T09:10:11+00:00

Last Scan

Scanned2025-10-04T09:10:11+00:00
URL https://gabrielneuman.com/robots.txt
Domain IPs 199.16.172.61, 199.16.173.243
Response IP 199.16.173.243
Found Yes
Hash d1845c53b996cfe83a24f8a3f23612ba89b67229f5ff62757bf6864a8a4c31d0
SimHash 4b19c950c261

Groups

gptbot

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

geminibot

Rule Path
Allow /

google-extended

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

ccbot

Rule Path
Allow /

applebot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

amazonbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

youbot

Rule Path
Allow /

*

Rule Path
Disallow /calendar/action*
Disallow /events/action*
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /*?utm*
Disallow /*?sessionid*
Allow /wp-content/uploads/
Allow /*.css$
Allow /*.js$

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://gabrielneuman.com/sitemap_index.xml

Comments

  • — Reglas específicas para bots de IA —
  • — Reglas generales para todos los bots —
  • — Mapa del sitio —