marcelonaccarato.com
robots.txt

Robots Exclusion Standard data for marcelonaccarato.com

Resource Scan

Scan Details

Site Domain marcelonaccarato.com
Base Domain marcelonaccarato.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-11-19T13:45:36+00:00
Next Scan 2026-01-18T13:45:36+00:00

Last Successful Scan

Scanned 2025-08-28T21:10:25+00:00
URL https://marcelonaccarato.com/robots.txt
Domain IPs 213.158.86.26
Response IP 213.158.86.26
Found Yes
Hash cef43fe4270289eb60dc56b0db7f8ca764eb151146fdf6043e8463552b8dc76b
SimHash 28d54e044052

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-admin/
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Allow /*.js$
Allow /*.css$
Disallow /*.pdf$
Disallow /*?*
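
The rules in this group use the `*` wildcard and the `$` end-of-path anchor. As a rough illustration of how a crawler might evaluate them, the Python sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the rules is copied in; this is not necessarily how the scanner or any particular crawler implements matching.

```python
import re

# A subset of the "*" group above, in file order (assumption: enough to illustrate).
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/feed/$"),
    ("disallow", "/feed"),
    ("disallow", "/*/feed/$"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*?*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/feed/")` is True because `Allow /feed/$` is longer than `Disallow /feed`, while `is_allowed("/blog/feed/")`, `is_allowed("/docs/file.pdf")`, and `is_allowed("/assets/app.js?v=2")` are all False.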

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

hl_ftien_spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /
Disallow /gracias-por-suscribirte
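
Each group above is keyed by a user-agent token, with `*` as the fallback. The sketch below shows one way a crawler could pick its group: a case-insensitive substring match against the group names, preferring the longest match and falling back to `*`. The substring rule and the names `GROUP_NAMES` and `select_group` are assumptions made for illustration; real crawlers differ in how strictly they compare tokens.

```python
# Group names as listed in this scan record.
GROUP_NAMES = [
    "*", "googlebot-image", "adsbot-google", "googlebot-mobile",
    "msiecrawler", "webcopier", "httrack", "microsoft.url.control",
    "libwww", "baiduspider", "gurujibot", "hl_ftien_spider",
    "sogou spider", "yeti", "yodaobot",
]


def select_group(user_agent, group_names=GROUP_NAMES):
    """Return the group name that applies to `user_agent`, or None.

    Assumption: a group applies when its name occurs (case-insensitively)
    inside the User-Agent string; the longest such name is preferred.
    """
    ua = user_agent.lower()
    matches = [name for name in group_names if name != "*" and name.lower() in ua]
    if matches:
        return max(matches, key=len)
    return "*" if "*" in group_names else None
```

With this sketch, `select_group("HTTrack/3.0")` returns `"httrack"` (a group that disallows everything), and a crawler not named in the file, such as `"Mozilla/5.0 (compatible; bingbot/2.0)"`, falls back to `"*"`.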

Other Records

Field Value
sitemap https://marcelonaccarato.com/sitemaps.xml

Comments

  • Block or allow access to attached content. (If the installation is in /public_html).
  • Prevent access to the various feeds the site generates
  • Block URLs ending in /trackback/, which serve as Trackback URLs.
  • Avoid blocking CSS and JS.
  • Block all PDFs
  • Block parameters
  • List of bots you should allow.
  • List of blocked bots
  • Disallow unnecessary pages
  • We add an indication of the sitemap location