paginacentral.com.mx
robots.txt

Robots Exclusion Standard data for paginacentral.com.mx

Resource Scan

Scan Details

Site Domain paginacentral.com.mx
Base Domain paginacentral.com.mx
Scan Status Ok
Last Scan2026-01-04T04:04:31+00:00
Next Scan 2026-01-11T04:04:31+00:00

Last Scan

Scanned2026-01-04T04:04:31+00:00
URL https://paginacentral.com.mx/robots.txt
Domain IPs 104.21.22.196, 172.67.206.226, 2606:4700:3031::ac43:cee2, 2606:4700:3037::6815:16c4
Response IP 172.67.206.226
Found Yes
Hash 6588f20ec3a1c995cf520811628fa9523f9c3674d7f11892f951826d7e059af3
SimHash e8d44e824032

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /publicidad/
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Allow /*.js$
Allow /*.css$
Disallow /*.pdf$

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

hl_ftien_spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

Comments

  • Bloquear o permitir acceso a contenido adjunto. (Si la instalación está en /public_html).
  • Impedir el acceso a los diferentes feed que genere la página
  • Impedir URLs terminadas en /trackback/ que sirven como Trackback URL.
  • Evita bloqueos de CSS y JS.
  • Bloquear todos los pdfs
  • Bloquear parámetros
  • Lista de bots que deberías permitir.
  • Lista de bots bloqueados

Warnings

  • 1 invalid line.