estudioteca.net
robots.txt

Robots Exclusion Standard data for estudioteca.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	estudioteca.net
Base Domain	estudioteca.net
Scan Status	Ok
Last Scan	2026-01-26T21:28:51+00:00
Next Scan	2026-02-02T21:28:51+00:00

Last Scan

Scanned	2026-01-26T21:28:51+00:00
URL	https://estudioteca.net/robots.txt
Domain IPs	2001:41d0:301:4::23, 51.254.16.36
Response IP	51.254.16.36
Found	Yes
Hash	e51d912e11df9eb2c7880782d215798ac42cb001682e53a03f6c0d5bc730ef70
SimHash	98f946100452

Groups

*

Rule	Path
Allow	/wp-content/uploads/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-includes/
Disallow	/wp-admin/
Disallow	/go/
Disallow	/wp-
Disallow	/?s=
Disallow	/search

Rule

Path

Allow

/wp-content/uploads/

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-includes/

Disallow

/wp-admin/

Disallow

/go/

Disallow

/wp-

Disallow

/?s=

Disallow

/search

mediapartners-google

Rule	Path
Disallow
Disallow	/feed
Disallow	/comments/feed
Disallow	/*/feed/$
Disallow	/*/feed/rss/$
Disallow	/*/trackback/$
Disallow	///feed/$
Disallow	///feed/rss/$
Disallow	///trackback/$
Disallow	///*/feed/$
Disallow	///*/feed/rss/$
Disallow	///*/trackback/$

Rule

Path

Disallow

/feed

Disallow

/comments/feed

Disallow

/*/feed/$

Disallow

/*/feed/rss/$

Disallow

/*/trackback/$

Disallow

/*/*/feed/$

Disallow

/*/*/feed/rss/$

Disallow

/*/*/trackback/$

Disallow

/*/*/*/feed/$

Disallow

/*/*/*/feed/rss/$

Disallow

/*/*/*/trackback/$

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

/

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

/

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

/

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

/

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	http://tu-web/sitemap.xml

Field

Value

sitemap

http://tu-web/sitemap.xml

Back to top

Comments

robots.txt para tu blog en WordPress.
Usar bajo propia responsabilidad, que nos conocemos }:)
http://sigt.net/archivo/robotstxt-para-wordpress.xhtml
Primero el contenido adjunto.
También podemos desindexar todo lo que empiece
por wp-. Es lo mismo que los Disallow de arriba pero
incluye cosas como wp-rss.php
Sitemap permitido, búsquedas no.
Permitimos el feed general para Google Blogsearch.
Impedimos que permalink/feed/ sea indexado ya que el
feed con los comentarios suele posicionarse en lugar de
la entrada y desorienta a los usuarios.
Lo mismo con URLs terminadas en /trackback/ que sólo
sirven como Trackback URI (y son contenido duplicado).
A partir de aquí es opcional pero recomendado.
Lista de bots que suelen respetar el robots.txt pero rara
vez hacen un buen uso del sitio y abusan bastante
Añadir al gusto del consumidor

Back to top

estudioteca.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

mediapartners-google

msiecrawler

webcopier

httrack

microsoft.url.control

libwww

Other Records

Comments

estudioteca.net
robots.txt