urielmania.com.mx
robots.txt

Robots Exclusion Standard data for urielmania.com.mx

Archived Snapshots

Resource Scan

Scan Details

Site Domain	urielmania.com.mx
Base Domain	urielmania.com.mx
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-04-07T15:03:34+00:00
Next Scan	2025-04-08T15:03:34+00:00

Last Successful Scan

Scanned	2025-03-31T15:03:28+00:00
URL	https://urielmania.com.mx/robots.txt
Domain IPs	104.21.20.164, 172.67.193.53, 2606:4700:3035::ac43:c135, 2606:4700:3037::6815:14a4
Response IP	172.67.193.53
Found	Yes
Hash	60f3506a6022607bcea8896da7bcabb31a5222aa91a863c9219e23ac01074e56
SimHash	985d1c100457

Groups

*

Rule	Path
Allow	/wp-content/uploads/
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-includes/
Disallow	/wp-admin/
Disallow	/wp-
Disallow	/?s=
Disallow	/search
Allow	/feed/$
Disallow	/feed
Disallow	/comments/feed
Disallow	/*/feed/$
Disallow	/*/feed/rss/$
Disallow	/*/trackback/$
Disallow	///feed/$
Disallow	///feed/rss/$
Disallow	///trackback/$
Disallow	///*/feed/$
Disallow	///*/feed/rss/$
Disallow	///*/trackback/$

Rule

Path

Allow

/wp-content/uploads/

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-includes/

Disallow

/wp-admin/

Disallow

/wp-

Disallow

/?s=

Disallow

/search

Allow

/feed/$

Disallow

/feed

Disallow

/comments/feed

Disallow

/*/feed/$

Disallow

/*/feed/rss/$

Disallow

/*/trackback/$

Disallow

/*/*/feed/$

Disallow

/*/*/feed/rss/$

Disallow

/*/*/trackback/$

Disallow

/*/*/*/feed/$

Disallow

/*/*/*/feed/rss/$

Disallow

/*/*/*/trackback/$

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	50

Field

Value

crawl-delay

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

Field

Value

crawl-delay

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

Other Records

Field	Value
sitemap	http://tu-web/sitemap.xml

Field

Value

sitemap

http://tu-web/sitemap.xml

Comments

robots.txt para tu blog en WordPress.
Usar bajo propia responsabilidad, que nos conocemos }:)
http://sigt.net/archivo/robotstxt-para-wordpress.xhtml
Primero el contenido adjunto.
Tambiï¿½n podemos desindexar todo lo que empiece
por wp-. Es lo mismo que los Disallow de arriba pero
incluye cosas como wp-rss.php
Sitemap permitido, bï¿½squedas no.
Permitimos el feed general para Google Blogsearch.
Impedimos que permalink/feed/ sea indexado ya que el
feed con los comentarios suele posicionarse en lugar de
la entrada y desorienta a los usuarios.
Lo mismo con URLs terminadas en /trackback/ que sï¿½lo
sirven como Trackback URI (y son contenido duplicado).
A partir de aquï¿½ es opcional pero recomendado.
Lista de bots que suelen respetar el robots.txt pero rara
vez hacen un buen uso del sitio y abusan bastanteï¿½
Aï¿½adir al gusto del consumidorï¿½
Slurp (Yahoo!), Noxtrum y el bot de MSN a veces tienen
idas de pinza, toca decirles que reduzcan la marcha.
El valor es en segundos y podï¿½is dejarlo bajo e ir
subiendo hasta el punto ï¿½ptimo.

urielmania.com.mxrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

msiecrawler

webcopier

httrack

microsoft.url.control

libwww

noxtrumbot

Other Records

msnbot

Other Records

slurp

Other Records

Other Records

Comments

urielmania.com.mx
robots.txt