elblogdemama.es
robots.txt

Robots Exclusion Standard data for elblogdemama.es

Archived Snapshots

Resource Scan

Scan Details

Site Domain	elblogdemama.es
Base Domain	elblogdemama.es
Scan Status	Ok
Last Scan	2024-07-06T00:07:37+00:00
Next Scan	2024-07-13T00:07:37+00:00

Last Scan

Scanned	2024-07-06T00:07:37+00:00
URL	https://elblogdemama.es/robots.txt
Domain IPs	75.102.57.85
Response IP	75.102.57.85
Found	Yes
Hash	7535a476e67b828b6667ed36ca9302d7ebf4db23c75f42730d181170c1ac41a4
SimHash	62d458928453

Groups

*

Rule	Path
Allow	/wp-content/uploads/
Disallow	/cgi-bin
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-includes/
Disallow	/wp-admin/
Disallow	/wp-
Disallow	/?s=
Disallow	/search/
Allow	/feed/$
Allow	/feed/instant-articles
Allow	/feed/podcast
Allow	/feed
Allow	/*/feed/$
Allow	///feed/$
Allow	///*/feed/$
Disallow	/*/feed/rss/$
Disallow	/comments/feed
Disallow	/*/trackback/$
Disallow	///feed/rss/$
Disallow	///trackback/$
Disallow	///*/feed/rss/$
Disallow	///*/trackback/$
Allow	/*.js$
Allow	/*.css$

Rule

Path

Allow

/wp-content/uploads/

Disallow

/cgi-bin

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-includes/

Disallow

/wp-admin/

Disallow

/wp-

Disallow

/?s=

Disallow

/search/

Allow

/feed/$

Allow

/feed/instant-articles

Allow

/feed/podcast

Allow

/feed

Allow

/*/feed/$

Allow

/*/*/feed/$

Allow

/*/*/*/feed/$

Disallow

/*/feed/rss/$

Disallow

/comments/feed

Disallow

/*/trackback/$

Disallow

/*/*/feed/rss/$

Disallow

/*/*/trackback/$

Disallow

/*/*/*/feed/rss/$

Disallow

/*/*/*/trackback/$

Allow

/*.js$

Allow

/*.css$

googlebot-image

Rule	Path
Allow	/wp-content/uploads/

Rule

Path

Allow

/wp-content/uploads/

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

yandex

Rule	Path
Allow	/yandex_e75a3d9629f3f146.html

Rule

Path

Allow

/yandex_e75a3d9629f3f146.html

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	50

Field

Value

crawl-delay

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

Field

Value

crawl-delay

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

Other Records

Field	Value
sitemap	https://elblogdemama.es/sitemap_index.xml
sitemap	https://elblogdemama.es/post-sitemap.xml
sitemap	https://elblogdemama.es/page-sitemap.xml
sitemap	https://elblogdemama.es/category-sitemap.xml
sitemap	https://elblogdemama.es/podcast-sitemap.xml
sitemap	http://cdn.attracta.com/sitemap/6044571.xml.gz
sitemap	http://cdn.attracta.com/sitemap/6044571.xml.gz

Field

Value

sitemap

https://elblogdemama.es/sitemap_index.xml

sitemap

https://elblogdemama.es/post-sitemap.xml

sitemap

https://elblogdemama.es/page-sitemap.xml

sitemap

https://elblogdemama.es/category-sitemap.xml

sitemap

https://elblogdemama.es/podcast-sitemap.xml

sitemap

http://cdn.attracta.com/sitemap/6044571.xml.gz

sitemap

http://cdn.attracta.com/sitemap/6044571.xml.gz

Comments

robots.txt para un blog WordPress.
Bloquear o permitir acceso a contenido adjunto. (Si la instalación está en /public_html).
Desindexar carpetas que empiecen por wp-
Permitir Feed general para Google Blogsearch.
Impedir que /permalink/feed/ sea indexado pues el feed de comentarios suele posicionarse antes de los post.
Impedir URLs terminadas en /trackback/ que sirven como Trackback URI (contenido duplicado).
Evita bloqueos de CSS y JS.
Lista de bots que deberías permitir.
Lista de bots que generan consultas abusivas aunque siguen las pautas del archivo robots.txt
Slurp (Yahoo!), Noxtrum y el bot de MSN que suelen generar excesivas consultas.
Begin Attracta SEO Tools Sitemap. Do not remove
End Attracta SEO Tools Sitemap. Do not remove
Begin Attracta SEO Tools Sitemap. Do not remove
End Attracta SEO Tools Sitemap. Do not remove

elblogdemama.esrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-image

adsbot-google

googlebot-mobile

yandex

msiecrawler

webcopier

httrack

microsoft.url.control

libwww

noxtrumbot

Other Records

msnbot

Other Records

slurp

Other Records

Other Records

Comments

elblogdemama.es
robots.txt