bebetests.com
robots.txt

Robots Exclusion Standard data for bebetests.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	bebetests.com
Base Domain	bebetests.com
Scan Status	Ok
Last Scan	2025-05-30T03:46:57+00:00
Next Scan	2025-06-06T03:46:57+00:00

Last Scan

Scanned	2025-05-30T03:46:57+00:00
URL	https://bebetests.com/robots.txt
Domain IPs	75.102.57.85
Response IP	75.102.57.85
Found	Yes
Hash	368b2d0644a01bb0cad64bbbcb21e3151b7cca1228f4cde5008e6bad3d986900
SimHash	6af458900553

Groups

*

Rule	Path
Allow	/wp-content/uploads/
Disallow	/cgi-bin
Disallow	/wp-content/plugins/
Disallow	/wp-content/themes/
Disallow	/wp-includes/
Disallow	/wp-admin/
Disallow	/?s=
Disallow	/search
Allow	/feed/$
Disallow	/feed
Disallow	/comments/feed
Disallow	/*/feed/$
Disallow	/*/feed/rss/$
Disallow	/*/trackback/$
Disallow	///feed/$
Disallow	///feed/rss/$
Disallow	///trackback/$
Disallow	///*/feed/$
Disallow	///*/feed/rss/$
Disallow	///*/trackback/$
Allow	/*.js$
Allow	/*.css$

Rule

Path

Allow

/wp-content/uploads/

Disallow

/cgi-bin

Disallow

/wp-content/plugins/

Disallow

/wp-content/themes/

Disallow

/wp-includes/

Disallow

/wp-admin/

Disallow

/?s=

Disallow

/search

Allow

/feed/$

Disallow

/feed

Disallow

/comments/feed

Disallow

/*/feed/$

Disallow

/*/feed/rss/$

Disallow

/*/trackback/$

Disallow

/*/*/feed/$

Disallow

/*/*/feed/rss/$

Disallow

/*/*/trackback/$

Disallow

/*/*/*/feed/$

Disallow

/*/*/*/feed/rss/$

Disallow

/*/*/*/trackback/$

Allow

/*.js$

Allow

/*.css$

googlebot-image

Rule	Path
Allow	/wp-content/uploads/

Rule

Path

Allow

/wp-content/uploads/

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

yandex

Rule	Path
Allow	/yandex_e75a3d9629f3f146.html

Rule

Path

Allow

/yandex_e75a3d9629f3f146.html

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	50

Field

Value

crawl-delay

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

Field

Value

crawl-delay

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

Other Records

Field	Value
sitemap	http://cdn.attracta.com/sitemap/6055229.xml.gz

Field

Value

sitemap

http://cdn.attracta.com/sitemap/6055229.xml.gz

Comments

robots.txt para un blog WordPress.
Bloquear o permitir acceso a contenido adjunto. (Si la instalación está en /public_html).
Desindexar carpetas que empiecen por wp-
Permitir Feed general para Google Blogsearch.
Impedir que /permalink/feed/ sea indexado pues el feed de comentarios suele posicionarse antes de los post.
Impedir URLs terminadas en /trackback/ que sirven como Trackback URI (contenido duplicado).
Evita bloqueos de CSS y JS.
Lista de bots que deberías permitir.
Lista de bots que generan consultas abusivas aunque siguen las pautas del archivo robots.txt
Slurp (Yahoo!), Noxtrum y el bot de MSN que suelen generar excesivas consultas.
Begin Attracta SEO Tools Sitemap. Do not remove
End Attracta SEO Tools Sitemap. Do not remove

bebetests.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-image

adsbot-google

googlebot-mobile

yandex

msiecrawler

webcopier

httrack

microsoft.url.control

libwww

noxtrumbot

Other Records

msnbot

Other Records

slurp

Other Records

Other Records

Comments

bebetests.com
robots.txt