concienciaeco.com
robots.txt

Robots Exclusion Standard data for concienciaeco.com

Resource Scan

Scan Details

Site Domain concienciaeco.com
Base Domain concienciaeco.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-07-22T01:25:47+00:00
Next Scan 2024-10-20T01:25:47+00:00

Last Successful Scan

Scanned2024-03-30T21:37:59+00:00
URL https://concienciaeco.com/robots.txt
Domain IPs 104.26.12.118, 104.26.13.118, 172.67.75.9, 2606:4700:20::681a:c76, 2606:4700:20::681a:d76, 2606:4700:20::ac43:4b09
Response IP 172.67.75.9
Found Yes
Hash 6ed36f14d564730a62f6d014e6f39f4ca09e373d7987b63e92ce36028642adda
SimHash 0a7cce105d11

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-content/gallery/
Allow /wp-content/plugins/
Allow /wp-content/themes/
Allow /wp-includes/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /al_azar.php
Disallow /wp-
Disallow */comments
Disallow /index.php
Allow /feed/$
Disallow /*/*/page/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 50 specifies a 50 second timeout

bingbot

No rules defined. All paths allowed.

Other Records

Field Value Comment
crawl-delay 50 specifies a 50 second timeout

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

Comments

  • Permitimos el feed general para Google Blogsearch.
  • Impedimos que permalink/feed/ sea indexado ya que el
  • feed con los comentarios suele posicionarse en lugar de
  • la entrada y desorienta a los usuarios.
  • Lo mismo con URLs terminadas en /trackback/ que sólo
  • sirven como Trackback URI (y son contenido duplicado).
  • Bots no permitidos
  • para evitar ataque desde http://www.80legs.com/webcrawler.html
  • Bots controlados

Warnings

  • 2 invalid lines.