todosurf.com
robots.txt

Robots Exclusion Standard data for todosurf.com

Resource Scan

Scan Details

Site Domain todosurf.com
Base Domain todosurf.com
Scan Status Ok
Last Scan 2024-11-15T14:28:21+00:00
Next Scan 2024-11-22T14:28:21+00:00

Last Scan

Scanned 2024-11-15T14:28:21+00:00
URL https://todosurf.com/robots.txt
Redirect https://www.todosurf.com/robots.txt
Redirect Domain www.todosurf.com
Redirect Base todosurf.com
Domain IPs 104.21.0.129, 172.67.150.248, 2606:4700:3035::6815:81, 2606:4700:3035::ac43:96f8
Redirect IPs 104.21.0.129, 172.67.150.248, 2606:4700:3035::6815:81, 2606:4700:3035::ac43:96f8
Response IP 172.67.150.248
Found Yes
Hash f7469a7a90b564f4f569def6646131e9fa1539d5a927ab1aa69460048226f5b1
SimHash e85d7c824cc7

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
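
The group above pairs a broad Allow with narrower Disallow rules. Under the longest-match precedence described in RFC 9309, the most specific matching path wins, so /wp-admin/ stays blocked even though Allow / also matches it. The following is a minimal sketch of that rule, assuming plain prefix paths with no wildcards; the helper name `is_allowed` is hypothetical and not part of the scan.

```python
# Rules copied from the first "*" group in the scan above.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/wp-includes/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled (hypothetical helper).

    The longest matching prefix wins; on a tie the allow rule wins;
    a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            if n > best_len or (n == best_len and kind == "allow"):
                best_len, allowed = n, (kind == "allow")
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/options.php", "/wp-content/uploads/a.jpg", "/"):
        print(p, is_allowed(p))
```

Not every parser uses this precedence. Python's standard `urllib.robotparser`, for example, applies the first matching rule in file order, so results for this group can differ between tools.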

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50
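
The three groups above (noxtrumbot, msnbot, slurp) carry only a crawl-delay record. Crawl-delay is a non-standard extension that some crawlers interpret as the number of seconds to wait between requests. As a rough sketch, the value can be read with Python's standard `urllib.robotparser`; the robots.txt fragment below is reconstructed from the scan (the exact casing and spacing of the original file are assumptions), and `delay_for` is a hypothetical helper.

```python
import urllib.robotparser

# Fragment reconstructed from the three crawl-delay groups in the scan.
# The original file's exact formatting is an assumption.
ROBOTS_FRAGMENT = """\
User-agent: noxtrumbot
Crawl-delay: 50

User-agent: msnbot
Crawl-delay: 50

User-agent: slurp
Crawl-delay: 50
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())


def delay_for(agent, default=0):
    """Return the crawl delay in seconds for `agent` (hypothetical helper)."""
    delay = parser.crawl_delay(agent)
    return default if delay is None else delay


if __name__ == "__main__":
    for agent in ("msnbot", "Slurp", "examplebot"):
        print(agent, delay_for(agent))
```

A polite fetcher would sleep for `delay_for(agent)` seconds between consecutive requests to the site.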

*

Rule Path
Disallow /*?

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
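
Several of the paths above use the two pattern characters defined in RFC 9309: `*` matches any sequence of characters, and a trailing `$` anchors the pattern to the end of the URL path. The sketch below shows one way to evaluate such patterns by translating them to regular expressions; `pattern_to_regex` and `matches` are hypothetical helper names, and the sample paths are made up for illustration.

```python
import re


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def matches(pattern, path):
    """Return True if `path` matches `pattern` from the start of the path."""
    return pattern_to_regex(pattern).match(path) is not None


if __name__ == "__main__":
    print(matches("/feed/$", "/feed/"))        # the exact main feed URL
    print(matches("/feed/$", "/feed/atom/"))   # anything deeper than /feed/
    print(matches("/*/feed/$", "/olas/feed/")) # a one-level sub-feed
    print(matches("/*?", "/page?x=1"))         # any URL with a query string
```

Read this way, Allow /feed/$ matches only the exact path /feed/, while Disallow /feed/ matches everything beneath it. Under longest-match precedence the longer Allow pattern would win for /feed/ itself, leaving the main feed crawlable and the deeper feed URLs blocked.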

Other Records

Field Value
sitemap https://www.todosurf.com/sitemap_index.xml

Comments

  • First, the attached content.
  • Sitemap allowed.
  • We block the following not very useful bots to avoid overloading the server.
  • Slurp (Yahoo!), Noxtrum, and the MSN bot sometimes go haywire, so they need to be told to slow down.
  • The value is in seconds; you can start low and raise it until you reach the optimal point.
  • Block dynamic URLs
  • Block searches
  • Block trackbacks
  • Disallow: /*trackback*
  • Block feeds for crawlers