
Robots Exclusion Standard data for cursosapnetweaver.com

Resource Scan

Scan Details

Site Domain cursosapnetweaver.com
Base Domain cursosapnetweaver.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-05-31T02:48:08+00:00
Next Scan 2025-08-29T02:48:08+00:00

Last Successful Scan

Scanned 2023-08-11T01:51:29+00:00
URL https://cursosapnetweaver.com/robots.txt
Domain IPs 104.21.85.14, 172.67.200.179, 2606:4700:3031::ac43:c8b3, 2606:4700:3035::6815:550e
Response IP 104.21.85.14
Found Yes
Hash 6648ff35fa1739ab49facb7af3be5ba9863fdb183c843f336ad8e6a897fb9220
SimHash e8dd7dc8cce5

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

twitterbot

Rule Path
Disallow *

*

Rule Path
Disallow /wp-admin/*
Disallow /cgi-bin
Disallow /wp-content/plugins/*
Disallow /wp-content/themes/*
Disallow /*/trackback/
Disallow /wp-includes
Disallow /*/attachment/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow */page/*
Disallow /comments/
Disallow /xmlrpc.php
Disallow /?attachment_id*
Disallow /*?m=
Disallow /*?m=*
Disallow /20*
Disallow */wp-content/plugins/*
Disallow */wp-login.php*
Disallow *?s=*
Disallow */mis-feeds/*
Disallow *feedzy-category*
Disallow */blog/*
Disallow */author/*
Disallow */search/*
Disallow */wp-login*
Disallow */?s=*
Disallow */wp-admin/admin-ajax.php*
Disallow /*showComment*
Disallow /*?

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

noxtrumbot
msnbot
slurp
msiecrawler

Rule Path
Allow *

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$
Disallow /wp-content/plugins/link-juice-optimizer/public/js/link-juice-optimizer.js
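
The groups above mix Allow and Disallow rules, `*` wildcards and `$` end anchors, and the `*` user agent appears in several groups, which RFC 9309 says a crawler should combine into one. The following is a minimal sketch of how such rules could be evaluated under the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `rule_to_regex` and `is_allowed` and the `SUBSET` list are hypothetical, chosen for illustration; `SUBSET` holds only a handful of the rules listed above, and real crawlers may treat edge cases (such as patterns that start with `*`) differently.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where the wildcard was.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_allowed(rules, url_path: str) -> bool:
    """Return True if url_path may be crawled under the given rules.

    rules is a list of (kind, pattern) pairs, kind being 'allow' or 'disallow'.
    The longest matching pattern wins; on equal length, 'allow' wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start, so this is a prefix match on the path.
        if rule_to_regex(pattern).match(url_path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

# Illustrative subset of the '*' rules listed in this report (not the full set).
SUBSET = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/feed/$"),
    ("disallow", "/feed/"),
    ("disallow", "/*?"),
]

if __name__ == "__main__":
    for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                 "/feed/", "/feed/rss/", "/curso/?m=1", "/curso/"]:
        print(path, "allowed" if is_allowed(SUBSET, path) else "disallowed")
```

Results depend on which rules are combined. With this subset, `/wp-admin/admin-ajax.php` is allowed because the Allow pattern is longer than `/wp-admin/`. Adding the longer `Disallow */wp-admin/admin-ajax.php*` rule from the larger `*` group would, under the same length-based precedence, flip that path to disallowed.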

Other Records

Field Value
sitemap https://cursosapnetweaver.com/post-sitemap.xml
sitemap https://cursosapnetweaver.com/wp-content/plugins/sitemap-imagenes/sitemap_imagenes.xml

Comments

  • Basic block for all bots and crawlers
  • May cause problems due to blocked resources in GWT (Google Webmaster Tools)
  • Block dynamic URLs
  • Block searches
  • Block trackbacks
  • Block feeds for crawlers
  • Slow down some bots that tend to go haywire
  • Crawl-delay: 20
  • Crawl-delay: 20
  • Crawl-delay: 20
  • Block bots and crawlers of little use
  • Prevents blocked-resource problems in Google Webmaster Tools
  • Sitemaps