ohmagazinerd.com
robots.txt

Robots Exclusion Standard data for ohmagazinerd.com

Resource Scan

Scan Details

Site Domain ohmagazinerd.com
Base Domain ohmagazinerd.com
Scan Status Ok
Last Scan2024-11-03T21:44:24+00:00
Next Scan 2024-11-10T21:44:24+00:00

Last Scan

Scanned2024-11-03T21:44:24+00:00
URL https://ohmagazinerd.com/robots.txt
Domain IPs 104.21.92.166, 172.67.196.141, 2606:4700:3030::ac43:c48d, 2606:4700:3031::6815:5ca6
Response IP 172.67.196.141
Found Yes
Hash d4c048b874a66f2408513d2a3709c66eaeaca6d42ea3ebf240836410c5e91b6d
SimHash 289c52d1c9a0

Groups

*

Rule Path
Allow /wp-content/uploads/*
Allow /wp-content/*.js
Allow /wp-content/*.css
Allow /wp-includes/*.js
Allow /wp-includes/*.css
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /*/attachment/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /?attachment_id*
Disallow /*?
Disallow /?blackhole

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Allow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

facebookexternalhit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

fastly

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

aspiegelbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

marfeelman

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

applewebkit/537.36

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

grapeshotcrawler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 40

ahrefsbot
ahrefssiteaudit
adbeat_bot
alexibot
appengine
aqua_products
archive.org_bot
archive
asterias
b2w/0.1
backdoorbot/1.0
becomebot
blekkobot
blexbot
blowfish/1.0
bookmark search tool
botalot
builtbottough
bullseye/1.0
bunnyslippers
ccbot
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
chroot
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dittospyder
dotbot
dumbot
emailcollector
emailsiphon
emailwolf
enterprise_search
enterprise_search/1.0
erocrawler
es
exabot
extractorpro
fairad client
flaming attackbot
foobot
gaisbot
getright/4.2
gigabot
grub
grub-client
go-http-client
harvest/1.5
hatena antenna
hloader
http://www.searchengineworld.com bot
http://www.webmasterworld.com bot
httplib
humanlinks
ia_archiver
ia_archiver/1.6
infonavirobot
iron33/1.0.2
jamesbot
jennybot
jetbot
jetbot/1.0
jorgee
kenjin spider
keyword density/0.9
larbin
lexibot
libweb/clshttp
linkextractorpro
linkpadbot
linkscan/8.1a unix
linkwalker
lnspiderguy
looksmart
lwp-trivial
lwp-trivial/1.34
mata hari
megalodon
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
moget
moget/2.1
mozilla
mozilla
mozilla/3
mozilla/4
mozilla/4.0 (compatible; bullseye; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 2000)
mozilla/4.0 (compatible; msie 4.0; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 98)
mozilla/4.0 (compatible; msie 4.0; windows nt)
mozilla/4.0 (compatible; msie 4.0; windows xp)
mozilla/5
msiecrawler
naver
nerdybot
netants
netmechanic
nicerspro
nutch
offline explorer
openbot
openfind
openfind data gathere
oracle ultra search
perman
propowerbot/2.14
prowebwalker
psbot
python-urllib
queryn metasearch
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
rma
rogerbot
scooter
screaming frog seo spider
searchpreview
semrushbot
semrushbot
semrushbot-sa
seokicks-robot
sitesnagger
sootle
spankbot
spanner
spbot
stanford
stanford comp sci
stanford compclub
stanford compsciclub
stanford spiderboys
surveybot
surveybot_ignoreip
suzuran
szukacz/1.4
szukacz/1.4
teleport
teleportpro
telesoft
teoma
the intraformant
thenomad
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
typhoeus
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webenhancer
webmasterworld extractor
webmasterworldforumbot
websauger
website quester
webster pro
webstripper
webvac
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
www-collector-e
xenu's
xenu's link sleuth 1.1c
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
libwww-perl
w3.org

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap http://ohmagazine.listindiario.com/sitemap_index.xml
sitemap http://ohmagazine.listindiario.com/post-sitemap.xml
sitemap http://ohmagazine.listindiario.com/page-sitemap.xml
sitemap http://ohmagazine.listindiario.com/ajde_events-sitemap.xml
sitemap http://ohmagazine.listindiario.com/tdb_templates-sitemap.xml
sitemap http://ohmagazine.listindiario.com/category-sitemap.xml
sitemap http://ohmagazine.listindiario.com/post_tag-sitemap.xml
sitemap http://ohmagazine.listindiario.com/author-sitemap.xml

Comments

  • robots de Digo Networks
  • es necesario personalizar algunas opciones o puede dar problemas
  • Bloqueo basico para todos los bots y crawlers
  • puede dar problemas por bloqueo de recursos en GWT
  • Bloqueo de las URL dinamicas
  • Bloqueo de busquedas
  • Bloqueo de trackbacks
  • Bloqueo de feeds para crawlers
  • Ralentizamos algunos bots que se suelen volver locos
  • Bloqueo de bots y crawlers poco utiles
  • Previene problemas de recursos bloqueados en Google Webmaster Tools
  • En condiciones normales este es el sitemap
  • Si utilizas Yoast SEO estos son los sitemaps principales