technoreeze.com
robots.txt

Robots Exclusion Standard data for technoreeze.com

Resource Scan

Scan Details

Site Domain technoreeze.com
Base Domain technoreeze.com
Scan Status Ok
Last Scan2026-02-13T07:30:12+00:00
Next Scan 2026-02-20T07:30:12+00:00

Last Scan

Scanned2026-02-13T07:30:12+00:00
URL https://technoreeze.com/robots.txt
Domain IPs 31.22.4.18
Response IP 31.22.4.18
Found Yes
Hash 05cd1590ae3c5e7c4574a7aa99c9a18a0bcb51d18aadf2e6b01ba94475929bfb
SimHash e379fb738c4f

Groups

*

Rule Path
Allow /wp-content/uploads/*
Allow /wp-content/*.js
Allow /wp-content/*.css
Allow /wp-includes/*.js
Allow /wp-includes/*.css
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-json/*
Disallow /*/attachment/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /?attachment_id*
Disallow /*?

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

msiecrawler
webcopier
httrack
microsoft.url.control
libwww
orthogaffe
ubicrawler
doc
zao
sitecheck.internetseer.com
zealbot
msiecrawler
sitesnagger
webstripper
webcopier
fetch
offline explorer
teleport
teleportpro
webzip
linko
httrack
microsoft.url.control
xenu
larbin
libwww
zyborg
download ninja
wget
grub-client
k2spider
npbot
webreaper
crystalsemantics
grapeshot
rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap http://www.technoreeze.com/sitemap.xml
sitemap http://www.technoreeze.com/sitemap_index.xml
sitemap http://www.technoreeze.com/category-sitemap.xml
sitemap http://www.technoreeze.com/page-sitemap.xml
sitemap http://www.technoreeze.com/post-sitemap.xml

Comments

  • Bloqueo de las URL dinamicas
  • Bloqueo de busquedas
  • Bloqueo de trackbacks
  • Bloqueo de feeds para crawlers
  • Ralentizamos algunos bots que se suelen volver locos
  • User-agent: noxtrumbot
  • Crawl-delay: 20
  • User-agent: msnbot
  • Crawl-delay: 20
  • User-agent: Slurp
  • Crawl-delay: 20
  • Bloqueo de bots y crawlers poco utiles
  • Previene problemas de recursos bloqueados en Google Webmaster Tools
  • Sitemaps
  • Sitemaps Yoast SEO