bitethemusic.com
robots.txt

Robots Exclusion Standard data for bitethemusic.com

Resource Scan

Scan Details

Site Domain bitethemusic.com
Base Domain bitethemusic.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-09-05T20:34:00+00:00
Next Scan 2025-12-04T20:34:00+00:00

Last Successful Scan

Scanned 2021-12-10T20:33:26+00:00
URL https://bitethemusic.com/robots.txt
Response IP 104.21.38.247
Found Yes
Hash 8bad40bd097d9ad88e0d1c424183d61797d25d1ad279f8b5c98572bceb8c6ebb
SimHash e0dc5a80c9e0

Groups

*

Rule Path
Allow /wp-content/uploads/*
Allow /wp-content/*.js
Allow /wp-content/*.css
Allow /wp-content/*.ttf
Allow /wp-includes/*.js
Allow /wp-includes/*.css
Allow /wp-content/*.ttf
Allow /wp-admin/admin-ajax.php
Allow /*.css$
Allow /*.js$
Allow /*.ttf
Allow /*.png
Allow *.jpg
Allow *.jpeg
Allow *.gif
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin
Disallow /*/attachment/
Disallow /?attachment_id*
Disallow /tag/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
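The "*" groups above mix Allow and Disallow patterns that use the `*` wildcard and the `$` end anchor. The following is a minimal sketch of how a crawler might evaluate such rules, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The `RULES` subset and the helper names `pattern_to_regex` and `is_allowed` are hypothetical and chosen for illustration; real crawlers may differ in detail.

```python
import re

# A small subset of the "*" group rules listed above: (rule type, path pattern).
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*.css$"),
    ("allow", "/feed/$"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/tag/"),
    ("disallow", "/feed/"),
    ("disallow", "/*/feed/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins, Allow wins ties, and a path
    that matches no pattern is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
                 "/feed/", "/feed/atom/", "/category/news/feed/", "/about/"]:
        print(path, "allowed" if is_allowed(path) else "blocked")
```

Under these assumptions, `/feed/` itself stays fetchable because `Allow /feed/$` is longer than `Disallow /feed/`, while anything deeper under `/feed/` is blocked.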

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20
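The crawl-delay records above can be read with Python's standard `urllib.robotparser`. This is a sketch only: the `SAMPLE` text is a hypothetical reconstruction of a few groups from this report, not the original file, whose exact wording and casing are not shown here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of three groups from the report above.
SAMPLE = """\
User-agent: msnbot
Crawl-delay: 20

User-agent: slurp
Crawl-delay: 20

User-agent: wget
Disallow: /
"""

rp = RobotFileParser()
rp.parse(SAMPLE.splitlines())

# Delay (in seconds) requested from msnbot between successive requests.
msnbot_delay = rp.crawl_delay("msnbot")

# msnbot has no path rules in its group, so paths are allowed for it;
# wget is blocked from the whole site.
msnbot_ok = rp.can_fetch("msnbot", "https://bitethemusic.com/feed/")
wget_ok = rp.can_fetch("wget", "https://bitethemusic.com/")

if __name__ == "__main__":
    print(msnbot_delay, msnbot_ok, wget_ok)
```

A polite crawler would sleep for the returned delay between requests; honoring `crawl-delay` is voluntary and not every crawler supports it.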

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

Other Records

Field Value
sitemap https://bitethemusic.com/sitemap_index.xml

Comments

  • Robots for bitethemusic.com
  • 2020-04-22
  • Font exceptions
  • Image exceptions
  • Attachments
  • De-index post keywords (tags)
  • Other items
  • Block dynamic URLs
  • Removed because the theme's URLs use versions with ?
  • Disallow: /*?
  • Block searches
  • Block trackbacks
  • Block feeds for crawlers
  • Slow down some bots that tend to go haywire
  • Block bots and crawlers of little use
  • If you use Yoast SEO, these are the main sitemaps