canal21ebre.com
robots.txt

Robots Exclusion Standard data for canal21ebre.com

Resource Scan

Scan Details

Site Domain canal21ebre.com
Base Domain canal21ebre.com
Scan Status Ok
Last Scan2024-09-26T02:50:25+00:00
Next Scan 2024-10-03T02:50:25+00:00

Last Scan

Scanned2024-09-26T02:50:25+00:00
URL https://canal21ebre.com/robots.txt
Redirect https://www.canal21ebre.com/robots.txt
Redirect Domain www.canal21ebre.com
Redirect Base canal21ebre.com
Domain IPs 148.251.48.80
Redirect IPs 148.251.48.80
Response IP 148.251.48.80
Found Yes
Hash 2aa0e39761211ff21d886c22f15748699e64682a2329cccfdc31ae7a743d3489
SimHash 839f6051ce87

Groups

*

Rule Path
Disallow /wp-login
Disallow /wp-admin
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/attachment/
Disallow /*/page/
Disallow /*/feed/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /*?s=
Disallow /*amp%3D1
Disallow /*amp
Disallow /?attachment_id*
Disallow /tag/*/?amp=1
Disallow /?amp=1
Disallow /?amp
Disallow /*/?amp=1
Disallow /*amp
Disallow /*/noticia.php?id=*

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

nutch

Rule Path
Disallow /

spock

Rule Path
Disallow /

omniexplorer_bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

geniebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sbider/nutch

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

magent

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

speedy spider

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

huasai

Rule Path
Disallow /

datacha0s

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

atomic_email_hunter

Rule Path
Disallow /

mp3bot

Rule Path
Disallow /

winhttp

No rules defined. All paths allowed.

Comments

  • Actualització dia 10/07/2023
  • los rastreadores tendrían que ser amables y obedecer
  • a menos que estén alimentando los motores de búsqueda.
  • Algunos robots son conocidos por ser un problema, sobre todo los destinadas a copiar
  • sitios enteros o descargarlas para verlos sin conexión. Por favor obedeced mi robots.txt.

Warnings

  • 3 invalid lines.