wikidex.net
robots.txt

Robots Exclusion Standard data for wikidex.net

Resource Scan

Scan Details

Site Domain wikidex.net
Base Domain wikidex.net
Scan Status Ok
Last Scan 2024-10-05T16:31:44+00:00
Next Scan 2024-10-12T16:31:44+00:00

Last Scan

Scanned 2024-10-05T16:31:44+00:00
URL https://wikidex.net/robots.txt
Domain IPs 141.95.160.66, 2001:41d0:304:200::825e
Response IP 141.95.160.66
Found Yes
Hash 2d9d2614fb67bf4a927e1e874ec04c600aad78e46e2dd7c295a12ccb02fddf9f
SimHash c198795a6bb3

Groups

*

Rule Path
Disallow /index.php?
Disallow /wiki/Especial%3A
Disallow /*?*title=Especial%3A
Disallow /index.php/Especial%3A
Disallow /wiki/Special%3A
Disallow /*?*title=Special%3A
Disallow /index.php/Special%3A
Disallow /*?action=
Disallow /*?*&action=
Disallow /*?feed=
Disallow /*?*&feed=
Disallow /*?from=
Disallow /*?*&from=
Disallow /*?oldid=
Disallow /*?*&oldid=
Disallow /*?printable=
Disallow /*?*&printable=
Disallow /*?redirect=
Disallow /*?*&redirect=
Disallow /*?useskin=
Disallow /*?*&useskin=
Disallow /*?uselang=
Disallow /*?*&uselang=
Disallow /*?veaction=
Disallow /*?*&veaction=
Disallow /api/
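
The wildcard paths in the group above can be read as simple patterns in which `*` stands for any run of characters. The following is a minimal sketch of that matching, assuming the common convention that `*` matches any sequence and a trailing `$` anchors the end of the path; the names `pattern_to_regex`, `DISALLOW`, and `is_disallowed` are hypothetical, and only a subset of the rules is included. Real crawlers also normalise percent-encoding and weigh Allow rules, which this sketch does not attempt.

```python
import re

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

# A subset of the Disallow paths listed for the '*' group.
DISALLOW = [
    "/index.php?",
    "/wiki/Especial%3A",
    "/*?action=",
    "/*?*&action=",
    "/api/",
]

def is_disallowed(path):
    """Return True if any pattern in DISALLOW matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)

if __name__ == "__main__":
    for sample in ["/wiki/Pikachu", "/wiki/Pikachu?action=edit", "/api/rest"]:
        print(sample, is_disallowed(sample))
```

The paired rules such as `/*?action=` and `/*?*&action=` cover a parameter appearing either first in the query string or after other parameters.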

googlebot-image

Rule Path
Disallow /wiki/Archivo

webreaper

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

httrack

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

semrushbot-seoab

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

magpie-crawler/1.1

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

buck/2.2

Rule Path
Disallow /

yak

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /
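
How a crawler picks one of the groups above can be sketched as a lookup on its user-agent token. This is an illustrative sketch only, assuming a case-insensitive exact match on the token with a fallback to the `*` group; the names `GROUPS` and `select_rules` are hypothetical, the table is heavily abbreviated, and individual crawlers may apply more elaborate matching.

```python
# Abbreviated copy of the groups listed above: each key is the tuple of
# user-agent tokens sharing one rule set, each value its Disallow paths.
GROUPS = {
    ("*",): ["/api/"],  # only one of the '*' rules, for brevity
    ("googlebot-image",): ["/wiki/Archivo"],
    ("baiduspider", "baiduspider-video", "baiduspider-image"): ["/"],
    ("semrushbot",): ["/"],
}

def select_rules(product_token):
    """Return the Disallow paths for a crawler token.

    Assumption: tokens are compared case-insensitively and must match
    exactly; crawlers with no matching group fall back to '*'.
    """
    token = product_token.lower()
    for agents, rules in GROUPS.items():
        if token in agents:
            return rules
    return GROUPS[("*",)]

if __name__ == "__main__":
    print(select_rules("Googlebot-Image"))
    print(select_rules("BaiduSpider-Video"))
    print(select_rules("googlebot"))
```

Under these assumptions, a crawler named in its own group follows only that group's rules, while every unlisted crawler is governed by the `*` group.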

Other Records

Field Value
sitemap https://www.wikidex.net/sitemap/sitemap-index-wikidexwiki.xml

Comments

  • Prevent Google Images from trying to index pages that are not images
  • A capture bot, downloads gazillions of pages with no public benefit
  • http://www.webreaper.net/
  • Yandex is a Russian search engine; I really don't think anything useful will come from there
  • China
  • Korea
  • Japan
  • Vietnam
  • Someone tried to download the entire wiki with HTTrack...
  • Marketing bots

Warnings

  • 2 invalid lines.