del-2.org
robots.txt

Robots Exclusion Standard data for del-2.org

Resource Scan

Scan Details

Site Domain del-2.org
Base Domain del-2.org
Scan Status Ok
Last Scan2024-10-06T07:40:07+00:00
Next Scan 2024-10-13T07:40:07+00:00

Last Scan

Scanned2024-10-06T07:40:07+00:00
URL https://del-2.org/robots.txt
Domain IPs 167.235.36.80, 2a01:4f8:262:235e::2
Response IP 167.235.36.80
Found Yes
Hash 480994f8150403e6edd96ffea28c0e796fc946baa326257a3b007ce46e12803c
SimHash 921fd743ef58

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

proximic

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

turnitin robot

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

pi-monster

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

eccp/1.0 (search@eniro.com)

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandex

Rule Path Comment
Disallow / blocks access to whole site

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

gsa-crawler (enterprise; t4-knhh62cdkc2w3; gsa_manage@nikon-sys.co.jp)

Rule Path
Disallow /
Disallow /*/details%3Bjsessionid%3D*

megaindex.ru/2.0

Rule Path
Disallow /

grapeshotcrawler/2.0

Rule Path
Disallow

grapeshot

Rule Path
Disallow

icc-crawler/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

Comments

  • Block MJ12bot as it is just noise
  • Block Ahrefs
  • Block Sogou
  • Block SEOkicks
  • Block BlexBot
  • Block SISTRIX
  • Block Uptime robot
  • Block Ezooms Robot
  • Block Perl LWP
  • Block BlexBot
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • Block WiseGuys Robot
  • Block Turnitin Robot
  • Block Heritrix - archive.org
  • User-agent: Heritrix
  • Disallow: /
  • Block pricepi
  • Block Searchmetrics Bot
  • Block Eniro
  • Block YandexBot
  • Block Baidu
  • Block SoGou
  • Block Youdao
  • Block Nikon JP Crawler
  • Block PDP URL with JsessionID
  • Block MegaIndex.ru
  • DEAKTIVIERT wg. Urbanmedia Crawler

Warnings

  • 2 invalid lines.