nefmi.gov.hu
robots.txt

Robots Exclusion Standard data for nefmi.gov.hu

Resource Scan

Scan Details

Site Domain nefmi.gov.hu
Base Domain gov.hu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-08-22T05:01:33+00:00
Next Scan 2024-11-20T05:01:33+00:00

Last Successful Scan

Scanned2024-04-02T04:59:01+00:00
URL https://nefmi.gov.hu/robots.txt
Domain IPs 84.206.27.91
Response IP 84.206.27.91
Found Yes
Hash f48923e77c6f89464b0bca16753d7dc1f3fe109c4a8eb5c56faa8677d0367395
SimHash a45b4882cef5

Groups

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

yahoo!-adcrawler

Rule Path
Disallow /

*

Rule Path
Disallow /main.php/
Disallow /letolt/nemzet/naric/kozokttv_angol.pdf
Disallow /letolt/kozokt/erettsegi2005/tanaroknak/
Disallow /letolt/kozokt/erettsegi2005/tanaroknak_old/
Disallow /letolt/kozokt/erettsegi/tervezet/

Other Records

Field Value
crawl-delay 1

Comments

  • ---------------------------------------------------------------------
  • Webra3 default robots.txt v1.0 2009-03-25 BT
  • ---------------------------------------------------------------------
  • Alapertelmezett robots.txt a publikus W3 site-ok ala.
  • Elesiteskor atnevezendo robots.txt-re.
  • ---------------------------------------------------------------------
  • ---------------------------------------------------------------------
  • Disallow website copiers and unfriendly bots
  • ---------------------------------------------------------------------
  • ---------------------------------------------------------------------
  • Advertising related bots
  • ---------------------------------------------------------------------
  • ---------------------------------------------------------------------
  • Friendly, low-speed bots are welcome
  • ---------------------------------------------------------------------
  • ---------------------------------------------------------------------
  • Disallowed URL-s
  • ---------------------------------------------------------------------