netfan.pl
robots.txt

Robots Exclusion Standard data for netfan.pl

Resource Scan

Scan Details

Site Domain netfan.pl
Base Domain netfan.pl
Scan Status Ok
Last Scan2024-09-08T21:07:43+00:00
Next Scan 2024-10-08T21:07:43+00:00

Last Scan

Scanned2024-09-08T21:07:43+00:00
URL https://www.netfan.pl/robots.txt
Domain IPs 77.55.134.231
Response IP 77.55.134.231
Found Yes
Hash 8c55a17d6bcccfd542c1c80fa35f31a59a52b819b2403f3ef127931c9ad4e733
SimHash 4a5ad703e431

Groups

*

Rule Path
Disallow /admpanel/
Disallow /cron/
Disallow /download/
Disallow /errordoc/
Disallow /ftp/
Disallow /includes/
Disallow /shortpixel/
Disallow /templates/

Other Records

Field Value
crawl-delay 10

facebookexternalhit

Rule Path
Allow /

*

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Disallow /images/

a6-indexer

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

applebot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /

blackboard safeassign

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

semrushbot-seoab

Rule Path
Disallow /

semrushbot/6~bl

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

gsa-crawler (enterprise; t4-knhh62cdkc2w3; gsa_manage@nikon-sys.co.jp)

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

turnitin robot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

turnitin bot

Rule Path
Disallow /

turnitinbot/3.0 (http://www.turnitin.com/robot/crawlerinfo.html)

Rule Path
Disallow /

turnitinbot/3.0

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

pimonster

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

eccp/1.0 (search@eniro.com)

Rule Path
Disallow /

yandex

Rule Path
Disallow /

gptbot/1.2

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://netfan.pl/sitemap.xml

Comments

  • Directories
  • Block bots
  • www.sentibot.eu
  • RDH, 08.19.19: I really don't want to block Applebot, but for now, I am. It is crawling us too much
  • RDH, 05.13.20: I really don't want to block bing, but for now, I am. It is also already in htaccess rules
  • https://megaindex.com/crawler
  • Block SoGou
  • Block Youdao
  • Block Nikon JP Crawler
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • Block WiseGuys Robot
  • Block Turnitin Robot
  • Block Heritrix
  • Block pricepi
  • Block Searchmetrics Bot
  • Block Eniro
  • Block YandexBot
  • Block GPTBot/1.2
  • Block barkrowler
  • Amazonbot
  • DataForSeoBot