4x4parts.fi
robots.txt

Robots Exclusion Standard data for 4x4parts.fi

Resource Scan

Scan Details

Site Domain 4x4parts.fi
Base Domain 4x4parts.fi
Scan Status Ok
Last Scan2024-09-23T16:51:54+00:00
Next Scan 2024-10-23T16:51:54+00:00

Last Scan

Scanned2024-09-23T16:51:54+00:00
URL https://www.4x4parts.fi/robots.txt
Domain IPs 51.75.75.23
Response IP 51.75.75.23
Found Yes
Hash fa087858843751265a2d7860d804aaa039e8a88caa4784f5be958f29b9a286ae
SimHash db1ca11d64b2

Groups

cliqzbot/3.0

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

paracrawl

Rule Path
Disallow /

scrapy/1.5.0

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

velenpublicwebcrawler (velen.io)

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

semrushbot/2~bl

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

pcore-http

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

crawler.feedback+wc@gmail.com

Rule Path
Disallow /

cyotekwebcopy/1.0

Rule Path
Disallow /

centurybot9@gmail.com

Rule Path
Disallow /

crawler (crawler.feedback@gmail.com)

Rule Path
Disallow /

crawler

Rule Path
Disallow /

barkrowler/0.7 (+http://www.exensa.com/crawl)

Rule Path
Disallow /

go-http-client/1.1

Rule Path
Disallow /

test crawl

Rule Path
Disallow /

scalaj-http/1.0

Rule Path
Disallow /

bubing

Rule Path
Disallow /

wotbox/2.01

Rule Path
Disallow /

ccbot/2.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ebibot

Rule Path
Disallow /

pcore-http/v0.24.5

Rule Path
Disallow /

testitest1

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

istellabot/t.1

Rule Path
Disallow /

istellabot/t.1.13

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

booglebot2

Rule Path
Disallow /

booglebot

Rule Path
Disallow /

booglebot 2.0

Rule Path
Disallow /

booglebot/2.0

Rule Path
Disallow /

mj12bot/v1.0.5

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

influencebo

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

acoonbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

businessdbbot

Rule Path
Disallow /

superfeedr

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

flipboard

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

rogerbot/1.0

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

swebot

Rule Path
Disallow /

swebot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

www.80legs.com

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

comodospider

Rule Path
Disallow /

comodospider/nutch-1.2

Rule Path
Disallow /

daumoa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

beetlebot

Rule Path
Disallow /

niki-bot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

spbot

Rule Path
Disallow /

icarus6

Rule Path
Disallow /

icarus6

Rule Path
Disallow /

icarus

Rule Path
Disallow /

icarus

Rule Path
Disallow /

icarus6j

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

knelson

Rule Path
Disallow /

knelson/0.9

Rule Path
Disallow /

wotbox/2.01

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

*

Rule Path
Disallow /*orderby%3D
Disallow /*orderway%3D
Disallow /*tag%3D
Disallow /*id_currency%3D
Disallow /*search_query%3D
Disallow /*back%3D
Disallow /*n%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-opc
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow */classes/
Disallow */config/
Disallow */download/
Disallow */mails/
Disallow */modules/
Disallow */translations/
Disallow */tools/
Disallow /*fi/password-recovery
Disallow /*fi/address
Disallow /*fi/addresses
Disallow /*fi/login
Disallow /*fi/cart
Disallow /*fi/discount
Disallow /*fi/order-history
Disallow /*fi/identity
Disallow /*fi/my-account
Disallow /*fi/order-follow
Disallow /*fi/order-slip
Disallow /*fi/order
Disallow /*fi/search
Disallow /*fi/quick-order
Disallow /*fi/guest-tracking
Disallow /*fi/order-confirmation

Comments

  • robots.txt automaticaly generated by PrestaShop e-commerce open-source solution
  • http://www.prestashop.com - http://www.prestashop.com/forums
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • Private pages
  • Directories
  • Files

Warnings

  • 10 invalid lines.