peru-agricola.com
robots.txt

Robots Exclusion Standard data for peru-agricola.com

Resource Scan

Scan Details

Site Domain peru-agricola.com
Base Domain peru-agricola.com
Scan Status Ok
Last Scan 2024-11-19T21:50:37+00:00
Next Scan 2024-11-26T21:50:37+00:00

Last Scan

Scanned 2024-11-19T21:50:37+00:00
URL https://peru-agricola.com/robots.txt
Redirect https://www.peru-agricola.com/robots.txt
Redirect Domain www.peru-agricola.com
Redirect Base peru-agricola.com
Domain IPs 104.21.40.13, 172.67.173.218, 2606:4700:3031::6815:280d, 2606:4700:3034::ac43:adda
Redirect IPs 104.21.40.13, 172.67.173.218, 2606:4700:3031::6815:280d, 2606:4700:3034::ac43:adda
Response IP 104.21.40.13
Found Yes
Hash 2dfee9895b5432c6155c1691e50be20762ba99dba9e1f3cc1a581e444c4866a3
SimHash 73b25264c90f

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
semrushbot-seoab
semrushbot

Rule Path
Disallow /

baiduspider
baiduspider-image
baiduspider-video
baiduspider-news
baiduspider-favo
baiduspider-cpro
baiduspider-ads

Rule Path
Disallow /

ahrefsbot
alexibot
amazonbot
apocalxexplorerbot
asterias
backdoorbot/1.0
backlinkcrawler
bizinformation
black hole
blackbird
blowfish/1.0
botalot
builtbottough
bullseye/1.0
bunnyslippers
ccbot
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
disco pump 3.1
dittospyder
emailcollector
emailsiphon
emailwolf
erocrawler
extractorpro
flamingo_searchengine
foobot
geedobot
grapeshot
harvest/1.5
hloader
httplib
httrack
httrack 3.0
humanlinks
igentia
imagesiftbot
infonavirobot
jennybot
kenjin spider
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
mlbot
moget
moget/2.1
naverbot
netants
netattache
netattache light 1.1
netmechanic
nicerspro
niki-bot
offline explorer
openfind
openfind data gathere
propowerbot/2.14
proximic
prowebwalker
psbot
quepasacreep
queryn metasearch
repomonkey
repomonkey bait & tackle/v1.01
rma
seekportbot
sightupbot
sitebot
sitesnagger
sogou web spider
sosospider
spankbot
spanner
speedy
suggybot
superbot
superbot/2.6
sputnikbot
suzuran
szukacz/1.4
teleport
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
urlpouls
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webcopy
webenhancer
webmasterworldforumbot
webmirror
webreaper
websauger
website extractor
website quester
webster pro
webstripper
webstripper/2.02
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
wikiofeedbot
wikiwix-bot-3.0
winhttrack
www-collector-e
xenu's
xenu's link sleuth 1.1c
yrspider
zealbot
zeus

Rule Path
Disallow /

*

Rule Path
Disallow /buyers.js
Disallow /ad/overview/
Disallow /ad/compare
Disallow /sheet-stats/
Disallow /ad-book/
Disallow /bookmark/
Disallow /flags
Disallow /getfeaturedads/
Disallow /account
Disallow /account/
Disallow /criteo
Disallow /facebooklogin
Disallow /facebookUserData
Disallow /googleLogin
Disallow /linkedinLogin
Disallow /contact/autocompleteContactForm
Disallow /*?*dir=
Disallow /translate
Disallow /translate/check
Disallow /contact/*/callme
Disallow /contact/*/ask-rdv
Disallow /contact/v2/send.html
Disallow /feed/backpage
Disallow /*?*pr=
Disallow /*?*ph=
Disallow /*?*vi=
Disallow /*?*q=
Disallow /search/similars
Disallow /*?*multi_svar=
Disallow /*?*hay=
Disallow /*?*multi_brd=
Disallow /*?*multi_tpe=
Disallow /*?*crn=
Disallow /*?*ob=
Disallow /*?*dir=
Disallow /*?*lw=
Disallow /*?*fam=
Disallow /*?*brd=
Disallow /*?*ctr=
Disallow /*?*cat=
Disallow /*?*rct=
Disallow /*?*var=
Disallow /*?*rgn=
Disallow /*?*dpt=
Disallow /*?*wrrt=
Disallow /*?*wd=
Disallow /*?*transmi=
Disallow /*?*mt=
Disallow /*?*svar=
Disallow /*?*tpe=
Disallow /*?*st=
Disallow /*?*register=
Disallow /*?*requestUri=
Disallow /*?*multi_svar=
Disallow /redirect-to-auction*
Disallow /*/*-search-geo-brand
Disallow /*/*-search-geo-category
Disallow /bookmarksAds/*
Disallow /*?type_chariot=*
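
Many of the paths above use the `*` wildcard, which the standard library's urllib.robotparser does not interpret. The following is a minimal sketch of wildcard matching as described in RFC 9309, assuming that only Disallow rules are present (as in this group), so no Allow/Disallow precedence is needed. The names `rule_to_regex`, `is_disallowed`, and the sample URLs are hypothetical; `RULES` is a small subset of the list above.

```python
import re

# A few of the Disallow paths listed for the "*" group (subset, for illustration).
RULES = [
    "/account",
    "/*?*dir=",
    "/contact/*/callme",
    "/redirect-to-auction*",
]

def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end of
    the URL. Everything else is matched literally, from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path, rules=RULES):
    """Return True if any Disallow pattern matches the path (with query string)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

print(is_disallowed("/tractors?page=2&dir=asc"))  # matched by /*?*dir=
print(is_disallowed("/contact/123/callme"))       # matched by /contact/*/callme
print(is_disallowed("/tractors"))                 # no pattern matches
```

Because rules are prefix matches, a trailing `*` such as the one in `/redirect-to-auction*` does not change what is matched; the sketch treats it the same as the bare prefix.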

linguee bot

No rules defined. All paths allowed.
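
How a crawler picks one of the groups above can be sketched as follows. This is a simplified illustration, not the scanner's own logic: it assumes case-insensitive substring matching of the group name against the User-Agent string, with the longest matching name winning and `*` as the fallback. `GROUPS` is a hand-built subset of the groups in this report, and `rules_for` and the sample User-Agent strings are hypothetical.

```python
# Subset of the groups reported above: group name -> Disallow paths.
GROUPS = {
    "ahrefsbot": ["/"],
    "semrushbot": ["/"],
    "linguee bot": [],            # group present but no rules: all paths allowed
    "*": ["/account", "/translate"],
}

def rules_for(user_agent):
    """Return the Disallow list of the most specific group for this user agent."""
    ua = user_agent.lower()
    matches = [name for name in GROUPS if name != "*" and name in ua]
    if matches:
        # Prefer the longest (most specific) matching group name.
        return GROUPS[max(matches, key=len)]
    return GROUPS["*"]

print(rules_for("Mozilla/5.0 (compatible; AhrefsBot/7.0)"))  # ['/']
print(rules_for("Linguee Bot (http://www.linguee.com/bot)"))  # []
print(rules_for("ExampleBot/1.0"))                            # ['/account', '/translate']
```

A named group with an empty rule list therefore overrides the `*` group for that crawler, which is why "linguee bot" is reported as having all paths allowed.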

Other Records

Field Value
crawl-delay 5
sitemap https://www.peru-agricola.com/sitemap.xml
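
The crawl-delay and sitemap records can be read with Python's standard urllib.robotparser. The sketch below parses a small hand-written fragment rather than the live file. Placing Crawl-delay inside the `*` group is an assumption made for illustration, since this report does not show where the record sits in the original file, and the user-agent name "examplebot" is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hand-written fragment; the position of Crawl-delay is an assumption.
ROBOTS_FRAGMENT = """\
User-agent: *
Crawl-delay: 5
Disallow: /account

Sitemap: https://www.peru-agricola.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

delay = parser.crawl_delay("examplebot")   # seconds to wait between requests
sitemaps = parser.site_maps()              # list of sitemap URLs (Python 3.8+)
allowed = parser.can_fetch("examplebot", "https://www.peru-agricola.com/account")

print(delay, sitemaps, allowed)
```

Note that urllib.robotparser performs plain prefix matching and does not expand `*` in paths, so it is suitable for reading these records but not for evaluating the wildcard rules in the `*` group.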

Warnings

  • 2 invalid lines.