bugnard.ch
robots.txt
Robots Exclusion Standard data for bugnard.ch
Resource Scan
Scan Details
Site Domain | bugnard.ch |
Base Domain | bugnard.ch |
Scan Status | Ok |
Last Scan | 2024-11-06T06:01:32+00:00 |
Next Scan | 2024-12-06T06:01:32+00:00 |
Last Scan
Scanned | 2024-11-06T06:01:32+00:00 |
URL | https://bugnard.ch/robots.txt |
Redirect | https://www.bugnard.ch/robots.txt |
Redirect Domain | www.bugnard.ch |
Redirect Base | bugnard.ch |
Domain IPs | 193.246.248.205 |
Redirect IPs | 193.246.248.205 |
Response IP | 193.246.248.205 |
Found | Yes |
Hash | 41a353b9f00fffa04ae7d18df7587cc3aa0609a61cfbb903ae51d293df032f72 |
SimHash | b9444510448f |
Groups
archive.org_bot
arquivo-web-crawler
heritrix
ia_archiver
ia_archiver-web.archive.org
nicecrawler
Rule | Path |
---|---|
Disallow | / |
ahrefsbot
semrushbot
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
splitsignalbot
semrushbot-coub
barkrowler
blexbot
brightedge crawler
cocolyzebot
dataforseobot
domainstatsbot
dotbot
hypestat
linkdexbot
mj12bot
online-webceo-bot
screaming frog seo spider
senutobot
seobilitybot
seokicks
seolizer
serpstatbot
sitecheckerbotcrawler
zoombot
Rule | Path |
---|---|
Disallow | / |
a patent crawler
adstxtcrawler
awesomecrawler
backlinkcrawler
birdcrawlerbot
cispa webcrawler
companybook-crawler
content crawler spider
converacrawler
crawler4j
crawlyprojectcrawler
deepcrawl
domaincrawler
e.ventures investment crawler
erocrawler
fast enterprise crawler
fast-webcrawler
fr-crawler
garlikcrawler
gingercrawler
gluten free crawler
grapeshotcrawler
gsa-crawler
ias crawler
icc-crawler
ip-web-crawler.com
it2media-domain-crawler
kbcrawl
lssrocketcrawler
mbcrawler
minicrawler
msiecrawler
netestate ne crawler
neticle crawler
nimblecrawler
peer39_crawler
rukicrawler
safesearch microdata crawler
seekport crawler
simplecrawler
sistrix crawler
tocrawl
tombapublicwebcrawler
trendkite-akashic-crawler
ubicrawler
tombapublicwebcrawler
usinenouvellecrawler
webcompanycrawler
Rule | Path |
---|---|
Disallow | / |
url_spider_pro
toutiaospider
sosospider
sogou spider2
sogou inst spider
lnspiderguy
linespider
lb-spider
landau-media-spider
kenjin spider
k2spider
jikespider
jamie's spider
gnam gnam spider
baiduspider-video
baiduspider-news
baiduspider-image
etaospider
sogou spider
Rule | Path |
---|---|
Disallow | / |
bazqux
bitlybot
bublupbot
embedly
flipboardproxy
freshrss
friendica
hatena
iframely
inoreader
mail.ru_bot
miniflux
newsblur
nextcloud
pocketparser
serendeputybot
simplepie
slackbot-linkexpanding
snap url preview service
startmebot
superfeedr
surdotlybot
synapse
tiny tiny rss
vkshare
Rule | Path |
---|---|
Disallow | / |
dataprovider.com
dcrawl
httrack
httrack 3.0
metainspector
newspaper
nutch
offline explorer
openindexspider
scrapy
Rule | Path |
---|---|
Disallow | / |
adbeat_bot
aihitbot
anderspinkbot
archivebot
awariobot
awariosmartbot
bitsightbot
blackboard
brandverity
cincraw
ev-crawler
hubspot
imagesiftbot
ioncrawl
jugendschutzprogramm-crawler
kstandbot
lightspeedsystemscrawler
linkfluence
linkwalker
magpie-crawler
mediatoolkitbot
muckrack
netcraftsurveyagent
netvibes
pandalytics
panscient.com
proximic
scoop.it
seekportbot
smtbot
trendictionbot
trendsmapresolver
turnitin
turnitinbot
tweetmemebot
twingly
um-ln
velenpublicwebcrawler
zoominfobot
Rule | Path |
---|---|
Disallow | / |
eventmachine httpclient
niki-bot
istellabot
go-http-client/1.1
adscanner
ioncrawl
geedoproductsearch
pagepeeker
uptimebot.org
uptimerobot
coccoc
coccocbot
coccocbot-web
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /api |
Disallow | /fr/commande |
Disallow | /de/bestellung |
Other Records
Field | Value |
---|---|
sitemap | https://www.bugnard.ch/sitemap.xml |
Warnings
- 4 invalid lines.
Comments