mapcarta.com
robots.txt

Robots Exclusion Standard data for mapcarta.com

Resource Scan

Scan Details

Site Domain mapcarta.com
Base Domain mapcarta.com
Scan Status Ok
Last Scan2024-05-25T01:12:03+00:00
Next Scan 2024-06-01T01:12:03+00:00

Last Scan

Scanned2024-05-25T01:12:03+00:00
URL https://mapcarta.com/robots.txt
Domain IPs 104.26.4.19, 104.26.5.19, 172.67.69.76, 2606:4700:20::681a:413, 2606:4700:20::681a:513, 2606:4700:20::ac43:454c
Response IP 104.26.5.19
Found Yes
Hash a63d6b8464d2f216af20cf11ea624c449f0a77009d55496ef2bcee0ca981cb3d
SimHash 630767712ba7

Groups

admantx
aibot
alittle client
aspseek
abonti
aboundex
aboundexbot
acunetix
adstxtcrawlertp
afd-verbotsverfahren
ahrefsbot
aihitbot
aipbot
alexibot
allsubmitter
alligator
alphabot
anarchie
anarchy
anarchy99
ankit
anthill
apexoo
aspiegel
asterias
atomseobot
attach
awariorssbot
awariosmartbot
bbbike
bdcbot
bdfetch
blexbot
backdoorbot
backstreet
backweb
backlink-ceck
backlinkcrawler
badass
bandit
barkrowler
batchftp
battleztar bazinga
betabot
bigfoot
bitacle
blackwidow
black hole
blackboard
blow
blowfish
boardreader
bolt
botalot
brandprotect
brandwatch
buck
buddy
builtbottough
builtwith
bullseye
bunnyslippers
buzzsumo
bytespider
catexplorador
ccbot
code87
cshttp
calculon
cazoodlebot
cegbfeieh
censysinspect
cheteam
cheesebot
cherrypicker
chinaclaw
chlooe
citoid
claritybot
cliqzbot
cloud mapping
cocolyzebot
cogentbot
collector
copier
copyrightcheck
copyscape
cosmos
craftbot
crawling at home project
crazywebcrawler
crescent
crunchbot
curious
custo
cyotekwebcopy
dblbot
diibot
dsearch
dts agent
datacha0s
databasedrivermysqli
demon
deusu
devil
digincore
digitalpebble
dirbuster
disco
discobot
discoverybot
dispatch
dittospyder
dnbcrawler-analytics
dnyzbot
domcopbot
domainappender
domaincrawler
domainsigmacrawler
domainstatsbot
domains project
dotbot
download wonder
dragonfly
drip
eccp/1.0
email siphon
email wolf
easydl
ebingbong
ecxi
eirgrabber
erocrawler
evil
exabot
express webpictures
extlinksbot
extractor
extractorpro
extreme picture finder
eyenetie
ezooms
fdm
fhscan
femtosearchbot
fimap
firefox/7.0
flashget
flunky
foobot
freeuploader
frontpage
fuzz
fyberspider
fyrebot
g-i-g-a-b-o-t
gt::www
galaxybot
genieo
germcrawler
getright
getweb
getintent
gigabot
go!zilla
go-ahead-got-it
gozilla
gotit
grabnet
grabber
grafula
grapefx
grapeshotcrawler
gridbot
headmasterseo
hmview
htmlparser
http::lite
httrack
haansoft
haosouspider
harvest
havij
heritrix
hloader
honolulubot
humanlinks
hybridbot
idbte4m
idbot
irlbot
iblog
id-search
ilsebot
image fetch
image sucker
indeedbot
indy library
infonavirobot
infotekies
intelliseek
interget
internetseer
internet ninja
iria
iskanie
istellabot
joc web spider
jamesbot
jbrofuzz
jennybot
jetcar
jetty
jikespider
joomla
jorgee
justview
jyxobot
kenjin spider
keybot translation-search-machine
keyword density
kinza
kozmosbot
lnspiderguy
lwp::simple
lanshanbot
larbin
leap
leechftp
leechget
lexibot
lftp
libweb
libwhisker
liebaofast
lightspeedsystems
likse
linkscan
linkwalker
linkbot
linkextractorpro
linkpadbot
linksmanager
linqiametadatadownloaderbot
linqiarssbot
linqiascrapebot
lipperhey
lipperhey spider
litemage_walker
lmspider
ltx71
mfc_tear_sample
midown tool
miixpc
mj12bot
mqqbrowser
msfrontpage
msiecrawler
mtrobot
mag-net
magnet
mail.ru_bot
majestic-seo
majestic12
majestic seo
markmonitor
markwatch
mass downloader
masscan
mata hari
mauibot
mb2345browser
meanpath bot
meanpathbot
mediatoolkitbot
megaindex.ru
metauri
micromessenger
microsoft data access
microsoft url control
minefield
mister pix
moblie safari
mojeek
mojolicious
molokaibot
morfeus fucking scanner
mozlila
mr.4x3
msrabot
musobot
nicerspro
npbot
name intelligence
nameprotect
navroad
nearsite
needle
nessus
netants
netlyzer
netmechanic
netspider
netzip
net vampire
netcraft
nettrack
netvibes
nextgensearchbot
nibbler
niki-bot
nikto
nimblecrawler
nimbostratus
ninja
nmap
nuclei
nutch
octopus
offline explorer
offline navigator
oncrawl
openlinkprofiler
openvas
openfind
openvas
orangebot
orangespider
outclicksbot
outfoxbot
pecl::http
phpcrawl
poe-component-client-http
pageanalyzer
pagegrabber
pagescorer
pagething.com
page analyzer
pandalytics
panscient
papa foto
pavuk
peoplepal
petalbot
pi-monster
picscout
picsearch
picturefinder
piepmatz
pimonster
pixray
pleasecrawl
pockey
propowerbot
prowebwalker
probethenet
proximic
psbot
pu_in
pump
pxbroker
pycurl
queryn metasearch
quick-crawler
rssingbot
rainbot
rankactive
rankactivelinkbot
rankflex
rankingbot
rankingbot2
rankivabot
rankurbot
re-re
reget
realdownload
reaper
rebelmouse
recorder
redesscrapy
repomonkey
ripper
rocketcrawler
rogerbot
sbider
seokicks
seokicks-robot
seolyticscrawler
seoprofiler
seostats
sistrix
smtbot
salesintelligent
scanalert
scanbot
scoutjet
scrapy
screaming
screenerbot
screpybot
searchestate
searchmetricsbot
seekport
seekportbot
semanticjuice
semrush
semrushbot
sentibot
senutobot
seositecheckup
seobilitybot
seomoz
shodan
siphon
sitecheckerbotcrawler
siteexplorer
sitelockspider
sitesnagger
sitesucker
site sucker
sitebeam
siteimprove
sitevigil
slysearch
smartdownload
snake
snapbot
snoopy
socialrankiobot
sociscraper
sogou web spider
sosospider
sottopop
spacebison
spammen
spankbot
spanner
spbot
spinn3r
sputnikbot
sqlmap
sqlworm
sqworm
steeler
stripper
sucker
sucuri
superbot
superhttp
surfbot
surveybot
suzuran
swiftbot
szukacz
t0phackteam
t8abot
teleport
teleportpro
telesoft
telesphoreo
telesphorep
thenomad
the intraformant
thumbor
tighttwatbot
tinytestbot
titan
toata
toweyabot
tracemyfile
trendiction
trendictionbot
true_robot
turingos
turnitin
turnitinbot
twengabot
twice
typhoeus
urly.warning
urly warning
unisterbot
upflow
v-bot
vb project
vci
vacuum
vagabondo
velenpublicwebcrawler
vericitecrawler
vidiblescraper
virusdie
voideye
voil
voltron
wasalive-bot
wbsearchbot
webdav
wisenutbot
wpscan
www-collector-e
www-mechanize
www::mechanize
wwwoffle
wallpapers
wallpapers/3.0
wallpapershd
wesee
webauto
webbandit
webcollage
webcopier
webenhancer
webfetch
webfuck
webgo is
webimagecollector
webleacher
webpix
webreaper
websauger
webstripper
websucker
webwhacker
webzip
web auto
web collage
web enhancer
web fetch
web fuck
web pix
web sauger
web sucker
webalta
webmasterworldforumbot
webshag
websiteextractor
websitequester
website quester
webster
whack
whacker
whatweb
who.is bot
widow
winhttrack
wiseguys robot
wonderbot
woobot
wotbox
wprecon
xaldon webspider
xaldon_webspider
xenu
youdaobot
zade
zauba
zermelo
zeus
zitebot
zmeu
zoombot
zoominfobot
zumbot
zyborg
adscanner
archive.org_bot
arquivo-web-crawler
arquivo.pt
autoemailspider
backlink-check
cah.io.community
check1.exe
clark-crawler
coccocbot
cognitiveseo
com.plumanalytics
crawl.sogou.com
crawler.feedback
crawler4j
dataforseo.com
dataforseobot
demandbase-bot
domainsproject.org
ecatch
evc-batch
facebookscraper
gopher
heritrix
instabid
internetvista monitor
ips-agent
isitwp.com
iubenda-radar
linkdexbot
lwp-request
lwp-trivial
magpie-crawler
meanpathbot
mediawords
muhstik-scan
netestate ne crawler
obot
omgili
page scorer
pcbrowser
plumanalytics
polaris version
probe-image-size
ripz
s1z.ru
satoristudio.net
scalaj-http
scan.lol
seobility
seocompany.store
seoscanners
seostar
serpstatbot
sexsearcher
sitechecker.pro
siteripz
sogouspider
sp_auditbot
spyfu
sysscan
takeout
trendiction.com
trendiction.de
ubermetrics-technologies.com
voyagerx.com
webgains-bot
webmeup-crawler
webpros.com
webprosbot
x09mozilla
x22mozilla
xpymep1.exe
zauba.io
zgrab
bytedance

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /s/
Disallow /de/s/
Disallow /es/s/
Disallow /fr/s/
Disallow /pt/s/
Disallow /dynamic/routing.php
Disallow /dynamic/search.php

Other Records

Field Value
sitemap https://mapcarta.com/sitemap.xml

Comments

  • based on "The Ultimate robots.txt Bot and User-Agent Blocker"
  • copyright, MIT License: https://github.com/mitchellkrogza/nginx-ultimate-bad-bot-blocker

Warnings

  • 5 invalid lines.