Robots Exclusion Standard data for prome.pt

Resource Scan

Scan Details

Site Domain prome.pt
Base Domain prome.pt
Scan Status Ok
Last Scan 2024-11-02T23:51:07+00:00
Next Scan 2024-12-02T23:51:07+00:00

Last Scan

Scanned 2024-11-02T23:51:07+00:00
URL https://prome.pt/robots.txt
Domain IPs 62.28.222.137
Response IP 62.28.222.137
Found Yes
Hash 03ae844ce56422d61d5729b6e1ba0739c6035b29c9aa4dabe373b3d6bbd964b0
SimHash 7af42840a862

Groups

*

Rule Path
Disallow /admin/
Disallow /bizizi1/
Disallow /cgi-bin/
Disallow /img_upload/
Disallow /utils/
Disallow /sellers/
Disallow /old/
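
The wildcard group above can be checked with Python's standard urllib.robotparser. The following is a minimal sketch, not the scanner's own method: the ROBOTS_LINES text is a hand-written reconstruction of the "*" group plus one of the per-bot groups listed further down (semrushbot), and the helper name is_allowed and the sample paths are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hand-written reconstruction (an assumption) of two groups from this report:
# the "*" group and the semrushbot group.
ROBOTS_LINES = """\
User-agent: *
Disallow: /admin/
Disallow: /bizizi1/
Disallow: /cgi-bin/
Disallow: /img_upload/
Disallow: /utils/
Disallow: /sellers/
Disallow: /old/

User-agent: semrushbot
Disallow: /
""".splitlines()


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path under the rules above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines, no network access
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    # "ExampleBrowser" and the paths are hypothetical values for illustration.
    for agent, path in [
        ("ExampleBrowser", "/admin/login"),
        ("ExampleBrowser", "/products"),
        ("SemrushBot", "/products"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Agents without a group of their own fall back to the "*" group, so only the seven listed path prefixes are closed to them, while a named bot with "Disallow /" is shut out of the whole site.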

semrushbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

evilbothere

Rule Path
Disallow /

spamspewer

Rule Path
Disallow /

secretagentagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

sistrix
sistrix crawler
sistrix

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

dotbot
dotbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

obot

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

cloudservermarketspider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

linkstats

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

plista

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

sg-orbiter

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

kraken

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

openhosebot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

thumbsniper

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

implisensebot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

python/3.5 aiohttp

Rule Path
Disallow /

toweya.com

Rule Path
Disallow /

netestate

Rule Path
Disallow /

bubing

Rule Path
Disallow /

linguee

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

sentibot
sentibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

indeedbot

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

gosign-security-crawler

Rule Path
Disallow /

siteliner

Rule Path
Disallow /

sabsimbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

sidetrade

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

inetdex-bot

Rule Path
Disallow /

checkmarknetwork

Rule Path
Disallow /

startmebot

Rule Path
Disallow /

cincraw

Rule Path
Disallow /

df
df bot

Rule Path
Disallow /

i-market-bot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

t3versionsbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

buck

Rule Path
Disallow /

yak

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

xing bot
xing

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

serendeputybot

Rule Path
Disallow /

ws-bot-v1

Rule Path
Disallow /

bytespider

Rule Path
Disallow /
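
Several user agents in the list above (for example semrushbot, ahrefsbot, ccbot, gptbot and bytespider) appear in more than one group. Each repeated group carries the same "Disallow /" rule, so the repetition does not appear to change the outcome. The sketch below shows one way to spot such repeats in raw robots.txt text; the function name duplicate_agents and the SAMPLE excerpt are assumptions for illustration, not part of the scan.

```python
from collections import Counter


def duplicate_agents(robots_text: str) -> dict[str, int]:
    """Count User-agent lines per agent and return those seen more than once."""
    counts: Counter[str] = Counter()
    for raw in robots_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "user-agent":
            counts[value.strip().lower()] += 1
    return {agent: n for agent, n in counts.items() if n > 1}


# Hypothetical excerpt modelled on the groups listed in this report.
SAMPLE = """\
User-agent: semrushbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: SemrushBot
Disallow: /
"""

if __name__ == "__main__":
    print(duplicate_agents(SAMPLE))
```

Agent names are lowercased before counting because user-agent matching in robots.txt is case-insensitive.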

Comments

  • https://babbar.tech/crawler
  • Disallow: Sistrix
  • Disallow: Sistrix Searchengine
  • Disallow: SEOkicks-Robot
  • Disallow: jobs.de-Robot
  • Backlink Analysis
  • Bot der Leipziger Unister Holding GmbH
  • http://www.opensiteexplorer.org/dotbot
  • http://www.searchmetrics.com
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://www.domaintools.com/webmasters/surveybot.php
  • http://www.seodiver.com/bot
  • http://openlinkprofiler.org/bot
  • http://www.wotbox.com/bot/
  • http://www.meanpath.com/meanpathbot.html
  • http://www.backlinktest.com/crawler.html
  • http://www.brandwatch.com/magpie-crawler/
  • http://filterdb.iss.net/crawler/
  • http://webmeup-crawler.com
  • https://megaindex.com/crawler
  • http://www.cloudservermarket.com
  • http://www.trendiction.de/de/publisher/bot
  • http://www.exalead.com
  • http://www.career-x.de/bot.html
  • https://www.lipperhey.com/en/about/
  • https://www.lipperhey.com/en/about/
  • https://turnitin.com/robot/crawlerinfo.html
  • http://help.coccoc.com/
  • ubermetrics-technologies.com
  • datenbutler.de
  • http://searchgears.de/uber-uns/crawling-faq.html
  • http://commoncrawl.org/faq/
  • https://www.qwant.com/
  • http://linkfluence.net/
  • http://www.botje.com/plukkie.htm
  • https://www.safedns.com/searchbot
  • http://www.haosou.com/help/help_3_2.html
  • http://www.haosou.com/help/help_3_2.html
  • http://www.moz.com/dp/rogerbot
  • http://www.openhose.org/bot.html
  • http://www.screamingfrog.co.uk/seo-spider/
  • http://thumbsniper.com
  • http://www.radian6.com/crawler
  • http://cliqz.com/company/cliqzbot
  • https://www.aihitdata.com/about
  • http://www.trendiction.com/en/publisher/bot
  • http://seocompany.store
  • https://github.com/yasserg/crawler4j/
  • http://warebay.com/bot.html
  • http://www.website-datenbank.de/
  • http://law.di.unimi.it/BUbiNG.html
  • http://www.linguee.com/bot; bot@linguee.com
  • https://www.semrush.com/bot/
  • www.sentibot.eu
  • http://velen.io
  • https://moz.com/help/guides/moz-procedures/what-is-rogerbot
  • http://www.garlik.com
  • https://www.gosign.de/typo3-extension/typo3-sicherheitsmonitor/
  • http://www.siteliner.com/bot
  • https://sabsim.com
  • http://ltx71.com/
  • https://aspiegel.com/petalbot
  • https://seostar.co/robot/
  • https://dataforseo.com/dataforseo-bot
  • https://domainstats.com/pages/our-bot
  • https://inetdex.com/
  • http://www.checkmarknetwork.com/spider.html
  • https://start.me/bot
  • http://cincrawdata.net/bot/
  • https://babbar.tech/crawler
  • https://www.t3versions.com/bot
  • https://serpstatbot.com
  • https://app.hypefactors.com/media-monitoring/about.html
  • http://2ip.io
  • http://linkfluence.com/
  • https://www.trendiction.com/bot
  • https://platform.openai.com/docs/gptbot
  • https://blog.google/technology/ai/an-update-on-web-publisher-controls/
  • https://darkvisitors.com/agents/google-extended
  • https://darkvisitors.com/agents/cohere-ai
  • https://darkvisitors.com/agents/anthropic-ai
  • https://darkvisitors.com/agents/ccbot
  • https://darkvisitors.com/agents/facebookbot
  • https://darkvisitors.com/agents/gptbot
  • https://darkvisitors.com/agents/omgilibot
  • http://serendeputy.com/about/serendeputy-bot
  • ByteDance: Duobao

Warnings

  • 5 invalid lines.
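
The report does not say which five lines were rejected or what rule the scanner applied. As a rough sketch under stated assumptions, the check below treats a line as invalid when it is neither blank, nor a comment, nor a "field: value" pair with a commonly used field name; the KNOWN_FIELDS set, the function name invalid_lines and the SAMPLE text are all assumptions for illustration.

```python
# Field names commonly seen in robots.txt files (an assumed list, not the
# scanner's actual criteria).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def invalid_lines(robots_text: str) -> list[tuple[int, str]]:
    """Return (line number, raw line) pairs that do not look like valid records."""
    bad = []
    for number, raw in enumerate(robots_text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # ignore comments
        if not line:
            continue  # blank or comment-only lines are fine
        field, sep, _value = line.partition(":")
        if not sep or field.strip().lower() not in KNOWN_FIELDS:
            bad.append((number, raw))
    return bad


# Hypothetical input; these are not the actual invalid lines from prome.pt.
SAMPLE = """\
User-agent: *
Disallow: /admin/
this is not a rule
Disalow: /old/
# a comment
"""

if __name__ == "__main__":
    for number, raw in invalid_lines(SAMPLE):
        print(number, repr(raw))
```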