cloudtoc.net
robots.txt

Robots Exclusion Standard data for cloudtoc.net

Resource Scan

Scan Details

Site Domain cloudtoc.net
Base Domain cloudtoc.net
Scan Status Ok
Last Scan2025-08-21T12:53:44+00:00
Next Scan 2025-08-28T12:53:44+00:00

Last Scan

Scanned2025-08-21T12:53:44+00:00
URL https://cloudtoc.net/robots.txt
Redirect https://www.bloomberg.com/robots.txt
Redirect Domain www.bloomberg.com
Redirect Base bloomberg.com
Domain IPs 15.197.146.156, 3.33.146.110
Redirect IPs 151.101.1.73, 151.101.129.73, 151.101.193.73, 151.101.65.73
Response IP 199.232.45.73
Found Yes
Hash a055cb29de1f3015475b314ba8edb2f99f270acc9cb15ce93a8998bee9829ea4
SimHash b7375921f570

Groups

*

Rule Path
Disallow /polska
Allow /account/newsletters
Disallow /account/*
Disallow /tosv*.html
Disallow /search
Disallow /company/search/
Disallow /professional/search/
Disallow /impact/search/
Disallow /ux/search/
Disallow /wnwi/search/
Disallow /gei/search/
Disallow /impact/search/
Disallow /netzeropathfinders/search/
Disallow /notices/search/
Disallow /distribution/search/
Disallow /ukinnovators/search/
Disallow /latam/search/
Disallow /faq/search/
Disallow /tc/search/
Disallow /subscriptions/group/manage/
Disallow /preview/lineup
Disallow /preview/articles
Disallow /explore/
Disallow /press-releases/
Disallow /artemis/
Disallow /sessions-publisher/

google-extended

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow /about/careers
Disallow /about/careers/
Disallow /offlinemessage/
Disallow /apps/fbk
Disallow /bb/newsarchive/
Disallow /apps/news

spinn3r

Rule Path
Disallow /podcasts/
Disallow /feed/podcast/
Disallow /bb/avfile/

googlebot-news

Rule Path
Disallow /sponsor/
Disallow /news/sponsors/*
Disallow /news/terminal/*

twitterbot

Rule Path
Allow /en/news/thp

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

python-http-client

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

google-apps

Rule Path
Disallow /

google-apps-script

Rule Path
Disallow /

appengine-google

Rule Path
Disallow /

google-cloud

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

amazonadbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

feedly

Rule Path
Disallow /

feedlybot

Rule Path
Disallow /

feedlyapp

Rule Path
Disallow /

mwfeedparser

Rule Path
Disallow /

comscore

Rule Path
Disallow /

comscore

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

criteo

Rule Path
Disallow /

criteo-bot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

ntent

Rule Path
Disallow /

anderspinkbot

Rule Path
Disallow /

indeedbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

toutiaospider

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

prtg

Rule Path
Disallow /

freshpingbot

Rule Path
Disallow /

panopta

Rule Path
Disallow /

datadogsynthetics

Rule Path
Disallow /

rackspace

Rule Path
Disallow /

censys

Rule Path
Disallow /

burp

Rule Path
Disallow /

burp

Rule Path
Disallow /

check_http

Rule Path
Disallow /

dotcommonitor

Rule Path
Disallow /

watchsumo

Rule Path
Disallow /

wormlybot

Rule Path
Disallow /

calibrebot

Rule Path
Disallow /

audistobot

Rule Path
Disallow /

hatena

Rule Path
Disallow /

hatena-bookmark

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

netvibes

Rule Path
Disallow /

webceo

Rule Path
Disallow /

postano

Rule Path
Disallow /

rebelmouse

Rule Path
Disallow /

muck-rack

Rule Path
Disallow /

instapaperviewer

Rule Path
Disallow /

twurly

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

datagnionbot

Rule Path
Disallow /

linespider

Rule Path
Disallow /

discourse

Rule Path
Disallow /

medusa

Rule Path
Disallow /

pingback

Rule Path
Disallow /

wordpress

Rule Path
Disallow /

wp_ping

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

wesee

Rule Path
Disallow /

halebot

Rule Path
Disallow /

brightbot

Rule Path
Disallow /

unityplayer

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.bloomberg.com/sitemaps/news/index.xml
sitemap https://www.bloomberg.com/sitemaps/news/latest.xml
sitemap https://www.bloomberg.com/sitemaps/collections/index.xml
sitemap https://www.bloomberg.com/sitemaps/media/video/index.xml
sitemap https://www.bloomberg.com/sitemaps/media/audio/index.xml
sitemap https://www.bloomberg.com/sitemaps/people/profiles/index.xml
sitemap https://www.bloomberg.com/sitemaps/companies/public-company/index.xml
sitemap https://www.bloomberg.com/sitemaps/companies/private-company/index.xml
sitemap https://www.bloomberg.com/sitemaps/securites/index.xml
sitemap https://www.bloomberg.com/billionaires/sitemap.xml

Comments

  • Bot rules:
  • 1. A bot may not injure a human being or, through inaction, allow a human being to come to harm.
  • 2. A bot must obey orders given it by human beings except where such orders would conflict with the First Law.
  • 3. A bot must protect its own existence as long as such protection does not conflict with the First or Second Law.
  • If you can read this then you should apply here https://www.bloomberg.com/careers/
  • Development Tools
  • AI/LLM Bots
  • Google Services
  • Major Company Bots
  • Feed & Content Aggregators
  • Analytics & Marketing
  • Search Engine Bots
  • Monitoring & Security
  • Social & Content
  • Potentially Malicious
  • Gaming/3D
  • Sitemaps app
  • Billionaires, owned by graphics

Warnings

  • 2 invalid lines.