eu.cincinnati.com
robots.txt

Robots Exclusion Standard data for eu.cincinnati.com

Resource Scan

Scan Details

Site Domain eu.cincinnati.com
Base Domain cincinnati.com
Scan Status Ok
Last Scan2025-09-21T04:43:55+00:00
Next Scan 2025-09-28T04:43:55+00:00

Last Scan

Scanned2025-09-21T04:43:55+00:00
URL https://eu.cincinnati.com/robots.txt
Redirect https://www.cincinnati.com/robots.txt
Redirect Domain www.cincinnati.com
Redirect Base cincinnati.com
Domain IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Redirect IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Response IP 146.75.46.62
Found Yes
Hash a1dbe94d8fe63df6b5efb2d2d99bb62c51d22e6368b5d466121b2554011b861c
SimHash 7f1c03d7c281

Groups

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /story/sponsor-story/
Disallow /picture-gallery/sponsor-story/
Disallow /videos/sponsor-story/
Disallow /longform/sponsor-story/
Disallow /pages/interactives/sponsor-story/
Disallow /interactives/sponsor-story/
Disallow /videos/embed/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

applebot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

crawlspace

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

https://hada.news

Rule Path
Disallow /

https://www.imediaethics.org

Rule Path
Disallow /

iaskspider/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

isscyberriskcrawler

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

neticlebot

Rule Path
Disallow /

netvibes

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sidetrade indexer bot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexadditional

Rule Path
Disallow /

yandexadditionalbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

amazon kendra

Rule Path
Disallow /

big sur ai

Rule Path
Disallow /

brandwatch

Rule Path
Disallow /

bravest

Rule Path
Disallow /

chatgpt operator

Rule Path
Disallow /

cotoyogi

Rule Path
Disallow /

digitaloceangenaicrawler

Rule Path
Disallow /

echobot bot

Rule Path
Disallow /

echoboxbot

Rule Path
Disallow /

factset_spyderbot

Rule Path
Disallow /

grok

Rule Path
Disallow /

iaskspider

Rule Path
Disallow /

icc crawler

Rule Path
Disallow /

imgproxy

Rule Path
Disallow /

liner bot

Rule Path
Disallow /

mistralai-user

Rule Path
Disallow /

netestate imprint crawler

Rule Path
Disallow /

novaact

Rule Path
Disallow /

qualifiedbot

Rule Path
Disallow /

sbintuitionsbot

Rule Path
Disallow /

semrushbot-ocob

Rule Path
Disallow /

semrushbotswa

Rule Path
Disallow /

sidetrade

Rule Path
Disallow /

tiktokspider

Rule Path
Disallow /

wardbot

Rule Path
Disallow /

*

Rule Path
Disallow /errors
Disallow /interactive/
Disallow /userauth/
Disallow /ugc/
Disallow /feeds/
Disallow /services/
Disallow /facebook/
Disallow /version-info/
Disallow /longform/draft/
Disallow /story/draft/
Disallow /topic/*/smart/
Disallow /search
Disallow /module-showcase/
Disallow /newsletter/
Disallow /blended-newsletter/
Disallow /story/nletter/
Disallow /sports/services/photos/
Disallow /optimus
Disallow /ux-train
Disallow /story/advisory/
Disallow /.cam-tangent/
Disallow /pbd/
Disallow /gciaf/
Disallow /gcdn/gciaf/
Disallow /dcc/
Disallow /gcdn/dcc/
Disallow /dc/
Disallow /gcdn/dc/
Disallow /dcjs/
Disallow /gcdn/dcjs/
Disallow /content-queries/
Disallow /zxc/

Other Records

Field Value
sitemap https://www.cincinnati.com/news-sitemap.xml
sitemap https://www.cincinnati.com/web-sitemap-index.xml
sitemap https://www.cincinnati.com/video-sitemap-index.xml
sitemap https://www.cincinnati.com/betting/sitemap.xml
sitemap https://www.cincinnati.com/betting/news-sitemap.xml

Comments

  • robots.txt file for https://www.cincinnati.com/