cleveland.com
robots.txt

Robots Exclusion Standard data for cleveland.com

Resource Scan

Scan Details

Site Domain cleveland.com
Base Domain cleveland.com
Scan Status Ok
Last Scan 2024-04-25T19:53:50+00:00
Next Scan 2024-05-02T19:53:50+00:00

Last Scan

Scanned 2024-04-25T19:53:50+00:00
URL https://cleveland.com/robots.txt
Redirect https://www.cleveland.com:443/robots.txt
Redirect Domain www.cleveland.com
Redirect Base cleveland.com
Domain IPs 75.2.53.215, 99.83.138.34
Redirect IPs 173.222.148.35, 173.222.148.48, 2600:1413:b000:13::b857:c18f, 2600:1413:b000:13::b857:c192
Response IP 42.99.140.217
Found Yes
Hash acd8ca87eb165d3f767d9c449eaba241c8c009b0514f46d8edeff9c73e1f743a
SimHash 5d3449454735

Groups

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /premium/
Disallow /contests/scrape/
Disallow /howard-hanna/
Disallow /hn/
Disallow /HyperNews/
Disallow /win/
Disallow /puzzles-palace/
Disallow /plain_dealer_syndication/
Disallow /plain-dealer-stories/
Disallow /toprailapp/
Disallow /newssun/2010/06/middleburg_heights_native_find.html
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/
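
The bingbot group above mixes plain path prefixes (/comics/) with wildcard patterns (/*/print.html*, /*mt-preview-*). A minimal sketch of how such patterns are commonly matched, assuming the RFC 9309 conventions that `*` matches any run of characters, a trailing `$` anchors the end of the path, and matching is case-sensitive from the start of the path. The function name `rule_matches` and the sample paths are hypothetical and are not taken from the scan.

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt rule path matches a URL path.

    Assumes RFC 9309 style matching: '*' is a wildcard, a trailing '$'
    anchors the end, everything else is a literal, case-sensitive prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal characters; turn each '*' into the regex '.*'.
    regex = "".join(".*" if ch == "*" else re.escape(ch) for ch in pattern)
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path, giving prefix semantics.
    return re.match(regex, path) is not None

# Hypothetical paths checked against rules listed in the group above.
print(rule_matches("/*/print.html*", "/news/print.html"))
print(rule_matches("/comics/", "/comics-kingdom/"))
print(rule_matches("/*mt-preview-*", "/entertainment/mt-preview-abc.html"))
print(rule_matches("/HyperNews/", "/hypernews/"))
```

Under these assumptions the first and third checks match and the second and fourth do not, which is one reason the group lists /comics/ and /comics-kingdom/ as separate rules and lists both /hn/ and /HyperNews/.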

googlebot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /premium/
Disallow /contests/scrape/
Disallow /howard-hanna/
Disallow /seniorgames/
Disallow /hn/
Disallow /HyperNews/
Disallow /win/
Disallow /puzzles-palace/
Disallow /plain_dealer_syndication/
Disallow /plain-dealer-stories/
Disallow /toprailapp/
Disallow /newssun/2010/06/middleburg_heights_native_find.html
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

proximic

Rule Path
Disallow /home-beta/
Disallow /test/

slurp

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /premium/
Disallow /contests/scrape/
Disallow /howard-hanna/
Disallow /seniorgames/
Disallow /hn/
Disallow /HyperNews/
Disallow /win/
Disallow /puzzles-palace/
Disallow /plain_dealer_syndication/
Disallow /plain-dealer-stories/
Disallow /toprailapp/
Disallow /newssun/2010/06/middleburg_heights_native_find.html
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

twitterbot

Rule Path
Disallow /home-beta/
Disallow /test/
Disallow /preview/
Disallow /staging/
Disallow /clients/
Disallow /auctions/
Disallow /cgi-bin/
Disallow /printer/
Disallow /*/print.html*
Disallow /*/print.ssf*
Disallow /landingpages/
Disallow /premium/
Disallow /contests/scrape/
Disallow /howard-hanna/
Disallow /hn/
Disallow /HyperNews/
Disallow /win/
Disallow /puzzles-palace/
Disallow /plain_dealer_syndication/
Disallow /plain-dealer-stories/
Disallow /toprailapp/
Disallow /newssun/2010/06/middleburg_heights_native_find.html
Disallow /*mt-preview-*
Disallow /puzzles-games/
Disallow /comics/
Disallow /puzzle-society/
Disallow /go-comics/
Disallow /comics-kingdom/

*

Rule Path
Disallow /digitalsubscription/imagedownload/
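
The groups above can be checked programmatically. A minimal sketch using Python's standard urllib.robotparser, assuming the gptbot and * groups are re-typed in robots.txt syntax (a reconstruction from this scan, not the verbatim file). The agent name ExampleBot, the helper `allowed`, and the sample paths are hypothetical. Note that this parser does prefix matching only and does not expand `*` wildcards inside rule paths, so it is shown here only with plain-prefix rules.

```python
from urllib.robotparser import RobotFileParser

# Two groups re-typed from the scan in robots.txt syntax
# (assumed reconstruction, not the file as served).
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: *
Disallow: /digitalsubscription/imagedownload/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether `agent` may fetch `path` on www.cleveland.com."""
    return parser.can_fetch(agent, "https://www.cleveland.com" + path)

# gptbot has its own group with "Disallow /", so every path is blocked.
print(allowed("GPTBot", "/news/"))                                    # False
# An agent with no named group falls back to the "*" group.
print(allowed("ExampleBot", "/news/"))                                # True
print(allowed("ExampleBot", "/digitalsubscription/imagedownload/a"))  # False
```

A crawler matched by a named group uses only that group, so the * rule does not add to the gptbot rules; it applies only to agents with no group of their own.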

Other Records

Field Value
sitemap https://www.cleveland.com/arc/outboundfeeds/rss-latest/?outputType=xml
sitemap https://www.cleveland.com/arc/outboundfeeds/sitemap-index/?outputType=xml
sitemap https://www.cleveland.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml