ocnews.us
robots.txt

Robots Exclusion Standard data for ocnews.us

Resource Scan

Scan Details

Site Domain ocnews.us
Base Domain ocnews.us
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-11T16:17:18+00:00
Next Scan 2024-11-10T16:17:18+00:00

Last Successful Scan

Scanned2024-09-12T11:58:40+00:00
URL https://ocnews.us/robots.txt
Domain IPs 104.21.95.200, 172.67.147.179, 2606:4700:3035::6815:5fc8, 2606:4700:3037::ac43:93b3
Response IP 172.67.147.179
Found Yes
Hash 73db3c16153bf458dd5c4e8ec124d321da74a3a92fc2f1f843cf07a40addc881
SimHash 7a1091136ee7

Groups

semrushbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /trackback/
Disallow /comments/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

oai-searchbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

mozilla/5.0 (compatible; newsroom.bi/0.1; +https://www.newsroom.bi/bot.html)

Rule Path
Allow /

mozilla/5.0 (linux; android 6.0; nexus 5 build/mra51n) applewebkit/537.36 (khtml, like gecko) chrome/87.0.4280.67 mobile safari/537.36 (compatible; mrfcompass-booldog/1.0)

Rule Path
Allow /

mozilla/5.0 (macintosh; intel mac os x 10_9_2) applewebkit/537.36(khtml, like gecko) chrome/35.0.1916.153 safari/537.36 (compatible; mrfcompass-booldog/1.0)

Rule Path
Allow /

mozilla/5.0 (linux; android 6.0.1; nexus 5x build/mmb29p) applewebkit/537.36 (khtml, like gecko) chrome/41.0.2272.96 mobile safari/537.36 (compatible; mrfcompass-marshall/1.0)

Rule Path
Allow /

mozilla/5.0 (linux; android 6.0; nexus 5 build/mra51n) applewebkit/537.36 (khtml, like gecko) chrome/87.0.4280.67 mobile safari/537.36 (compatible; mrfcompass-jukebox/1.0)

Rule Path
Allow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ocnews.us/sitemap.xml

Comments

  • Our Content is made available for your personal, non-commercial use subject to our Terms of Service here: https://www.medianewsgroup.com/terms-of-use/
  • Any other uses are prohibited, including but not limited to:
  • (1) text and data mining activities under Art. 4 of the EU Directive on Copyright in the Digital Single Market;
  • (2) use of any Content or information available on the Site for purposes of retrieval augmented generation, grounding, training, or development of machine learning models, algorithms, or artificial intelligence (AI) systems, or to generate substitute content or develop any products, services, or technology;
  • (3) caching or archiving the Content; and/or
  • (4) any commercial purposes.
  • Contact us at https://ocnews.us/contact-us/ for assistance.
  • Sitemap archive