pchealthadvisor.com
robots.txt

Robots Exclusion Standard data for pchealthadvisor.com

Resource Scan

Scan Details

Site Domain pchealthadvisor.com
Base Domain pchealthadvisor.com
Scan Status Ok
Last Scan 2025-11-08T16:47:28+00:00
Next Scan 2025-12-08T16:47:28+00:00

Last Scan

Scanned 2025-11-08T16:47:28+00:00
URL https://pchealthadvisor.com/robots.txt
Domain IPs 104.21.63.239, 172.67.173.28, 2606:4700:3030::ac43:ad1c, 2606:4700:3033::6815:3fef
Response IP 104.21.63.239
Found Yes
Hash 519b52219b3db22ce253f392418e5bb8363d10c128e5ee518ff120d498c67e47
SimHash 44156bd3c591

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

googlebot

Rule Path
Disallow

adsense

Rule Path
Disallow

inspection

Rule Path
Disallow

telegrambot

Rule Path
Disallow

yandexbot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

bingbot

Rule Path
Disallow

adidxbot

Rule Path
Disallow

slurp

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow

yandex

Rule Path
Disallow

sogou

Rule Path
Disallow

exabot

Rule Path
Disallow

facebot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

applebot

Rule Path
Disallow

petalbot

Rule Path
Disallow

linkedinbot

Rule Path
Disallow

amazonbot

Rule Path
Disallow

amazonbot-image

Rule Path
Disallow

gptbot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow

ccbot

Rule Path
Disallow

claudebot

Rule Path
Disallow

claude-web

Rule Path
Disallow

perplexitybot

Rule Path
Disallow

oai-searchbot

Rule Path
Disallow

oai-search

Rule Path
Disallow

magpie-crawler

Rule Path
Disallow

cohere-ai

Rule Path
Disallow

youbot

Rule Path
Disallow

anthropic-ai

Rule Path
Disallow

bytedancespider

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow

semrushbot

Rule Path
Disallow

mj12bot

Rule Path
Disallow

dataforseobot

Rule Path
Disallow

diffbot

Rule Path
Disallow

scrapy

Rule Path
Disallow

serpstatbot

Rule Path
Disallow

dotbot

Rule Path
Disallow

archive.org_bot

Rule Path
Disallow

wayback-machine

Rule Path
Disallow

commoncrawl

Rule Path
Disallow
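
The groups above mix two rule shapes that are easy to confuse: `Disallow /` blocks the whole site for that agent, while a `Disallow` with an empty path blocks nothing. The sketch below illustrates the difference with Python's standard `urllib.robotparser`. The sample text is a hand-written reconstruction of three of the listed groups, not the site's actual file, and `ExampleBot` is a hypothetical agent name used only to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Hand-written reconstruction of three groups from the listing above.
# This is NOT the real robots.txt, only an illustration of the rule shapes.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Disallow: /

User-agent: Googlebot
Disallow:
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(SAMPLE_ROBOTS_TXT)
url = "https://pchealthadvisor.com/"

# "ExampleBot" is a hypothetical agent with no group of its own,
# so it falls back to the "*" group.
results = {
    agent: parser.can_fetch(agent, url)
    for agent in ("GPTBot", "Googlebot", "ExampleBot")
}

for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Under this parser, `GPTBot` should come out blocked and the other two allowed. Some agents (amazonbot, gptbot, ccbot, claudebot) appear in two groups in the listing. `urllib.robotparser` applies the first matching group, while other parsers may merge duplicate groups, so results for those agents can depend on the parser in use.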

Comments

  • As a condition of accessing this website, you agree to abide by the following content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding use.
  • (b) If a content-signal = no, you may not collect content for the corresponding use.
  • (c) If the website operator does not include a content signal for a corresponding use, the website operator neither grants nor restricts permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning hyperlinks and short excerpts from your website's contents). Search does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval augmented generation, grounding, or other real-time taking of content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • ======================================
  • Robots.txt - Allow All Listed Crawlers
  • ======================================
  • ======================================
  • END
  • ======================================

Warnings

  • `content-signal` is not a known field.
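
The warning arises because `content-signal` is not one of the fields the scanner recognizes, although the comments above define three possible outcomes for each use: yes, no, or not stated. The sketch below shows one way a crawler might read such a line. It assumes the value is a comma-separated list of `name=yes` or `name=no` pairs. The scan does not show the site's actual signal values, so the example string is hypothetical, as are the function names `parse_content_signal` and `may_collect`.

```python
from typing import Dict, Optional


def parse_content_signal(value: str) -> Dict[str, bool]:
    """Parse the value part of a Content-Signal line.

    Assumed syntax: comma-separated name=yes / name=no pairs.
    Pairs that do not fit that shape are ignored.
    """
    signals: Dict[str, bool] = {}
    for part in value.split(","):
        name, sep, setting = part.partition("=")
        name = name.strip().lower()
        setting = setting.strip().lower()
        if sep and name and setting in ("yes", "no"):
            signals[name] = setting == "yes"
    return signals


def may_collect(signals: Dict[str, bool], use: str) -> Optional[bool]:
    """Return True (yes), False (no), or None when no signal is given.

    None mirrors rule (c): permission is neither granted nor restricted
    via content signal for that use.
    """
    return signals.get(use.strip().lower())


# Hypothetical value; the real signals for this site are not in the scan.
example = parse_content_signal("search=yes, ai-train=no")

for use in ("search", "ai-input", "ai-train"):
    print(use, may_collect(example, use))
```

Returning `None` for a missing signal keeps the third case distinct from an explicit `no`, which matters because rule (c) treats silence as neither permission nor restriction.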