hccpei.ca
robots.txt

Robots Exclusion Standard data for hccpei.ca

Resource Scan

Scan Details

Site Domain hccpei.ca
Base Domain hccpei.ca
Scan Status Ok
Last Scan2025-11-04T09:06:41+00:00
Next Scan 2025-12-04T09:06:41+00:00

Last Scan

Scanned2025-11-04T09:06:41+00:00
URL https://hccpei.ca/robots.txt
Redirect https://www.hccpei.ca/robots.txt
Redirect Domain www.hccpei.ca
Redirect Base hccpei.ca
Domain IPs 3.214.112.14, 54.237.225.198
Redirect IPs 3.214.112.14, 54.237.225.198
Response IP 54.237.225.198
Found Yes
Hash d9224f526cf16f633f5ef3ead4fec80b316ab666b37af0f2eb0cf89a5ee067bc
SimHash a3257553dc62

Groups

*

Rule Path
Disallow /Administration/*
Disallow /bin/
Disallow /VideoUtils/
Disallow /gb/
Disallow /SocialMedia/
Disallow /FunHelper/
Disallow /ObituariesHelper/RefreshComments
Disallow /ObituariesHelper/ObituaryEventsLoad
Disallow /ObituariesHelper/PurchaseWallProduct
Disallow /FunHelper/Print

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

baidu

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

uptimebot/1.0

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

zoominfo

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

naver.me

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

orbbot

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

addthis

Rule Path
Disallow /

webmeup

Rule Path
Disallow /

lanaibot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

panscient

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /obituaries/*

Other Records

Field Value
sitemap https://www.hccpei.ca/sitemap.xml
sitemap https://www.hccpei.ca/obituaries-sitemap/4.xml.gz
sitemap https://www.hccpei.ca/obituaries-sitemap/3.xml.gz
sitemap https://www.hccpei.ca/obituaries-sitemap/2.xml.gz
sitemap https://www.hccpei.ca/obituaries-sitemap/1.xml.gz
sitemap https://www.hccpei.ca/static-sitemap.xml

Comments

  • robots.txt for Umbraco

Warnings

  • 2 invalid lines.