cumbriaguru.com
robots.txt

Robots Exclusion Standard data for cumbriaguru.com

Resource Scan

Scan Details

Site Domain cumbriaguru.com
Base Domain cumbriaguru.com
Scan Status Ok
Last Scan 2026-03-06T12:56:20+00:00
Next Scan 2026-03-13T12:56:20+00:00

Last Scan

Scanned 2026-03-06T12:56:20+00:00
URL https://cumbriaguru.com/robots.txt
Domain IPs 77.72.2.43
Response IP 77.72.2.43
Found Yes
Hash 8125c137ab29de026069b22c2dd1ac0b791d8cf533cd4a1b0ed90b3014c9bd55
SimHash 005c88d2e799
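
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a change check between two scans could look like the following. The function names `body_digest` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is SHA-256 over the raw body.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hex: str) -> bool:
    """Compare a freshly fetched body against a previously recorded digest."""
    return body_digest(body) != previous_hex.lower()
```

Passing the body fetched at the next scan together with the Hash recorded here would report whether the file changed, provided the assumption about the algorithm holds.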

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

bingbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

ccbot

Rule Path
Allow /

bravebot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

moz

Rule Path
Disallow /

mozbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

siteauditor

Rule Path
Disallow /

cocolyzebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cumbriaguru.com/sitemap.xml
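
To illustrate how the groups and the sitemap record above are evaluated by a crawler, here is a small sketch using Python's standard `urllib.robotparser`. The `ROBOTS_EXCERPT` text is a hypothetical reconstruction of a few of the listed groups, not the verbatim file served by the site, and `ExampleBot` is a made-up agent name standing in for any crawler without its own group. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from some of the groups in this report.
# The real file's ordering, casing, and comments may differ.
ROBOTS_EXCERPT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Allow: /

User-agent: SemrushBot
Disallow: /

User-agent: MJ12bot
Disallow: /

Sitemap: https://www.cumbriaguru.com/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)
HOME = "https://cumbriaguru.com/"

# A named group takes precedence over the "*" group for that agent.
print(parser.can_fetch("GPTBot", HOME))      # allowed by its own group
print(parser.can_fetch("SemrushBot", HOME))  # blocked by its own group
print(parser.can_fetch("ExampleBot", HOME))  # falls back to the "*" group
print(parser.site_maps())                    # sitemap URLs declared in the file
```

Note that `urllib.robotparser` matches agent names by case-insensitive substring, so a short token such as `moz` can match more user agents than a stricter parser would; other crawlers may resolve such groups differently.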

Comments

  • --- ALLOW MAJOR SEARCH + AI CRAWLERS ---
  • --- BLOCK NON-ESSENTIAL / HEAVY / SCRAPER BOTS ---
  • --- SITEMAP ---