plaidscocooning.com
robots.txt

Robots Exclusion Standard data for plaidscocooning.com

Resource Scan

Scan Details

Site Domain plaidscocooning.com
Base Domain plaidscocooning.com
Scan Status Ok
Last Scan 2025-12-17T15:11:37+00:00
Next Scan 2025-12-31T15:11:37+00:00

Last Scan

Scanned 2025-12-17T15:11:37+00:00
URL https://plaidscocooning.com/robots.txt
Redirect https://thewaro.com/robots.txt
Redirect Domain thewaro.com
Redirect Base thewaro.com
Domain IPs 23.227.38.65
Redirect IPs 23.227.38.65
Response IP 23.227.38.65
Found Yes
Hash 8a9e94ad97ac86430754cefa1340a80028ea4e0761feeb0e2e1b97ebad0f3b6f
SimHash 6d909ed05c89
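
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm or say whether the body is normalized before hashing, so the sketch below is an assumption: it hashes the raw response bytes and compares the result with a stored digest to detect changes between scans. The function names `body_fingerprint` and `has_changed` are illustrative, not part of the scanner.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Report whether the body differs from the previously recorded digest."""
    return body_fingerprint(body) != previous_hash.lower()


if __name__ == "__main__":
    sample = b"User-agent: *\nDisallow: /admin\n"
    digest = body_fingerprint(sample)
    print(len(digest))                   # 64 hex characters
    print(has_changed(digest, sample))   # False: identical body
```

An exact hash only shows that something changed. The shorter SimHash field is presumably a similarity fingerprint for judging how much changed, but the report does not describe how it is computed.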

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /orders
Disallow /checkouts/
Disallow /checkout
Disallow /carts
Disallow /account
Disallow /search
Disallow /*preview_theme_id*
Disallow /*preview_script_id*
Disallow /*?*oseid=*
Disallow */collections/*filter*%26*filter*
Disallow /*?*sort_by*
Disallow /*?*utm_*
Disallow /*?*variant=*
Allow /policies/
Allow /
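
The rules in this group mix plain path prefixes with `*` wildcards, so it helps to see how a crawler might evaluate them. The sketch below follows the precedence described in RFC 9309: the longest matching pattern wins, and an Allow rule wins a tie. It uses only a subset of the rules listed above, and `pattern_to_regex` and `is_allowed` are illustrative names. Real crawlers may differ in details such as percent-encoding.

```python
import re

# Subset of the rules listed for the "*" group above.
RULES = [
    ("disallow", "/admin"),
    ("disallow", "/cart"),
    ("disallow", "/checkouts/"),
    ("disallow", "/search"),
    ("disallow", "/*?*sort_by*"),
    ("disallow", "/*?*variant=*"),
    ("allow", "/policies/"),
    ("allow", "/"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ("/products/shirt", "/cart", "/policies/refund-policy",
                 "/products/shirt?variant=123", "/collections/all?sort_by=price"):
        print(path, is_allowed(path))
```

Because unanchored patterns are prefix matches, `Disallow /cart` already covers `/carts`, and `Disallow /checkout` already covers `/checkouts/`. The extra entries in the list are redundant under this matching model but harmless.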

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

applebot

Rule Path
Allow /

yandex

Rule Path
Allow /

yandeximages

Rule Path
Allow /

slurp

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

ahrefssiteaudit

Rule Path
Allow /

semrushbot

Rule Path
Allow /

siteauditbot

Rule Path
Allow /

semrushbot-sa

Rule Path
Allow /

dotbot

Rule Path
Allow /

mj12bot

Rule Path
Allow /

screaming frog seo spider

Rule Path
Allow /

botify

Rule Path
Allow /

deepcrawl

Rule Path
Allow /

lumar

Rule Path
Allow /

oncrawl

Rule Path
Allow /

rytebot

Rule Path
Allow /

onpagebot

Rule Path
Allow /

seobility

Rule Path
Allow /

seobilitybot

Rule Path
Allow /

serpstatbot

Rule Path
Allow /

rsiteauditor

Rule Path
Allow /

sitebulb

Rule Path
Allow /

sitebulb crawler

Rule Path
Allow /

petalbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

sogou

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /
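
With this many groups, which one a given crawler obeys matters as much as the rules themselves. Under RFC 9309 a crawler uses the group that matches its product token and falls back to `*` only when no group matches; it does not merge the two. The sketch below assumes a simplified case-insensitive substring match on the user-agent string, with the longest token preferred. `GROUPS` holds a small subset of the groups above, and `select_group` is an illustrative name.

```python
# Small subset of the groups listed above: token -> list of (rule, path).
GROUPS = {
    "*": [("disallow", "/admin"), ("disallow", "/cart"), ("allow", "/")],
    "googlebot": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "gptbot": [("disallow", "/")],
}


def select_group(user_agent: str, groups=GROUPS):
    """Return (token, rules) for the most specific matching group.

    Assumption: a token matches if it appears anywhere in the lowercased
    user-agent string; the longest matching token is treated as most specific.
    """
    ua = user_agent.lower()
    candidates = [token for token in groups if token != "*" and token in ua]
    if candidates:
        token = max(candidates, key=len)
        return token, groups[token]
    return "*", groups.get("*", [])


if __name__ == "__main__":
    for ua in ("Googlebot-Image/1.0",
               "Mozilla/5.0 (compatible; GPTBot/1.0)",
               "SomeOtherBot/2.0"):
        token, rules = select_group(ua)
        print(ua, "->", token, rules)
```

One consequence worth checking: a crawler with its own `Allow /` group, such as googlebot, is not bound by the `*` group's Disallow entries for `/admin`, `/cart`, `/checkout` and similar paths. If those paths should stay blocked for every crawler, the Disallow lines would need to be repeated in each named group.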

Other Records

Field Value
sitemap https://thewaro.com/sitemap.xml
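
The Groups and Other Records tables above are a parsed view of the raw file. As a rough illustration of how such a view could be produced, the sketch below parses robots.txt text into per-agent rule lists and a list of sitemap URLs. `SAMPLE` is a hypothetical excerpt reconstructed from this report, not the actual file contents, and `parse_robots` is an illustrative name rather than the scanner's implementation.

```python
# Hypothetical excerpt reconstructed from the report, not the real file.
SAMPLE = """\
# Default Rules
User-agent: *
Disallow: /admin
Allow: /

# Blocked Spammy / Scraper Bots
User-agent: gptbot
Disallow: /

Sitemap: https://thewaro.com/sitemap.xml
"""


def parse_robots(text: str):
    """Parse robots.txt text into (groups, sitemaps).

    groups maps a lowercased user-agent token to a list of (rule, path);
    sitemaps is a list of URLs. Unknown fields are ignored.
    """
    groups, sitemaps = {}, []
    current, in_rules = [], False  # tokens of the group being built
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line or ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                current, in_rules = [], False
            current.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for token in current:
                groups[token].append((field, value))
        elif field == "sitemap":
            sitemaps.append(value)
    return groups, sitemaps


if __name__ == "__main__":
    groups, sitemaps = parse_robots(SAMPLE)
    print(groups)
    print(sitemaps)
```

The sitemap record is independent of any group, which is why the report lists it separately. Note that it points at thewaro.com, the redirect target, rather than plaidscocooning.com.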

Comments

  • ****************************************************************************
  • robots.txt
  • Purpose: Control how crawlers access Shopify site
  • Notes:
  • - Major search engines explicitly allowed
  • - Trusted SEO tools allowed for audits
  • - Sensitive Shopify pages blocked
  • - Spammy / scraper bots disallowed
  • ****************************************************************************
  • --------------
  • Default Rules
  • --------------
  • ------------------------------------------
  • Major Search Engines — Explicitly Allowed
  • ------------------------------------------
  • ----------------------------
  • Trusted SEO Tools — Allowed
  • ----------------------------
  • ------------------------------
  • Blocked Spammy / Scraper Bots
  • ------------------------------
  • -----------------
  • Sitemap Location
  • -----------------