crumblcookies.com
robots.txt

Robots Exclusion Standard data for crumblcookies.com

Resource Scan

Scan Details

Site Domain crumblcookies.com
Base Domain crumblcookies.com
Scan Status Ok
Last Scan2024-11-06T14:10:47+00:00
Next Scan 2024-11-20T14:10:47+00:00

Last Scan

Scanned2024-11-06T14:10:47+00:00
URL https://crumblcookies.com/robots.txt
Domain IPs 104.18.8.115, 104.18.9.115, 2606:4700::6812:873, 2606:4700::6812:973
Response IP 104.18.9.115
Found Yes
Hash b6e002330bfe103bfb83fbdce4768f1ad7a2cca533698438d67faa081afeed36
SimHash ec709565c635

Groups

*

Rule Path
Allow /
Disallow /tv
Disallow /tv/*
Disallow /statement
Disallow /app
Disallow /bank_success
Disallow /bank_retry
Disallow /shop
Disallow /en/account
Disallow /es/*
Disallow /f/*
Disallow /thankyou/*
Disallow /r/*
Disallow *?*
Disallow /api
Disallow /_next/*
Disallow /voucher/*

Other Records

Field Value
sitemap https://crumblcookies.com/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.