crumbl.com
robots.txt

Robots Exclusion Standard data for crumbl.com

Resource Scan

Scan Details

Site Domain crumbl.com
Base Domain crumbl.com
Scan Status Ok
Last Scan2024-11-04T00:46:34+00:00
Next Scan 2024-11-18T00:46:34+00:00

Last Scan

Scanned2024-11-04T00:46:34+00:00
URL https://crumbl.com/robots.txt
Redirect https://crumblcookies.com/robots.txt
Redirect Domain crumblcookies.com
Redirect Base crumblcookies.com
Domain IPs 108.157.254.124, 108.157.254.37, 108.157.254.43, 108.157.254.74
Redirect IPs 104.18.8.115, 104.18.9.115, 2606:4700::6812:873, 2606:4700::6812:973
Response IP 104.18.9.115
Found Yes
Hash b6e002330bfe103bfb83fbdce4768f1ad7a2cca533698438d67faa081afeed36
SimHash ec709565c635

Groups

*

Rule Path
Allow /
Disallow /tv
Disallow /tv/*
Disallow /statement
Disallow /app
Disallow /bank_success
Disallow /bank_retry
Disallow /shop
Disallow /en/account
Disallow /es/*
Disallow /f/*
Disallow /thankyou/*
Disallow /r/*
Disallow *?*
Disallow /api
Disallow /_next/*
Disallow /voucher/*

Other Records

Field Value
sitemap https://crumblcookies.com/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.