crumbl.com
robots.txt

Robots Exclusion Standard data for crumbl.com

Resource Scan

Scan Details

Site Domain crumbl.com
Base Domain crumbl.com
Scan Status Ok
Last Scan2024-06-30T19:02:17+00:00
Next Scan 2024-07-14T19:02:17+00:00

Last Scan

Scanned2024-06-30T19:02:17+00:00
URL https://crumbl.com/robots.txt
Redirect https://crumblcookies.com/robots.txt
Redirect Domain crumblcookies.com
Redirect Base crumblcookies.com
Domain IPs 108.157.254.124, 108.157.254.37, 108.157.254.43, 108.157.254.74
Redirect IPs 104.22.30.249, 104.22.31.249, 172.67.11.43, 2606:4700:10::6816:1ef9, 2606:4700:10::6816:1ff9, 2606:4700:10::ac43:b2b
Response IP 172.67.11.43
Found Yes
Hash b6e002330bfe103bfb83fbdce4768f1ad7a2cca533698438d67faa081afeed36
SimHash ec709565c635

Groups

*

Rule Path
Allow /
Disallow /tv
Disallow /tv/*
Disallow /statement
Disallow /app
Disallow /bank_success
Disallow /bank_retry
Disallow /shop
Disallow /en/account
Disallow /es/*
Disallow /f/*
Disallow /thankyou/*
Disallow /r/*
Disallow *?*
Disallow /api
Disallow /_next/*
Disallow /voucher/*

Other Records

Field Value
sitemap https://crumblcookies.com/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.