crumblcookies.com
robots.txt

Robots Exclusion Standard data for crumblcookies.com

Resource Scan

Scan Details

Site Domain crumblcookies.com
Base Domain crumblcookies.com
Scan Status Ok
Last Scan 2024-07-03T07:15:04+00:00
Next Scan 2024-07-17T07:15:04+00:00

Last Scan

Scanned 2024-07-03T07:15:04+00:00
URL https://crumblcookies.com/robots.txt
Domain IPs 104.22.30.249, 104.22.31.249, 172.67.11.43, 2606:4700:10::6816:1ef9, 2606:4700:10::6816:1ff9, 2606:4700:10::ac43:b2b
Response IP 104.22.30.249
Found Yes
Hash b6e002330bfe103bfb83fbdce4768f1ad7a2cca533698438d67faa081afeed36
SimHash ec709565c635
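
The Hash value above is 64 hexadecimal digits long, which is consistent with SHA-256, although the report does not name the algorithm. As a minimal sketch under that assumption, a content fingerprint like this could be used to tell whether robots.txt changed between two scans. The function names `fingerprint` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is a SHA-256 digest of the response body.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored from an earlier scan."""
    return fingerprint(body) != previous_hash
```

An exact hash only reports that something changed, while a similarity hash such as the SimHash field is typically used to judge how close two versions are.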

Groups

*

Rule Path
Allow /
Disallow /tv
Disallow /tv/*
Disallow /statement
Disallow /app
Disallow /bank_success
Disallow /bank_retry
Disallow /shop
Disallow /en/account
Disallow /es/*
Disallow /f/*
Disallow /thankyou/*
Disallow /r/*
Disallow *?*
Disallow /api
Disallow /_next/*
Disallow /voucher/*
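
The group above mixes a blanket `Allow /` with narrower `Disallow` patterns, so the outcome for a given path depends on how matches are ranked. The sketch below evaluates the listed rules using the longest-match convention described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). It is an illustration under assumptions, not the scanner's or any crawler's actual implementation: `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names, and pattern length is measured in characters rather than encoded octets.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/tv"),
    ("disallow", "/tv/*"),
    ("disallow", "/statement"),
    ("disallow", "/app"),
    ("disallow", "/bank_success"),
    ("disallow", "/bank_retry"),
    ("disallow", "/shop"),
    ("disallow", "/en/account"),
    ("disallow", "/es/*"),
    ("disallow", "/f/*"),
    ("disallow", "/thankyou/*"),
    ("disallow", "/r/*"),
    ("disallow", "*?*"),
    ("disallow", "/api"),
    ("disallow", "/_next/*"),
    ("disallow", "/voucher/*"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under longest-match semantics."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len, best_allow = len(path), allow
    return best_allow


if __name__ == "__main__":
    for candidate in ["/", "/en/order", "/tv/live", "/shop", "/en/order?ref=1"]:
        print(candidate, is_allowed(candidate))
```

Under these assumptions, plain paths such as `/en/order` fall through to `Allow /`, while any URL carrying a query string is blocked by `*?*` because that pattern is longer than `/`.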

Other Records

Field Value
sitemap https://crumblcookies.com/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
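
A warning like the one above can come from a simple field check. The sketch below flags any `field: value` line whose field name falls outside a known set. The `KNOWN_FIELDS` set, the `unknown_fields` helper, and the sample text are assumptions for illustration only; the scanner's real field list and the site's actual file contents are not shown in this report.

```python
# Fields this sketch treats as known; the scanner's real list is an assumption here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt):
    """Return (line_number, field) pairs for fields outside KNOWN_FIELDS."""
    found = []
    for number, line in enumerate(robots_txt.splitlines(), start=1):
        line = line.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if ":" not in line:
            continue  # blank or malformed line, nothing to check
        field = line.split(":", 1)[0].strip().lower()
        if field and field not in KNOWN_FIELDS:
            found.append((number, field))
    return found


# Hypothetical sample, not the actual crumblcookies.com file.
SAMPLE = """\
User-agent: *
Allow: /
Host: https://example.com
Sitemap: https://example.com/sitemap.xml
"""

if __name__ == "__main__":
    for number, field in unknown_fields(SAMPLE):
        print(f"line {number}: `{field}` is not a known field.")
```

Field names are compared case-insensitively, so `Host` and `host` are reported the same way.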