thefreshgrow.com
robots.txt
Robots Exclusion Standard data for thefreshgrow.com
Resource Scan
Scan Details
Site Domain | thefreshgrow.com |
Base Domain | thefreshgrow.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-13T07:23:43+00:00 |
Next Scan | 2024-11-20T07:23:43+00:00 |
Last Successful Scan
Scanned | 2024-11-05T06:08:21+00:00 |
URL | https://thefreshgrow.com/robots.txt |
Domain IPs | 104.21.90.114, 172.67.200.99, 2606:4700:3032::ac43:c863, 2606:4700:3035::6815:5a72 |
Response IP | 104.21.90.114 |
Found | Yes |
Hash | 94e176b2d5e1e54c459003b7bcee2d437e15c3d481e04b60fd35f8b649a8eaea |
SimHash | 6404f047cf50 |
Groups
*
Rule | Path |
---|---|
Allow | /*.js? |
Allow | /*.css? |
Allow | /*.png? |
Allow | /*.jpg? |
Allow | /*.json? |
Allow | /*.ico? |
Allow | /*.svg? |
Allow | /*.eot? |
Allow | /*.woff? |
Allow | /*.ttf? |
Allow | /*.xml? |
Disallow | /?utm_source= |
Other Records
Field | Value |
---|---|
sitemap | https://thefreshgrow.com/sitemap_index.xml |
Warnings
- `host` is not a known field.