cargill.it
robots.txt
Robots Exclusion Standard data for cargill.it
Resource Scan
Scan Details
| Site Domain | cargill.it |
| Base Domain | cargill.it |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2026-03-08T02:46:41+00:00 |
| Next Scan | 2026-04-07T02:46:41+00:00 |
Last Successful Scan
| Scanned | 2026-01-15T01:14:35+00:00 |
| URL | https://cargill.it/robots.txt |
| Redirect | https://www.cargill.it/robots.txt |
| Redirect Domain | www.cargill.it |
| Redirect Base | cargill.it |
| Domain IPs | 44.209.174.5, 44.213.185.86 |
| Redirect IPs | 104.18.28.180, 104.18.29.180, 2606:4700::6812:1cb4, 2606:4700::6812:1db4 |
| Response IP | 104.18.28.180 |
| Found | Yes |
| Hash | ca860ae4248fe38f36e8c647fd1b0baf6dd7bfd33888b39cf6635e5c45810098 |
| SimHash | c0019450c773 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /en/search-results |
| Disallow | /page/en/search-results |
| Disallow | /it/risultati-della-ricerca |