intreccia.it
robots.txt
Robots Exclusion Standard data for intreccia.it
Resource Scan
Scan Details
Site Domain | intreccia.it |
Base Domain | intreccia.it |
Scan Status | Ok |
Last Scan | 2024-11-10T18:24:34+00:00 |
Next Scan | 2024-11-17T18:24:34+00:00 |
Last Scan
Scanned | 2024-11-10T18:24:34+00:00 |
URL | https://intreccia.it/robots.txt |
Redirect | https://www.intreccia.it/robots.txt |
Redirect Domain | www.intreccia.it |
Redirect Base | intreccia.it |
Domain IPs | 3.73.135.230 |
Redirect IPs | 138.201.201.18, 176.9.79.149, 195.201.193.180, 213.133.97.172, 88.99.101.219, 88.99.2.209, 88.99.2.210, 88.99.2.212, 88.99.2.213, 94.130.164.5, 94.130.206.224 |
Response IP | 88.99.2.210 |
Found | Yes |
Hash | ab3ad9c12d4fd688e51b46ac24b0ae8d617f37c0643fe7d1b6970cfef82d74e0 |
SimHash | e9a9a800edbb |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wc-logs/ |
Disallow | /wp-content/uploads/woocommerce_transient_files/ |
Disallow | /wp-content/uploads/woocommerce_uploads/ |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://www.intreccia.it/sitemap_index.xml |
Comments