cattolica.it
robots.txt
Robots Exclusion Standard data for cattolica.it
Resource Scan
Scan Details
Site Domain | cattolica.it |
Base Domain | cattolica.it |
Scan Status | Ok |
Last Scan | 2024-06-22T02:12:14+00:00 |
Next Scan | 2024-07-22T02:12:14+00:00 |
Last Scan
Scanned | 2024-06-22T02:12:14+00:00 |
URL | https://cattolica.it/robots.txt |
Redirect | https://www.cattolica.it/robots.txt |
Redirect Domain | www.cattolica.it |
Redirect Base | cattolica.it |
Domain IPs | 52.49.27.220 |
Redirect IPs | 108.156.133.28, 108.156.133.35, 108.156.133.47, 108.156.133.82 |
Response IP | 108.156.133.35 |
Found | Yes |
Hash | 97e1f0e04ef06868d436f54588449b9a6f38936a6abd6230dd27f9d85e965e1a |
SimHash | e9498b464fb1 |
Groups
*
Rule | Path |
---|---|
Disallow | */en/* |
Disallow | */investor-relation* |
Disallow | */comunicati-rss* |
Disallow | */group* |
Disallow | */media-relation* |
*
Rule | Path |
---|---|
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.cattolica.it/sitemap.xml |
Warnings
- `host` is not a known field.