icfalcomatarchi.edu.it
robots.txt
Robots Exclusion Standard data for icfalcomatarchi.edu.it
Resource Scan
Scan Details
| Site Domain | icfalcomatarchi.edu.it |
| Base Domain | icfalcomatarchi.edu.it |
| Scan Status | Ok |
| Last Scan | 2026-02-28T10:02:52+00:00 |
| Next Scan | 2026-03-30T10:02:52+00:00 |
Last Scan
| Scanned | 2026-02-28T10:02:52+00:00 |
| URL | https://icfalcomatarchi.edu.it/robots.txt |
| Redirect | https://www.icfalcomatarchi.edu.it/robots.txt |
| Redirect Domain | www.icfalcomatarchi.edu.it |
| Redirect Base | icfalcomatarchi.edu.it |
| Domain IPs | 2a00:6d40:4:1::c275:42, 89.46.108.42 |
| Redirect IPs | 2a00:6d40:4:1::c275:42, 89.46.108.42 |
| Response IP | 89.46.108.42 |
| Found | Yes |
| Hash | 08f5c91b482cb0d84e39dde82197cc1e873a6882fb7c45ba294c2b20cc6e62f6 |
| SimHash | 69400a82c513 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.icfalcomatarchi.edu.it/sitemap_index.xml |
Warnings
- 3 invalid lines.
- `<!doctype html public "-//w3c//dtd html 4.0 transitional//en" "http` is not a known field.
- `performance optimized by w3 total cache. learn more` is not a known field.
- `served from` is not a known field.
Comments