icfalcomatarchi.edu.it
robots.txt

Robots Exclusion Standard data for icfalcomatarchi.edu.it

Resource Scan

Scan Details

Site Domain icfalcomatarchi.edu.it
Base Domain icfalcomatarchi.edu.it
Scan Status Ok
Last Scan2026-02-28T10:02:52+00:00
Next Scan 2026-03-30T10:02:52+00:00

Last Scan

Scanned2026-02-28T10:02:52+00:00
URL https://icfalcomatarchi.edu.it/robots.txt
Redirect https://www.icfalcomatarchi.edu.it/robots.txt
Redirect Domain www.icfalcomatarchi.edu.it
Redirect Base icfalcomatarchi.edu.it
Domain IPs 2a00:6d40:4:1::c275:42, 89.46.108.42
Redirect IPs 2a00:6d40:4:1::c275:42, 89.46.108.42
Response IP 89.46.108.42
Found Yes
Hash 08f5c91b482cb0d84e39dde82197cc1e873a6882fb7c45ba294c2b20cc6e62f6
SimHash 69400a82c513

Groups

termlybot

Rule Path
Allow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.icfalcomatarchi.edu.it/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK</p></body></html>

Warnings

  • 3 invalid lines.
  • `<!doctype html public "-//w3c//dtd html 4.0 transitional//en" "http` is not a known field.
  • `performance optimized by w3 total cache. learn more` is not a known field.
  • `served from` is not a known field.