gruppoeditorialesanpaolo.it
robots.txt

Robots Exclusion Standard data for gruppoeditorialesanpaolo.it

Resource Scan

Scan Details

Site Domain gruppoeditorialesanpaolo.it
Base Domain gruppoeditorialesanpaolo.it
Scan Status Ok
Last Scan 2025-03-10T15:16:50+00:00
Next Scan 2025-04-09T15:16:50+00:00

Last Scan

Scanned 2025-03-10T15:16:50+00:00
URL https://www.gruppoeditorialesanpaolo.it/robots.txt
Domain IPs 151.236.52.213
Response IP 151.236.52.213
Found Yes
Hash 066f8d382957f48390a47987c6bcc51648f32c2fa4aa924b8ec59542170c724e
SimHash 63037b615fc0

Groups

*

Rule Path
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /scripts/
Disallow /umbraco/*
Disallow /umbraco/controls/
Disallow /umbraco/uComponents/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/
Allow /scripts/
Allow /css/
Disallow /index.php?*
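
The group above lists both Disallow /scripts/ and Allow /scripts/, so the outcome for that path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie) and assuming `*` is treated as a wildcard. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration, not part of the scan tool or of any crawler.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/aspnet_client/"),
    ("disallow", "/bin/"),
    ("disallow", "/config/"),
    ("disallow", "/data/"),
    ("disallow", "/macroScripts/"),
    ("disallow", "/masterpages/"),
    ("disallow", "/scripts/"),
    ("disallow", "/umbraco/*"),
    ("disallow", "/umbraco/controls/"),
    ("disallow", "/umbraco/uComponents/"),
    ("disallow", "/umbraco_client/"),
    ("disallow", "/usercontrols/"),
    ("disallow", "/xslt/"),
    ("allow", "/scripts/"),
    ("allow", "/css/"),
    ("disallow", "/index.php?*"),
]


def pattern_to_regex(pattern):
    """Turn a robots.txt path pattern into a compiled prefix regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(url_path):
            is_allow = kind == "allow"
            # A longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for path in ["/scripts/app.js", "/umbraco/controls/x", "/index.php?id=1", "/css/site.css"]:
        print(path, is_allowed(path))
```

Under these assumptions, /scripts/ paths end up allowed because the Allow rule ties the Disallow rule in length. A parser that applies the first matching rule in file order would block them instead, so the effective behaviour of this file varies between crawlers.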

Other Records

Field Value
sitemap http://www.gruppoeditorialesanpaolo.it/sitemap

Comments

  • Robots.txt for Umbraco
  • Images
  • Disallow: /Search
  • Map