lanova.it
robots.txt

Robots Exclusion Standard data for lanova.it

Resource Scan

Scan Details

Site Domain lanova.it
Base Domain lanova.it
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-11-29T17:38:51+00:00
Next Scan 2026-02-27T17:38:51+00:00

Last Successful Scan

Scanned 2025-03-20T10:26:13+00:00
URL https://www.lanova.it/robots.txt
Domain IPs 89.46.105.15
Response IP 89.46.105.15
Found Yes
Hash 160ed91e0e0c985a1ad427120881afe698f55a760779eaa8a7241345eb5618b3
SimHash a95418748141

Groups

*

Rule Path
Disallow /*?
Disallow /*.json$
Disallow /wip/
Disallow /bckup_wp/
Disallow /testnew/

ia_archiver

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$
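
The wildcard rules listed above ("*" matches any run of characters, a trailing "$" anchors the end of the path) can be hard to read at a glance. The following is a minimal Python sketch of how such rules might be evaluated, assuming the matching behaviour described in RFC 9309 (longest matching pattern wins, Allow wins a tie, no match means allowed). The names GROUPS, pattern_to_regex and is_allowed are hypothetical helpers introduced for illustration, and the exact, case-insensitive user-agent lookup is a simplification of how real crawlers select a group.

```python
import re

# Rules copied from the Groups listing above: (directive, path pattern).
GROUPS = {
    "*": [
        ("disallow", "/*?"),
        ("disallow", "/*.json$"),
        ("disallow", "/wip/"),
        ("disallow", "/bckup_wp/"),
        ("disallow", "/testnew/"),
    ],
    "ia_archiver": [("disallow", "/")],
    "surveybot": [("disallow", "/")],
    "googlebot": [("allow", "/*.css$"), ("allow", "/*.js$")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` under GROUPS.

    Assumption: a crawler with its own group uses only that group and
    otherwise falls back to the '*' group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longest pattern wins; on a tie, allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, is_allowed("examplebot", "/search?q=x") is False because of the "/*?" rule, while is_allowed("googlebot", "/wip/draft.html") is True: a crawler that has its own group is expected to ignore the "*" group, so the generic Disallow lines would not apply to it.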

Other Records

Field Value
sitemap https://www.lanova.it/sitemap.xml

Comments

  • Disable Chinese Baidu crawler
  • User-agent: Baiduspider
  • Disallow: /
  • Removing Documents From the Wayback Machine archive.org
  • Disable *.domaintools.com crawler