biolage.it
robots.txt
Robots Exclusion Standard data for biolage.it
Resource Scan
Scan Details
Site Domain | biolage.it |
Base Domain | biolage.it |
Scan Status | Ok |
Last Scan | 2024-10-03T21:27:45+00:00 |
Next Scan | 2024-10-17T21:27:45+00:00 |
Last Scan
Scanned | 2024-10-03T21:27:45+00:00 |
URL | https://biolage.it/robots.txt |
Redirect | https://www.biolage.it/robots.txt |
Redirect Domain | www.biolage.it |
Redirect Base | biolage.it |
Domain IPs | 104.18.32.154, 172.64.155.102, 2606:4700:4400::6812:209a, 2606:4700:4400::ac40:9b66 |
Redirect IPs | 104.18.32.154, 172.64.155.102, 2606:4700:4400::6812:209a, 2606:4700:4400::ac40:9b66 |
Response IP | 104.18.32.154 |
Found | Yes |
Hash | eaa4a6110ce01366c2821e86a5623c656a35f64b95db7a9570ac39ddef8f9f85 |
SimHash | e9800e15afd2 |
Groups
*
Rule | Path |
---|---|
Disallow | /sitecore |
Disallow | /sitecore/* |
Disallow | /Sitecore |
Disallow | /Sitecore/* |
Disallow | /sitecore_files/ |
Disallow | /sitecore_modules/ |
Disallow | /App_Browsers/ |
Disallow | /App_Config/ |
Disallow | /App_Data/ |
Disallow | /temp/ |
Disallow | /App_Browsers/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.biolage.it/sitemap_biolage_ita.xml |