cerave.nl
robots.txt
Robots Exclusion Standard data for cerave.nl
Resource Scan
Scan Details
Site Domain | cerave.nl |
Base Domain | cerave.nl |
Scan Status | Ok |
Last Scan | 2024-05-03T02:14:34+00:00 |
Next Scan | 2024-05-17T02:14:34+00:00 |
Last Scan
Scanned | 2024-05-03T02:14:34+00:00 |
URL | https://cerave.nl/robots.txt |
Redirect | https://www.cerave.nl/robots.txt |
Redirect Domain | www.cerave.nl |
Redirect Base | cerave.nl |
Domain IPs | 104.18.39.106, 172.64.148.150, 2606:4700:4400::6812:276a, 2606:4700:4400::ac40:9496 |
Redirect IPs | 104.18.39.106, 172.64.148.150, 2606:4700:4400::6812:276a, 2606:4700:4400::ac40:9496 |
Response IP | 104.18.39.106 |
Found | Yes |
Hash | 3474b73987e934f16135c40040e33e0cacd8c8823491419b3c29aaceb43e30e4 |
SimHash | 21001e25cd90 |
Groups
*
Rule | Path |
---|---|
Disallow | /xsl/ |
Disallow | /temp/ |
Disallow | /upload/ |
Disallow | /sitecore |
Disallow | /Sitecore |
Disallow | /App_Data/ |
Disallow | /App_config/ |
Disallow | /App_Browsers/ |
Disallow | /sitecore_files/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.cerave.nl/sitemap-index.xml |
Warnings
- 1 invalid line.