it.caudalie.com
robots.txt
Robots Exclusion Standard data for it.caudalie.com
Resource Scan
Scan Details
Site Domain | it.caudalie.com |
Base Domain | caudalie.com |
Scan Status | Ok |
Last Scan | 2025-06-20T08:23:20+00:00 |
Next Scan | 2025-07-04T08:23:20+00:00 |
Last Scan
Scanned | 2025-06-20T08:23:20+00:00 |
URL | https://it.caudalie.com/robots.txt |
Domain IPs | 2600:9000:271a:3800:14:b634:38c0:93a1, 2600:9000:271a:5c00:14:b634:38c0:93a1, 2600:9000:271a:7000:14:b634:38c0:93a1, 2600:9000:271a:c800:14:b634:38c0:93a1, 2600:9000:271a:ca00:14:b634:38c0:93a1, 2600:9000:271a:e000:14:b634:38c0:93a1, 2600:9000:271a:ea00:14:b634:38c0:93a1, 2600:9000:271a:f000:14:b634:38c0:93a1, 3.165.75.122, 3.165.75.16, 3.165.75.26, 3.165.75.38 |
Response IP | 3.165.75.122 |
Found | Yes |
Hash | 5f3d71b70ecd06eb0f9eef3d121e4f9b572ef95ea2a5634d28a6ad8a97613f30 |
SimHash | ef041c54c331 |
Groups
*
Rule | Path |
---|---|
Disallow | /*.css$ |
Disallow | /*.js$ |
Disallow | /*.php$ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://it.caudalie.com/sitemap.xml |