dataiku.com
robots.txt
Robots Exclusion Standard data for dataiku.com
Resource Scan
Scan Details
Site Domain | dataiku.com |
Base Domain | dataiku.com |
Scan Status | Ok |
Last Scan | 2024-11-03T19:25:59+00:00 |
Next Scan | 2024-11-17T19:25:59+00:00 |
Last Scan
Scanned | 2024-11-03T19:25:59+00:00 |
URL | https://dataiku.com/robots.txt |
Domain IPs | 104.25.147.106, 104.25.148.106, 172.67.83.158, 2606:4700:20::6819:936a, 2606:4700:20::6819:946a, 2606:4700:20::ac43:539e |
Response IP | 104.25.148.106 |
Found | Yes |
Hash | 93f67f6c8576273e53efd2ac5da0b69e9b726c597cedd834f69718c1c3efd00b |
SimHash | 5a09d808c5b2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-login.php |
Disallow | /?p= |
Disallow | /search/ |
Disallow | /wp-content/themes/dataiku/slice/ |
Allow | /wp-content/themes/dataiku/slice/dist/css/ |
Allow | /wp-content/themes/dataiku/slice/dist/js/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.dataiku.com/sitemap_index.xml |