Robots Exclusion Standard data for bioplanet.be
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | bioplanet.be |
Base Domain | bioplanet.be |
Scan Status | Ok |
Last Scan | 2024-09-13T05:20:46+00:00 |
Next Scan | 2024-10-13T05:20:46+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-13T05:20:46+00:00 |
URL | https://bioplanet.be/robots.txt |
Redirect | https://www.bioplanet.be/robots.txt |
Redirect Domain | www.bioplanet.be |
Redirect Base | bioplanet.be |
Domain IPs | 151.101.39.52, 2a04:4e42:9::820 |
Redirect IPs | 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52, 2a04:4e42:200::820, 2a04:4e42:400::820, 2a04:4e42:600::820, 2a04:4e42::820 |
Response IP | 199.232.47.52 |
Found | Yes |
Hash | 7ce1a036cede9928c51aac84f471d168b1e3bf5883b85f8cda77b7217a1071c9 |
SimHash | c8008932c731 |
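
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The scan record does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw response body. The function name `body_sha256` and the constant `RECORDED_HASH` are hypothetical names introduced for illustration.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "7ce1a036cede9928c51aac84f471d168b1e3bf5883b85f8cda77b7217a1071c9"


def body_sha256(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes of robots.txt with SHA-256.
    The scan record itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes) -> bool:
    """Compare a freshly fetched body against the recorded hash."""
    return body_sha256(body) == RECORDED_HASH
```

If a later fetch of the file yields a different digest, that would suggest the file changed since the scan, provided the assumption about the algorithm holds.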
Groups
*
Rule | Path |
---|---|
Allow | /content/bioplanet/*.*.json$ |
Disallow | /threat |
Disallow | /nl/error |
Disallow | /fr/error |
Disallow | /fr-lu/error |
Disallow | /nl/producten/product-detail |
Disallow | /fr/producten/product-detail |
Disallow | /fr/produits/product-detail |
Disallow | /content/bioplanet |
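
The Allow rule and the last Disallow rule overlap: both match paths under /content/bioplanet. Under the Robots Exclusion Protocol (RFC 9309), the most specific match, meaning the longest matching pattern, takes precedence, `*` matches any sequence of characters, and a trailing `$` anchors the end of the path. The sketch below applies that logic to the rules in the table. It is a simplified illustration rather than a full parser: it ignores percent-encoding, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/content/bioplanet/*.*.json$"),
    ("disallow", "/threat"),
    ("disallow", "/nl/error"),
    ("disallow", "/fr/error"),
    ("disallow", "/fr-lu/error"),
    ("disallow", "/nl/producten/product-detail"),
    ("disallow", "/fr/producten/product-detail"),
    ("disallow", "/fr/produits/product-detail"),
    ("disallow", "/content/bioplanet"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # "*" becomes ".*"; every other character is matched literally.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path: str) -> bool:
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

With these rules, a path such as `/content/bioplanet/foo.model.json` is allowed because the Allow pattern is longer than `/content/bioplanet`, while `/content/bioplanet/image.png` stays disallowed.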
Other Records
Field | Value |
---|---|
sitemap | https://www.bioplanet.be/sitemap.xml |
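
For reference, the groups and records above can be turned back into robots.txt syntax. The sketch below does that from the parsed tables. The original file's comments, blank lines, line order, and field capitalization are not recorded in this scan, so the output is an equivalent rendering and is not expected to reproduce the recorded Hash. The names `GROUPS`, `OTHER_RECORDS`, and `render_robots_txt` are hypothetical.

```python
# Parsed data copied from the tables above.
GROUPS = {
    "*": [
        ("Allow", "/content/bioplanet/*.*.json$"),
        ("Disallow", "/threat"),
        ("Disallow", "/nl/error"),
        ("Disallow", "/fr/error"),
        ("Disallow", "/fr-lu/error"),
        ("Disallow", "/nl/producten/product-detail"),
        ("Disallow", "/fr/producten/product-detail"),
        ("Disallow", "/fr/produits/product-detail"),
        ("Disallow", "/content/bioplanet"),
    ],
}
OTHER_RECORDS = [("Sitemap", "https://www.bioplanet.be/sitemap.xml")]


def render_robots_txt(groups, other_records) -> str:
    """Render parsed groups and records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates groups
    lines.extend(f"{field}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, OTHER_RECORDS), end="")
```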