duermecol.com
robots.txt
Robots Exclusion Standard data for duermecol.com
Resource Scan
Scan Details
Site Domain | duermecol.com |
Base Domain | duermecol.com |
Scan Status | Ok |
Last Scan | 2024-09-25T12:20:48+00:00 |
Next Scan | 2024-10-25T12:20:48+00:00 |
Last Scan
Scanned | 2024-09-25T12:20:48+00:00 |
URL | https://duermecol.com/robots.txt |
Redirect | https://www.duermecol.com/robots.txt |
Redirect Domain | www.duermecol.com |
Redirect Base | duermecol.com |
Domain IPs | 2001:8d8:100f:f000::2e9, 217.160.0.88 |
Redirect IPs | 2001:8d8:100f:f000::2e9, 217.160.0.88 |
Response IP | 217.160.0.88 |
Found | Yes |
Hash | 231258206bd315ec7fc96ee15c26fc29032ff2b70896536cfbf4e4ce4094d3a5 |
SimHash | ea568050ca30 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /tienda/carrito/* |
Disallow | /tienda/carrito-add |
Allow | /*.js$ |
Allow | /*.css$ |
Other Records
Field | Value |
---|---|
sitemap | https://www.duermecol.com/sitemap.xml |
sitemap | https://www.duermecol.com/sitemap2.xml |
sitemap | https://www.duermecol.com/sitemap4.xml |
sitemap | https://www.duermecol.com/sitemap5.xml |
Warnings
- 2 invalid lines.