aparici.com
robots.txt
Robots Exclusion Standard data for aparici.com
Resource Scan
Scan Details
| Site Domain | aparici.com |
| Base Domain | aparici.com |
| Scan Status | Ok |
| Last Scan | 2025-12-05T09:04:33+00:00 |
| Next Scan | 2026-01-04T09:04:33+00:00 |
Last Scan
| Scanned | 2025-12-05T09:04:33+00:00 |
| URL | https://aparici.com/robots.txt |
| Redirect | https://www.aparici.com/robots.txt |
| Redirect Domain | www.aparici.com |
| Redirect Base | aparici.com |
| Domain IPs | 104.26.2.72, 104.26.3.72, 172.67.69.120, 2606:4700:20::681a:248, 2606:4700:20::681a:348, 2606:4700:20::ac43:4578 |
| Redirect IPs | 104.26.2.72, 104.26.3.72, 172.67.69.120, 2606:4700:20::681a:248, 2606:4700:20::681a:348, 2606:4700:20::ac43:4578 |
| Response IP | 104.26.2.72 |
| Found | Yes |
| Hash | 534fbdd824fe72d5bd6f0e4317a776bbae689db2d228b222eafbeb15902f73ba |
| SimHash | 0c7dda870333 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /dash/ |
| Disallow | *coleccion%3D* |
| Disallow | *pdf_* |
| Disallow | /reality*? |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.aparici.com/sitemap.xml |
| sitemap | https://www.aparici.com/en/sitemap.xml |