salsarosa.com
robots.txt
Robots Exclusion Standard data for salsarosa.com
Resource Scan
Scan Details
Site Domain | salsarosa.com |
Base Domain | salsarosa.com |
Scan Status | Ok |
Last Scan | 2024-06-29T00:22:55+00:00 |
Next Scan | 2024-07-06T00:22:55+00:00 |
Last Scan
Scanned | 2024-06-29T00:22:55+00:00 |
URL | https://salsarosa.com/robots.txt |
Redirect | https://www.salsarosa.com/robots.txt |
Redirect Domain | www.salsarosa.com |
Redirect Base | salsarosa.com |
Domain IPs | 104.18.12.133, 104.18.13.133, 2606:4700::6812:c85, 2606:4700::6812:d85 |
Redirect IPs | 104.18.12.133, 104.18.13.133, 2606:4700::6812:c85, 2606:4700::6812:d85 |
Response IP | 104.18.13.133 |
Found | Yes |
Hash | decbe4aadea8499d9541c0cdc11bedb164ba87dfb475cf6c5897705c58b28400 |
SimHash | 2d086005ad92 |
Groups
*
Rule | Path |
---|---|
Disallow | /_call* |
Disallow | /*breaking-news-es.json* |
Disallow | /buscador.html?* |
Disallow | /cercador.html?* |
Disallow | /cdn-cgi/rum |
Other Records
Field | Value |
---|---|
sitemap | https://www.salsarosa.com/uploads/feeds/google_sitemap_salsa-rosa.xml |