cronacaossona.com
robots.txt
Robots Exclusion Standard data for cronacaossona.com
Resource Scan
Scan Details
Site Domain | cronacaossona.com |
Base Domain | cronacaossona.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-06-08T10:59:40+00:00 |
Next Scan | 2025-07-08T10:59:40+00:00 |
Last Successful Scan
Scanned | 2025-05-10T10:58:29+00:00 |
URL | https://cronacaossona.com/robots.txt |
Redirect | https://www.cronacaossona.com/robots.txt |
Redirect Domain | www.cronacaossona.com |
Redirect Base | cronacaossona.com |
Domain IPs | 89.46.106.49 |
Redirect IPs | 89.46.106.49 |
Response IP | 89.46.106.49 |
Found | Yes |
Hash | 9a3698882b11e899e7f33e39e9721dcdf6a55fa527f29ac768f7374ba2323db1 |
SimHash | 8d601e974685 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /cgi-bin |
Disallow | /wp-admin |
Disallow | /wp-includes |
Disallow | /wp-content |
Disallow | /e/ |
Disallow | /show-error-* |
Disallow | /xmlrpc.php |
Disallow | /trackback/ |
Disallow | /comment-page- |
Allow | /wp-content/uploads/ |
Allow | /feed |
Other Records
Field | Value |
---|---|
sitemap | http://www.cronacaossona.com/sitemap.xml |
sitemap | http://www.cronacaossona.com/news-sitemap.xml |
sitemap | http://www.cronacaossona.com/sitemap-news.xml |