cronacaossona.com
robots.txt

Robots Exclusion Standard data for cronacaossona.com

Resource Scan

Scan Details

Site Domain cronacaossona.com
Base Domain cronacaossona.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-06-08T10:59:40+00:00
Next Scan 2025-07-08T10:59:40+00:00

Last Successful Scan

Scanned 2025-05-10T10:58:29+00:00
URL https://cronacaossona.com/robots.txt
Redirect https://www.cronacaossona.com/robots.txt
Redirect Domain www.cronacaossona.com
Redirect Base cronacaossona.com
Domain IPs 89.46.106.49
Redirect IPs 89.46.106.49
Response IP 89.46.106.49
Found Yes
Hash 9a3698882b11e899e7f33e39e9721dcdf6a55fa527f29ac768f7374ba2323db1
SimHash 8d601e974685
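
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The scan data does not name the algorithm, so treating it as SHA-256 of the raw response body is an assumption. Under that assumption, a freshly fetched robots.txt could be compared against the recorded value roughly as follows (the function name is hypothetical):

```python
import hashlib

# Hash recorded for the last successful scan (copied from the listing above).
RECORDED_HASH = "9a3698882b11e899e7f33e39e9721dcdf6a55fa527f29ac768f7374ba2323db1"


def matches_recorded_hash(body: bytes) -> bool:
    """Return True if the SHA-256 digest of body equals the recorded hash.

    Assumption: the recorded hash is SHA-256 over the unmodified response
    bytes. The scan report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == RECORDED_HASH
```

A mismatch would not by itself prove the file changed, since the hashing input used by the scanner (for example, normalized versus raw bytes) is not documented here.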

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Disallow /e/
Disallow /show-error-*
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /comment-page-
Allow /wp-content/uploads/
Allow /feed
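
The group above mixes a broad "Allow /" with narrower Disallow rules and two narrower Allow rules. Under the longest-match convention described in RFC 9309, the most specific matching rule decides, and Allow wins a tie. The following is a minimal sketch of that evaluation for this group only; the function names are hypothetical, and a given crawler may evaluate rules differently.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content"),
    ("disallow", "/e/"),
    ("disallow", "/show-error-*"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/trackback/"),
    ("disallow", "/comment-page-"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/feed"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply the longest matching rule; Allow wins when lengths are equal."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

With these rules, is_allowed("/wp-content/uploads/photo.jpg") is True because "/wp-content/uploads/" is longer than "/wp-content", while is_allowed("/wp-content/themes/style.css") is False. Parsers that apply the first matching rule in file order, such as Python's urllib.robotparser, could reach a different answer if "Allow /" really is the first line of the group.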

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile-apps

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

grapeshot

Rule Path
Disallow
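
The grapeshot group has a Disallow rule with an empty path. By convention an empty Disallow blocks nothing, so the group permits everything. A small sketch with Python's standard urllib.robotparser illustrates this; the two robots.txt lines are reconstructed from the listing above and are an assumption about how the original file is written.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the grapeshot group above (assumed file syntax).
GRAPESHOT_LINES = [
    "User-agent: grapeshot",
    "Disallow:",
]


def grapeshot_allowed(url: str) -> bool:
    """Check whether the grapeshot agent may fetch url under the lines above."""
    parser = RobotFileParser()
    parser.parse(GRAPESHOT_LINES)
    return parser.can_fetch("grapeshot", url)
```

For example, grapeshot_allowed("https://www.cronacaossona.com/wp-admin") returns True, even though the same path is disallowed for the "*" group.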

Other Records

Field Value
sitemap http://www.cronacaossona.com/sitemap.xml
sitemap http://www.cronacaossona.com/news-sitemap.xml
sitemap http://www.cronacaossona.com/sitemap-news.xml
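
Sitemap records are independent of the user-agent groups. As a rough sketch, they can be read back with urllib.robotparser (site_maps() requires Python 3.8 or later). The "Sitemap:" lines below are reconstructed from the records above and are an assumption about the original file's syntax.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" listing above (assumed file syntax).
SITEMAP_LINES = [
    "Sitemap: http://www.cronacaossona.com/sitemap.xml",
    "Sitemap: http://www.cronacaossona.com/news-sitemap.xml",
    "Sitemap: http://www.cronacaossona.com/sitemap-news.xml",
]


def list_sitemaps(lines=SITEMAP_LINES):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []
```

Note that the sitemap URLs use http:// while the robots.txt itself was served from https://www.cronacaossona.com/, so a client following them should expect a possible redirect.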