cronacaossona.com
robots.txt

Robots Exclusion Standard data for cronacaossona.com

Resource Scan

Scan Details

Site Domain cronacaossona.com
Base Domain cronacaossona.com
Scan Status Ok
Last Scan2025-12-26T20:43:03+00:00
Next Scan 2026-01-02T20:43:03+00:00

Last Scan

Scanned2025-12-26T20:43:03+00:00
URL https://cronacaossona.com/robots.txt
Redirect https://www.cronacaossona.com/robots.txt
Redirect Domain www.cronacaossona.com
Redirect Base cronacaossona.com
Domain IPs 2a00:6d40:4:3::c243:49, 89.46.106.49
Redirect IPs 2a00:6d40:4:3::c243:49, 89.46.106.49
Response IP 89.46.106.49
Found Yes
Hash 99d1699e7bca7dd65686482a4a1c1f9c9b4933b7368b283217261ce9851cbfc8
SimHash 8d601e974685

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Disallow /e/
Disallow /show-error-*
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /comment-page-
Allow /wp-content/uploads/
Allow /feed

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile-apps

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

grapeshot

Rule Path
Disallow

Other Records

Field Value
sitemap http://www.cronacaossona.com/sitemap.xml
sitemap http://www.cronacaossona.com/news-sitemap.xml
sitemap http://www.cronacaossona.com/sitemap-news.xml