gazzanet.gazzetta.it
robots.txt
Robots Exclusion Standard data for gazzanet.gazzetta.it
Resource Scan
Scan Details
Site Domain | gazzanet.gazzetta.it |
Base Domain | gazzetta.it |
Scan Status | Ok |
Last Scan | 2024-11-15T12:42:28+00:00 |
Next Scan | 2024-11-16T12:42:28+00:00 |
Last Scan
Scanned | 2024-11-15T12:42:28+00:00 |
URL | https://gazzanet.gazzetta.it/robots.txt |
Domain IPs | 34.90.132.17 |
Response IP | 34.90.132.17 |
Found | Yes |
Hash | fe6669f197f631d43adb55fa634fa82146405a1cb00755fe55aa2ec50308727a |
SimHash | 211f52e043f1 |
Groups
*
Rule | Path |
---|---|
Disallow | */commenti/$ |
Disallow | /rcs-community-comments-rest-api/ |
Disallow | /archivio/pagina-*/pagina- |
Disallow | /archivio/page/ |
Disallow | /archivio/categoria/ |
Disallow | /archivio/gallery/ |
Disallow | /archivio/video/ |
Disallow | /*commenti/ |
Disallow | /*?app_v2 |
Disallow | /*?app_v1 |
Other Records
Field | Value |
---|---|
sitemap | https://www.gazzetta.it/sitemaps/sitemap.xml |
sitemap | https://www.gazzetta.it/sitemaps/sitemap-news.xml |