gazzettadelsud.it
robots.txt

Robots Exclusion Standard data for gazzettadelsud.it

Resource Scan

Scan Details

Site Domain gazzettadelsud.it
Base Domain gazzettadelsud.it
Scan Status Ok
Last Scan2024-05-03T13:06:31+00:00
Next Scan 2024-05-10T13:06:31+00:00

Last Scan

Scanned2024-05-03T13:06:31+00:00
URL https://gazzettadelsud.it/robots.txt
Domain IPs 104.26.14.190, 104.26.15.190, 172.67.71.218, 2606:4700:20::681a:ebe, 2606:4700:20::681a:fbe, 2606:4700:20::ac43:47da
Response IP 104.26.15.190
Found Yes
Hash 26c0c9f7f803df31a643ebab25ba70f69d5f123dabb8b7ba1e155a464d8e493b
SimHash 683950454ed3

Groups

*

Rule Path
Disallow /errorpages/
Disallow /articoli/includes/
Disallow /speciali/english/
Disallow /includes/
Disallow /*?refresh_ce
Disallow /*?p=

Other Records

Field Value
sitemap https://gazzettadelsud.it/sitemap-index.xml

Comments

  • Disallow: /articoli/ajax/