ilmeteo.it
robots.txt

Robots Exclusion Standard data for ilmeteo.it

Resource Scan

Scan Details

Site Domain ilmeteo.it
Base Domain ilmeteo.it
Scan Status Ok
Last Scan 2024-05-10T14:19:22+00:00
Next Scan 2024-05-17T14:19:22+00:00

Last Scan

Scanned 2024-05-10T14:19:22+00:00
URL https://ilmeteo.it/robots.txt
Redirect https://www.ilmeteo.it/robots.txt
Redirect Domain www.ilmeteo.it
Redirect Base ilmeteo.it
Domain IPs 104.22.60.141, 104.22.61.141, 172.67.15.175, 2606:4700:10::6816:3c8d, 2606:4700:10::6816:3d8d, 2606:4700:10::ac43:faf
Redirect IPs 104.22.60.141, 104.22.61.141, 172.67.15.175, 2606:4700:10::6816:3c8d, 2606:4700:10::6816:3d8d, 2606:4700:10::ac43:faf
Response IP 104.22.61.141
Found Yes
Hash 477f8f19b9853ee5b199efd61d1c5ceb8f11c41aa8d16afdfe0be3b1bb865c9a
SimHash 4c3adcc00195

Groups

googlebot

Rule Path
Disallow /index2.html
Disallow /portale/index2.html
Disallow /maps/
Disallow /password/
Disallow /892424/
Disallow /portale/manutenzione.html
Disallow /portale/indexbusy.php

mj12bot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

riddler

Rule Path
Disallow /

*

Rule Path
Disallow /*standard%3D1
Disallow /?t=*
Disallow /?f=*
Disallow /?b=*
Disallow /?p=*
Disallow /?page=*
Disallow /portale/node/*/amp
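
As a rough illustration of how a crawler might read the groups above, here is a minimal Python sketch. It assumes the matching conventions of RFC 9309: a crawler uses the group named for it and falls back to `*` only if no named group matches, `*` in a path matches any run of characters, and a trailing `$` anchors the end. The names `GROUPS`, `select_group`, `pattern_to_regex` and `is_disallowed` are hypothetical, the user-agent matching is simplified to a case-insensitive substring test, and percent-encoding normalization is skipped. Because the scan reports 2 invalid lines, the live file may contain more than is reproduced here.

```python
import re

# Groups and Disallow paths as listed in the scan above.
GROUPS = {
    "googlebot": [
        "/index2.html",
        "/portale/index2.html",
        "/maps/",
        "/password/",
        "/892424/",
        "/portale/manutenzione.html",
        "/portale/indexbusy.php",
    ],
    "mj12bot": ["/"],
    "nutch": ["/"],
    "riddler": ["/"],
    "*": [
        "/*standard%3D1",
        "/?t=*",
        "/?f=*",
        "/?b=*",
        "/?p=*",
        "/?page=*",
        "/portale/node/*/amp",
    ],
}


def select_group(user_agent: str) -> str:
    """Pick the named group whose token appears in the user agent, else '*'.

    Simplification (assumption): a case-insensitive substring test.
    """
    ua = user_agent.lower()
    for token in GROUPS:
        if token != "*" and token in ua:
            return token
    return "*"


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def is_disallowed(user_agent: str, path: str) -> bool:
    """True if any Disallow pattern of the selected group matches the path.

    No Allow rules are listed in the scan, so longest-match precedence
    between Allow and Disallow does not need to be modelled here.
    """
    rules = GROUPS[select_group(user_agent)]
    return any(pattern_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, a crawler identifying as Googlebot is checked only against the googlebot group, so a query-string rule such as `/?t=*` from the `*` group would not apply to it, while mj12bot, nutch and riddler are excluded from the whole site.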

Other Records

Field Value
crawl-delay 2
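
The `crawl-delay` field is not part of RFC 9309, and crawlers that honor it generally treat the value as a minimum number of seconds between requests. A minimal Python sketch of that interpretation follows; `fetch_politely` and its parameters are hypothetical names, and the `fetch` callable stands in for whatever HTTP client a crawler actually uses.

```python
import time
from typing import Callable, Iterable, List

CRAWL_DELAY_SECONDS = 2.0  # value reported under "Other Records"


def fetch_politely(
    urls: Iterable[str],
    fetch: Callable[[str], object],
    delay: float = CRAWL_DELAY_SECONDS,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[object]:
    """Call fetch(url) for each URL, keeping at least `delay` seconds
    between the start of consecutive requests (assumed interpretation)."""
    results = []
    last_start = None
    for url in urls:
        if last_start is not None:
            remaining = delay - (clock() - last_start)
            if remaining > 0:
                sleep(remaining)
        last_start = clock()
        results.append(fetch(url))
    return results
```

The `sleep` and `clock` parameters are injectable only so the throttle can be exercised without real waiting.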

Warnings

  • 2 invalid lines.