annunci.ilpiccolo.gelocal.it
robots.txt
Robots Exclusion Standard data for annunci.ilpiccolo.gelocal.it
Resource Scan
Scan Details
Site Domain | annunci.ilpiccolo.gelocal.it |
Base Domain | gelocal.it |
Scan Status | Ok |
Last Scan | 2024-09-23T10:12:00+00:00 |
Next Scan | 2024-10-23T10:12:00+00:00 |
Last Scan
Scanned | 2024-09-23T10:12:00+00:00 |
URL | https://annunci.ilpiccolo.gelocal.it/robots.txt |
Redirect | https://annunci.repubblica.it/robots.txt |
Redirect Domain | annunci.repubblica.it |
Redirect Base | repubblica.it |
Domain IPs | 213.92.16.205 |
Redirect IPs | 213.92.16.205 |
Response IP | 213.92.16.205 |
Found | Yes |
Hash | e54090a88558651c01415caad7505da7652aea5bcae9fbc02f36c327bb185c88 |
SimHash | 80044bd71fd4 |
Groups
*
Rule | Path |
---|---|
Disallow | /redirect.html |
Disallow | /clickaway.html |
Disallow | /gestione/ |
Disallow | /data/ |
Disallow | /adcontacts/ |
Disallow | /pscontacts/ |
Disallow | /ricerca/ |
Disallow | /mappa/ |
Disallow | /griglia/ |
Disallow | /css/ |
Disallow | /js/ |
Disallow | /flip/ |
Disallow | /greybox/ |
Disallow | /my/ |
Disallow | *?tc=1$ |
Disallow | */ord-* |
Disallow | *?from=* |
Disallow | */ultimericerche.html |
Disallow | /images/v2/mappa* |