gazzettalavoro.it
robots.txt
Robots Exclusion Standard data for gazzettalavoro.it
Resource Scan
Scan Details
Site Domain | gazzettalavoro.it |
Base Domain | gazzettalavoro.it |
Scan Status | Ok |
Last Scan | 2024-11-04T16:50:21+00:00 |
Next Scan | 2024-11-11T16:50:21+00:00 |
Last Scan
Scanned | 2024-11-04T16:50:21+00:00 |
URL | https://gazzettalavoro.it/robots.txt |
Redirect | https://www.gazzettalavoro.it/robots.txt |
Redirect Domain | www.gazzettalavoro.it |
Redirect Base | gazzettalavoro.it |
Domain IPs | 2600:1901:0:11f5::, 34.149.210.165 |
Redirect IPs | 2600:1901:0:11f5::, 34.149.210.165 |
Response IP | 34.149.210.165 |
Found | Yes |
Hash | 99786279851f2d38a9f9bc2296876f0d3bae50395aec1aff89d9ccbce0dbb6da |
SimHash | 0054c360c771 |
Groups
*
Rule | Path |
---|---|
Allow | /$ |
Allow | /ads.txt$ |
Allow | /amp/* |
Allow | /editor/* |
Disallow | /* |