thelocal.it
robots.txt
Robots Exclusion Standard data for thelocal.it
Resource Scan
Scan Details
Site Domain | thelocal.it |
Base Domain | thelocal.it |
Scan Status | Ok |
Last Scan | 2024-09-26T16:56:10+00:00 |
Next Scan | 2024-10-03T16:56:10+00:00 |
Last Scan
Scanned | 2024-09-26T16:56:10+00:00 |
URL | https://thelocal.it/robots.txt |
Redirect | https://www.thelocal.it/robots.txt |
Redirect Domain | www.thelocal.it |
Redirect Base | thelocal.it |
Domain IPs | 104.18.4.188, 104.18.5.188, 2606:4700::6812:4bc, 2606:4700::6812:5bc |
Redirect IPs | 104.18.4.188, 104.18.5.188, 2606:4700::6812:4bc, 2606:4700::6812:5bc |
Response IP | 104.18.5.188 |
Found | Yes |
Hash | f1aac0f33036f000f8c5bda55b14be5eb065d9521f023dc04e303c41132e051a |
SimHash | 5d2950d18f80 |
Groups
*
Rule | Path |
---|---|
Disallow | *.js* |
Disallow | *.css* |
Disallow | /200* |
Disallow | /2010* |
Disallow | /2011* |
Disallow | /2012* |
Disallow | /2013* |
Disallow | /amp* |
Disallow | /fonts* |
Disallow | /index.php/ |
Disallow | *cms.* |
Disallow | *medium.* |
Disallow | *medium2.* |
Disallow | *discuss.* |
Other Records
Field | Value |
---|---|
sitemap | https://www.thelocal.it/sitemap/it/news.xml |
sitemap | https://www.thelocal.it/sitemap/index.xml |