asteannunci.it
robots.txt
Robots Exclusion Standard data for asteannunci.it
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | asteannunci.it |
Base Domain | asteannunci.it |
Scan Status | Ok |
Last Scan | 2024-10-27T07:56:18+00:00 |
Next Scan | 2024-11-26T07:56:18+00:00 |
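
The two timestamps above imply a rescan interval, which the report does not state explicitly. A minimal sketch of the arithmetic, assuming both values are ISO 8601 timestamps in UTC as shown (the variable names are illustrative, not part of the report):

```python
from datetime import datetime, timedelta

# Timestamps copied from the "Scan Details" table.
last_scan = datetime.fromisoformat("2024-10-27T07:56:18+00:00")
next_scan = datetime.fromisoformat("2024-11-26T07:56:18+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```

This suggests a 30-day schedule for this record, although a single pair of dates is not enough to confirm that the interval is fixed.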
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-27T07:56:18+00:00 |
URL | https://asteannunci.it/robots.txt |
Redirect | https://www.asteannunci.it/robots.txt |
Redirect Domain | www.asteannunci.it |
Redirect Base | asteannunci.it |
Domain IPs | 104.26.12.239, 104.26.13.239, 172.67.73.63, 2606:4700:20::681a:cef, 2606:4700:20::681a:def, 2606:4700:20::ac43:493f |
Redirect IPs | 104.26.12.239, 104.26.13.239, 172.67.73.63, 2606:4700:20::681a:cef, 2606:4700:20::681a:def, 2606:4700:20::ac43:493f |
Response IP | 172.67.73.63 |
Found | Yes |
Hash | aef4b2c9ceb24772a26299e1bce728c2f5ba198271f0b907e0df1fb734417698 |
SimHash | cd7d65794d91 |
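
The Hash value is 64 hexadecimal digits long, which is consistent with SHA-256, but the report does not name the algorithm or say which bytes were hashed. The sketch below therefore assumes SHA-256 over the raw response body; `sha256_hex` and `matches_recorded_hash` are hypothetical helper names, and a mismatch may only mean that the assumption is wrong or that the file has changed since the scan.

```python
import hashlib

# Value copied from the "Hash" row of the Last Scan table.
RECORDED_HASH = "aef4b2c9ceb24772a26299e1bce728c2f5ba198271f0b907e0df1fb734417698"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a fetched robots.txt body against the recorded hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

A caller would pass the bytes downloaded from the Redirect URL to `matches_recorded_hash` to see whether the file still matches this scan.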
Groups
*
Rule | Path |
---|---|
Disallow | /aste/*/allegati/download |
Disallow | /en/aste/*/allegati/download |
Disallow | /aste/*/vicinanze |
Disallow | /en/aste/*/vicinanze |
Disallow | /news/embed |
Disallow | /convegni/embed |
mail.ru
dotbot
blexbot
blexbot/1.0
istellabot
istellabot/1.01.18
istellabot/1.01.18 +http://www.tiscali.it/
istellabot/1.10.2 +http://www.tiscali.it/
mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)
turnitinbot
mj12bot
smtbot
smtbot/1.0
alphabot
alphaseobot
alphaseobot-sa
seekbot
seekport crawler
linguee bot
Rule | Path |
---|---|
Disallow | * |
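
The two groups above can be read as follows: the listed crawlers are disallowed everywhere, and every other crawler is kept out of the six listed paths. The sketch below illustrates that reading with a small wildcard matcher in the spirit of RFC 9309, where `*` matches any run of characters and a trailing `$` anchors the end of the path. It rests on several assumptions: user agents are matched by case-insensitive substring, the bare `*` path in the second group is treated as matching every URL, and no Allow rules exist (none are listed). The function names are illustrative and are not taken from any library.

```python
import re

# Disallow paths from the "*" group.
DEFAULT_DISALLOW = [
    "/aste/*/allegati/download",
    "/en/aste/*/allegati/download",
    "/aste/*/vicinanze",
    "/en/aste/*/vicinanze",
    "/news/embed",
    "/convegni/embed",
]

# User-agent tokens from the second group (lowercase, as listed).
BLOCKED_AGENTS = [
    "mail.ru", "dotbot", "blexbot", "blexbot/1.0", "istellabot",
    "istellabot/1.01.18",
    "istellabot/1.01.18 +http://www.tiscali.it/",
    "istellabot/1.10.2 +http://www.tiscali.it/",
    "mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)",
    "turnitinbot", "mj12bot", "smtbot", "smtbot/1.0", "alphabot",
    "alphaseobot", "alphaseobot-sa", "seekbot", "seekport crawler",
    "linguee bot",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a prefix-matching regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """True if any pattern matches the path from its first character."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


def is_allowed(user_agent: str, path: str) -> bool:
    """Apply the second group to listed crawlers, the "*" group otherwise."""
    ua = user_agent.lower()
    if any(token in ua for token in BLOCKED_AGENTS):
        return not is_disallowed(path, ["*"])
    return not is_disallowed(path, DEFAULT_DISALLOW)


print(is_allowed("ExampleCrawler/1.0", "/aste/123/allegati/download"))
print(is_allowed("ExampleCrawler/1.0", "/aste/123"))
print(is_allowed("Mozilla/5.0 (compatible; MJ12bot/v1.4.8)", "/"))
```

`ExampleCrawler/1.0` is a made-up user agent used only for illustration. Python's standard `urllib.robotparser` is not used here because it compares paths by plain prefix and would treat the `*` in these rules literally.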
Other Records
Field | Value |
---|---|
sitemap | https://www.asteannunci.it/sitemap.xml |