asteannunci.it
robots.txt

Robots Exclusion Standard data for asteannunci.it

Resource Scan

Scan Details

Site Domain asteannunci.it
Base Domain asteannunci.it
Scan Status Ok
Last Scan2024-10-27T07:56:18+00:00
Next Scan 2024-11-26T07:56:18+00:00

Last Scan

Scanned2024-10-27T07:56:18+00:00
URL https://asteannunci.it/robots.txt
Redirect https://www.asteannunci.it/robots.txt
Redirect Domain www.asteannunci.it
Redirect Base asteannunci.it
Domain IPs 104.26.12.239, 104.26.13.239, 172.67.73.63, 2606:4700:20::681a:cef, 2606:4700:20::681a:def, 2606:4700:20::ac43:493f
Redirect IPs 104.26.12.239, 104.26.13.239, 172.67.73.63, 2606:4700:20::681a:cef, 2606:4700:20::681a:def, 2606:4700:20::ac43:493f
Response IP 172.67.73.63
Found Yes
Hash aef4b2c9ceb24772a26299e1bce728c2f5ba198271f0b907e0df1fb734417698
SimHash cd7d65794d91

Groups

*

Rule Path
Disallow /aste/*/allegati/download
Disallow /en/aste/*/allegati/download
Disallow /aste/*/vicinanze
Disallow /en/aste/*/vicinanze
Disallow /news/embed
Disallow /convegni/embed

mail.ru
dotbot
blexbot
blexbot/1.0
istellabot
istellabot/1.01.18
istellabot/1.01.18 +http://www.tiscali.it/
istellabot/1.10.2 +http://www.tiscali.it/
mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)
turnitinbot
mj12bot
smtbot
smtbot/1.0
alphabot
alphaseobot
alphaseobot-sa
seekbot
seekport crawler
linguee bot

Rule Path
Disallow *

Other Records

Field Value
sitemap https://www.asteannunci.it/sitemap.xml