greatestate.it
robots.txt

Robots Exclusion Standard data for greatestate.it

Resource Scan

Scan Details

Site Domain greatestate.it
Base Domain greatestate.it
Scan Status Ok
Last Scan 2024-10-01T15:33:49+00:00
Next Scan 2024-10-31T15:33:49+00:00

Last Scan

Scanned 2024-10-01T15:33:49+00:00
URL https://greatestate.it/robots.txt
Redirect https://www.greatestate.it/robots.txt
Redirect Domain www.greatestate.it
Redirect Base greatestate.it
Domain IPs 206.189.12.141
Redirect IPs 206.189.12.141
Response IP 206.189.12.141
Found Yes
Hash 22ac3b15ffb8ec241f4871a2864850673f07902131d8b1320b542bbe6c28a90e
SimHash ce5ff0426eb9

Groups

*

Rule Path
Disallow /img/
Disallow /email/
Disallow /inc/email/

teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
webcopier
offline explorer pro
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel
dotbot
dotbot/1.1
semrushbot
semrushbot/6~bl
ahrefsbot
ahrefsbot/6.1
blexbot
blexbot/1.0
mj12bot

Rule Path
Disallow /
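
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the tables in this report, so its exact layout is an assumption, and the /img/logo.png and /en/ paths and the "Googlebot" agent are hypothetical examples chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# User agents listed in the second group of this report (lowercased, as shown there).
BLOCKED_AGENTS = [
    "teleport", "teleportpro", "emailcollector", "emailsiphon", "webbandit",
    "webzip", "webreaper", "webstripper", "web downloader", "webcopier",
    "offline explorer pro", "httrack website copier", "offline commander",
    "leech", "websnake", "blackwidow", "http weazel", "dotbot", "dotbot/1.1",
    "semrushbot", "semrushbot/6~bl", "ahrefsbot", "ahrefsbot/6.1",
    "blexbot", "blexbot/1.0", "mj12bot",
]

# Reconstructed robots.txt lines (assumption: the real file may be laid out differently).
lines = [
    "User-agent: *",
    "Disallow: /img/",
    "Disallow: /email/",
    "Disallow: /inc/email/",
    "",  # a blank line ends the first group
]
lines += [f"User-agent: {name}" for name in BLOCKED_AGENTS]
lines += ["Disallow: /"]

parser = RobotFileParser()
parser.parse(lines)

base = "https://www.greatestate.it"

# A crawler named in the second group is denied everything.
blocked_root = parser.can_fetch("ahrefsbot", base + "/")

# Any other crawler falls back to the "*" group.
generic_img = parser.can_fetch("Googlebot", base + "/img/logo.png")  # hypothetical path
generic_page = parser.can_fetch("Googlebot", base + "/en/")          # hypothetical path

print(blocked_root, generic_img, generic_page)
```

Under these assumptions, a listed crawler such as ahrefsbot is refused the site root, while an unlisted crawler is refused only the /img/, /email/ and /inc/email/ prefixes.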

Other Records

Field Value
sitemap https://www.greatestate.it/sitemapindex.xml
sitemap https://www.greatestate.it/sitemap_it.xml
sitemap https://www.greatestate.it/sitemap_en.xml
sitemap https://www.greatestate.it/sitemap_ru.xml
sitemap https://www.greatestate.it/sitemap_de.xml
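
The sitemap records above can be read back with the same standard-library parser. This is a sketch under stated assumptions: it needs Python 3.8 or later for site_maps(), the "Sitemap:" lines are reconstructed from the table rather than copied from the live file, and treating the filename suffix (it, en, ru, de) as a language code is an interpretation, not something the report states.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in this report, written back as robots.txt lines.
sitemap_lines = [
    "Sitemap: https://www.greatestate.it/sitemapindex.xml",
    "Sitemap: https://www.greatestate.it/sitemap_it.xml",
    "Sitemap: https://www.greatestate.it/sitemap_en.xml",
    "Sitemap: https://www.greatestate.it/sitemap_ru.xml",
    "Sitemap: https://www.greatestate.it/sitemap_de.xml",
]

sm_parser = RobotFileParser()
sm_parser.parse(sitemap_lines)

# site_maps() returns None when no sitemap records were found.
sitemaps = sm_parser.site_maps() or []

# Assumption: "sitemap_<xx>.xml" carries a two-letter language code.
languages = []
for url in sitemaps:
    filename = url.rsplit("/", 1)[1]
    if filename.startswith("sitemap_") and filename.endswith(".xml"):
        languages.append(filename[len("sitemap_"):-len(".xml")])

print(len(sitemaps), languages)
```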