hagalil.com
robots.txt

Robots Exclusion Standard data for hagalil.com

Resource Scan

Scan Details

Site Domain hagalil.com
Base Domain hagalil.com
Scan Status Ok
Last Scan2025-05-26T11:36:14+00:00
Next Scan 2025-06-02T11:36:14+00:00

Last Scan

Scanned2025-05-26T11:36:14+00:00
URL https://hagalil.com/robots.txt
Domain IPs 217.160.0.39
Response IP 217.160.0.39
Found Yes
Hash 7bb0d4c41716465fd3366e26bc1a68647f0761fe4196b73ee542a5233db46bd9
SimHash d369b176ce5f

Groups

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
bytespider
semrushbot
semrushbot/1.1~bl
linkpadbot
ahrefsbot/5.1
chinaclaw
custo
disco
ecatch
eirgrabber
emailsiphon
emailwolf
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
interget
jetcar
larbin
leechftp
navroad
nearsite
netants
netspider
netzip
octopus
pagegrabber
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
voideye
webauto
webcopier
webfetch
webleacher
webreaper
websauger
webstripper
webwhacker
webzip
wget
widow
wwwoffle
zeus

Rule Path
Disallow /

*

Rule Path
Disallow /bb/
Disallow /cgi-bin/
Disallow /system/
Disallow /newsletter/
Disallow /01/
Disallow /random/
Disallow /services/

Comments

  • Sitemap: http://www.hagalil.com/sitemap.xml