webcala.net
robots.txt

Robots Exclusion Standard data for webcala.net

Resource Scan

Scan Details

Site Domain webcala.net
Base Domain webcala.net
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-06-11T21:22:42+00:00
Next Scan 2025-08-10T21:22:42+00:00

Last Successful Scan

Scanned 2025-04-13T21:21:57+00:00
URL https://webcala.net/robots.txt
Domain IPs 109.94.209.214
Response IP 109.94.209.214
Found Yes
Hash 7e2127e5069568d826986df7056a38d7943023188ab1ad4d57aac4d3d484c9b9
SimHash 521e4d476e60

Groups

*

Rule Path
Disallow /404/
Disallow /adm/
Disallow /adv/
Disallow /bones/
Disallow /bootstrap/
Disallow /cgi-bin/
Disallow /code/
Disallow /Connections/
Disallow /css/
Disallow /geo/
Disallow /js/
Disallow /skycom/
Disallow /timer/
Disallow /eurofood/
Disallow /forum/
Disallow /resume/
Disallow /rulya/
Disallow /sess/
Disallow /statya/add-statya/
Disallow /system/
Disallow /tatyana/
Disallow /test/
Disallow /timeline/
Disallow /topeny/
Disallow /vendor/
Disallow /polzovatelskoye-soglasheniye/
Disallow *shablon*
Disallow *test*
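
The last two rules in this group use `*` wildcards rather than plain path prefixes. A minimal sketch of how such patterns are commonly matched (`*` as any run of characters, an optional trailing `$` as an end anchor) is shown below. The function names and the sample paths are hypothetical, only a few of the rules above are included, and the sketch ignores `Allow` rules and longest-match precedence. Python's standard `urllib.robotparser` does plain prefix matching, which is why a small regex translation is used here.

```python
import re

def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")      # trailing '$' pins the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' in place of each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """True if any pattern matches from the start of the path."""
    return any(robots_pattern_to_regex(p).match(path) for p in patterns)

# A few rules copied from the group above.
RULES = ["/adm/", "/statya/add-statya/", "*shablon*", "*test*"]

print(is_disallowed("/adm/users", RULES))           # prefix rule
print(is_disallowed("/page/shablon-1.html", RULES)) # wildcard rule
print(is_disallowed("/statya/", RULES))             # no rule applies
```

Note that under this reading `*test*` also covers any path that merely contains the substring, such as a hypothetical `/contest/` page, so it is broader than the `/test/` prefix rule listed above it.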

ahrefsbot
apptusbot
blp_bbot
businessdbbot
ccbot
covarioids
phantom
seznambot
surveybot
converacrawler
curl/
discobot
download ninja
email exractor
ezooms
fdm 3.x
flaxcrawler
grapeshot
grabber
gslfbot
bubing
heritrix
webindex
httrack
intelium_bot
istellabot
java/
prodvigatorbot
searchmetricsbot
genieo
wesee
wesee:ads/pagebot
wesee:ads/picturebot
lemurwebcrawler
libwww-perl
metamojicrawler
mj12bot
yyspider
aboundexbot
nutch
xovibot
openacoon
php/
webzip
plukkie
proximic
python-urllib
ruby
semrushbot
skimbot
seokicks
spbot
panscient
turnitinbot
wbsearchbot
weblexbot
voltron
wget
wire/0.
zyborg
shopwiki
sentibot
megaindex.ru
xovibot
advbot
memorybot
smtbot
easouspider
domainappender
baiduspider
baiduspider-video
baiduspider-image
sogou web spider

Rule Path
Disallow /
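
The effect of this second group can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a small excerpt of the file from the data listed here. It assumes each listed crawler appears on its own `User-agent:` line within one group, which the scan output suggests but does not show verbatim. The test URLs and the `Googlebot` agent are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups above (not the full file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /adm/
Disallow: /test/

User-agent: ahrefsbot
User-agent: semrushbot
User-agent: mj12bot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# A listed crawler is denied everything, including the home page.
blocked = parser.can_fetch("AhrefsBot/7.0", "https://webcala.net/")

# An unlisted crawler falls back to the '*' group.
open_page = parser.can_fetch("Googlebot", "https://webcala.net/statya/")
closed_page = parser.can_fetch("Googlebot", "https://webcala.net/adm/login")

print(blocked, open_page, closed_page)
```

`urllib.robotparser` compares user-agent tokens case-insensitively and matches paths by prefix only, so it is suitable for the prefix rules and the blanket `Disallow /` but not for the wildcard rules in the `*` group.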

Other Records

Field Value
sitemap https://webcala.net/sitemap.xml

Warnings

  • 1 invalid line.
  • `clean-param` is not a known field.
  • `host` is not a known field.
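
Warnings of this kind can be reproduced with a simple line audit. The sketch below is an assumption about how such a check might work, not the scanner's actual logic: the `KNOWN_FIELDS` set, the `audit_robots` helper, and the sample lines (including the `Host` and `Clean-param` values) are all hypothetical. `host` and `clean-param` are non-standard extensions that some search engines recognize, which is why a strict parser reports them as unknown.

```python
# Fields a strict parser might accept (assumed list, not the scanner's own).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def audit_robots(text: str) -> list[tuple[int, str]]:
    """Return (line number, warning) pairs for a robots.txt body."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()   # drop comments and padding
        if not line:
            continue
        if ":" not in line:
            warnings.append((number, "invalid line"))
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            warnings.append((number, f"`{field}` is not a known field"))
    return warnings

# Hypothetical sample; the real file's contents are not shown in this report.
SAMPLE = """\
User-agent: *
Disallow: /adm/
Host: webcala.net
Clean-param: ref /
this line has no separator
Sitemap: https://webcala.net/sitemap.xml
"""

for number, message in audit_robots(SAMPLE):
    print(number, message)
```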