cv.lt
robots.txt

Robots Exclusion Standard data for cv.lt

Resource Scan

Scan Details

Site Domain cv.lt
Base Domain cv.lt
Scan Status Ok
Last Scan 2025-04-11T20:29:15+00:00
Next Scan 2025-05-11T20:29:15+00:00

Last Scan

Scanned 2025-04-11T20:29:15+00:00
URL https://cv.lt/robots.txt
Redirect https://www.cv.lt/robots.txt
Redirect Domain www.cv.lt
Redirect Base cv.lt
Domain IPs 104.26.0.116, 104.26.1.116, 172.67.73.195, 2606:4700:20::681a:174, 2606:4700:20::681a:74, 2606:4700:20::ac43:49c3
Redirect IPs 104.26.0.116, 104.26.1.116, 172.67.73.195, 2606:4700:20::681a:174, 2606:4700:20::681a:74, 2606:4700:20::ac43:49c3
Response IP 104.26.0.116
Found Yes
Hash 15185f4a2eca111013ce289b39821ddb4254bfcf37ff14a83f84895e29905ed0
SimHash a6498c313566

Groups

*

Rule Path
Disallow /banner/
Disallow /download/
Disallow /pdf/skelbimai/
Disallow /employee/announcementsAll.jsp
Disallow /employee/printLDBBulletin.do
Disallow /employee/redirect.do
Disallow /volunteering/announcements.do
Disallow /extra/index.do
Disallow /employer/register.do
Disallow /employer/demoSearch.do
Disallow /employer/viewCvNoContacts.do
Disallow /vzmailer/
Allow /vzmailer/wn-archive/
Disallow relaunch.cv.lt
Disallow /view-archive/
Disallow /wap/
Disallow /flash/
Disallow /slaptazodis
Disallow /reset.do
Disallow /ecp.do
Disallow /INLogin.do
Disallow /invitePage.do
Disallow /cp/events/*.jsp
Allow /
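The rules in the * group overlap: /vzmailer/ is disallowed while /vzmailer/wn-archive/ is allowed, and a catch-all Allow / sits at the end. The sketch below shows one way a crawler might resolve them, using longest-match precedence as described in RFC 9309, with Allow winning a tie. It is an illustration under stated assumptions, not the scanner's or any crawler's actual implementation. The names RULES, pattern_to_regex and is_allowed are hypothetical, the sample paths in the usage note are invented for illustration, and skipping the entry "relaunch.cv.lt" (which does not start with "/") is a choice of this sketch.

```python
import re

# Rules transcribed from the "*" group above, in listed order.
RULES = [
    ("Disallow", "/banner/"),
    ("Disallow", "/download/"),
    ("Disallow", "/pdf/skelbimai/"),
    ("Disallow", "/employee/announcementsAll.jsp"),
    ("Disallow", "/employee/printLDBBulletin.do"),
    ("Disallow", "/employee/redirect.do"),
    ("Disallow", "/volunteering/announcements.do"),
    ("Disallow", "/extra/index.do"),
    ("Disallow", "/employer/register.do"),
    ("Disallow", "/employer/demoSearch.do"),
    ("Disallow", "/employer/viewCvNoContacts.do"),
    ("Disallow", "/vzmailer/"),
    ("Allow", "/vzmailer/wn-archive/"),
    ("Disallow", "relaunch.cv.lt"),
    ("Disallow", "/view-archive/"),
    ("Disallow", "/wap/"),
    ("Disallow", "/flash/"),
    ("Disallow", "/slaptazodis"),
    ("Disallow", "/reset.do"),
    ("Disallow", "/ecp.do"),
    ("Disallow", "/INLogin.do"),
    ("Disallow", "/invitePage.do"),
    ("Disallow", "/cp/events/*.jsp"),
    ("Allow", "/"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, path in rules:
        # Assumption of this sketch: entries that are not path patterns
        # (such as "relaunch.cv.lt") are skipped rather than interpreted.
        if not path.startswith(("/", "*")):
            continue
        if pattern_to_regex(path).match(url_path):
            length = len(path)
            # The longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "Allow"):
                best_len = length
                allowed = kind == "Allow"
    return allowed
```

For example, is_allowed("/vzmailer/send") is False, while is_allowed("/vzmailer/wn-archive/2024.html") is True because the Allow pattern is longer than the Disallow pattern it overlaps. A parser that instead applies rules in listed order (first match wins) would block that second path, so the result depends on the precedence model the crawler uses.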