cflow.lt
robots.txt

Robots Exclusion Standard data for cflow.lt

Resource Scan

Scan Details

Site Domain cflow.lt
Base Domain cflow.lt
Scan Status Ok
Last Scan 2024-08-24T07:21:49+00:00
Next Scan 2024-09-23T07:21:49+00:00

Last Scan

Scanned 2024-08-24T07:21:49+00:00
URL https://cflow.lt/robots.txt
Redirect https://www.cflow.lt/robots.txt
Redirect Domain www.cflow.lt
Redirect Base cflow.lt
Domain IPs 13.33.88.120, 13.33.88.15, 13.33.88.81, 13.33.88.97
Redirect IPs 108.128.72.146, 54.216.252.255, 54.73.26.109
Response IP 108.128.72.146
Found Yes
Hash 45153ad676ed29ce5260b780acdd6840d9c5034095ac1d4affed3b66f3b06c82
SimHash 9a451c853d54

Groups

*

Rule Path
Disallow /admin/
Disallow /delayed_job/
Disallow /dashboard
Disallow /users/sign_up
Disallow /users/sign_in
Disallow /users/password/new
Disallow /locale/
Disallow /invoices/
Disallow /accounting/
Disallow /users/new
Disallow /dashboard/tax-payments
Disallow /users/log-in
Disallow /accountant/journal
Disallow /clients/new
Disallow /dashboard/revenue-declarations
Disallow /dashboard/expenses-strategy-edit
Disallow /clients/show
Disallow /accountant/invoices
Disallow /accountant/expenses
Disallow /wp-admin
Disallow /revenue-widget
Disallow /expenses/new
Disallow /historical-incomes/edit
Disallow /expenses/edit
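
As a rough illustration of how a crawler might evaluate the rules listed above, the following sketch feeds a subset of them to Python's standard-library urllib.robotparser. The robots.txt text is reconstructed from the table rather than copied from the live file. The helper names build_parser and is_allowed, the sample path /pricing, and the choice of www.cflow.lt as the host are assumptions made for this example only.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from a subset of the Disallow rules in the "*" group above.
# This is NOT the verbatim file served by the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /dashboard
Disallow: /users/sign_in
Disallow: /invoices/
"""

# Assumed host for building test URLs (the scan reports a redirect to it).
BASE = "https://www.cflow.lt"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` under the parsed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    # "/pricing" is a hypothetical path used only to show an allowed URL.
    for path in ("/", "/pricing", "/admin", "/admin/users",
                 "/dashboard/tax-payments", "/users/sign_in"):
        verdict = "allowed" if is_allowed(parser, path) else "blocked"
        print(f"{path}: {verdict}")
```

This parser matches rules by simple path prefix, so /dashboard already covers /dashboard/tax-payments and the other /dashboard/... entries in the table, while /admin (no trailing slash) is not covered by the /admin/ rule. Individual crawlers may apply their own matching logic, so treat the results as indicative only.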

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /