Data Science Notebooks

Data science gets done in notebooks. This website exists to compare the features in different data science notebook tools.

Robert Lacok
About the author

My name is Robert Lacok, and I’m a data notebook enthusiast. Because I keep on top of the latest developments in the space, I wanted to share it with the world.

I’m also a product manager at Deepnote. I try to be unbiased — if you believe any tools are missing or misrepresented, please email me or open a pull request on GitHub.

Need help?

If you need help picking a data notebook for your next project, feel free to reach out to me at my personal email address. I’d be happy to chat about the pros and cons of each solution.

View all notebooks

Databricks Notebooks

A screenshot of Databricks Notebooks

Collaborate across engineering, data science, and machine learning teams with support for multiple languages, built-in data visualizations, automatic versioning, and operationalization with jobs.

WebsiteAlternativesExamples

Databricks Notebooks capabilities

Setup

Is it managed?

Fully managed (setup in minutes)

Can you self-host?

You can self-host (setup in hours)

Features

Is it Jupyter compatible?

Jupyter-compatible

Programming languages

Jupyter languages (e.g. Python, R)

What kind of data sources can you connect to?

Connect with Jupyter libraries (e.g. SQLAlchemy, psycopg2)
Connect to data warehouses (Databricks)

What kind of data visualization can you do?

Jupyter data visualization (e.g. Matplotlib, Altair, Plotly)
UI for building charts

Reactivity

No reactivity, you decide the execution order

Notebook scheduling

Notebook scheduling is built in

Management

Reproducibility

Environments are reproducible by default

Version history

Version history is built in

Collaborative editing

Multiple editors at the same time

Comments

Comment on items within a notebook

Notebook organization

View notebooks in a list

Licensing

License

Proprietary

Price

Free tier
Pay for compute

Databricks Notebooks examples

Build a streaming ML application with Spark
Build an application to monitor credit card fraud in real time.
Open →
Intro to Delta Lake
Bringing ACID transactions to Apache Spark and big data workloads.
Open →
Machine learning on tabular data
MLFlow in an end-to-end ML lifecycle for tabular data.
Open →

Alternatives to Databricks Notebooks

ToolSetupJupyter compatibilityProgramming languagesData visualizationCollaborative editingPricingLicense
Databricks Notebooks
Self-hosted or fully managedJupyter-compatibleJupyterVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Jupyter
Compare with
Databricks Notebooks
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Amazon Sagemaker
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFree and paid optionsProprietary
Google Colab
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFree and paid optionsProprietary
Deepnote
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Hex
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
DataCamp Workspace
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
JupyterLab
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
CoCalc
Self-hosted or fully managedJupyter-compatibleJupyterVisualize with codeRealtime collaborationFree and paid optionsOpen source
Jetbrains Datalore
Fully managedJupyter-compatibleJupyterVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Kaggle
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFreeProprietary
Nextjournal
Fully managedJupyter-compatibleJupyterVisualize with codeRealtime collaborationFree and paid optionsProprietary
Noteable
Self-hosted or fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
nteract
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Observable
Fully managedNoneJSVisualize with codeRealtime collaborationFree and paid optionsProprietary
Query.me
Fully managedNoneSQLVisualization UI builderFile-based or asynchronous collaborationFree and paid optionsProprietary
VS Code
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Mode Notebooks
Fully managedNoneSQL, R, PythonVisualization UI builderFile-based or asynchronous collaborationUnknownProprietary
Querybook
Self-hostedNonePythonVisualization UI builderRealtime collaborationFreeOpen source
Zeppelin
Self-hostedNonePython, SQL, SparkUnknownFile-based or asynchronous collaborationFreeOpen source
Husprey
Fully managedNoneSQLVisualization UI builderFile-based or asynchronous collaborationFree and paid optionsProprietary
Pluto.jl
Self-hostedNoneJuliaUnknownFile-based or asynchronous collaborationFreeOpen source
Polynote
Self-hostedNoneScala, PythonUnknownFile-based or asynchronous collaborationFreeOpen source
Zepl
Fully managedUnknownPython, RUnknownRealtime collaborationUnknownProprietary
Count
Fully managedNoneSQL, PythonVisualization UI builderUnknownUnknownProprietary

Data Science Notebooks

Popular notebook tools

DeepnoteGoogle ColabHexJupyterJetbrains DataloreSagemaker