Data Science Notebooks

Data science gets done in notebooks. This website exists to compare the features in different data science notebook tools.

Robert Lacok
About the author

My name is Robert Lacok, and I’m a data notebook enthusiast. Because I keep on top of the latest developments in the space, I wanted to share it with the world.

I’m also a product manager at Deepnote. I try to be unbiased — if you believe any tools are missing or misrepresented, please email me or open a pull request on GitHub.

Need help?

If you need help picking a data notebook for your next project, feel free to reach out to me at my personal email address. I’d be happy to chat about the pros and cons of each solution.

View all notebooks

Amazon Sagemaker vs Zeppelin

Comparing two data science notebooks.

Amazon Sagemaker

Zeppelin

A screenshot of Amazon Sagemaker

Amazon SageMaker helps data scientists and developers to prepare, build, train, and deploy high-quality machine learning (ML) models quickly by bringing together a broad set of capabilities purpose-built for ML.

WebsiteAlternatives
A screenshot of Zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Setup

Is it managed?

Fully managed (setup in hours)

Is it managed?

No, you must host it yourself

Can you self-host?

No, you must use a managed offering

Can you self-host?

You can self-host (setup in hours)

Features

Is it Jupyter compatible?

Jupyter-compatible

Is it Jupyter compatible?

Not Jupyter-compatible

Programming languages

Jupyter languages (e.g. Python, R)

Programming languages

Python
SQL
Spark

What kind of data sources can you connect to?

Connect with Jupyter libraries (e.g. SQLAlchemy, psycopg2)
AWS

What kind of data sources can you connect to?

What kind of data visualization can you do?

Jupyter data visualization (e.g. Matplotlib, Altair, Plotly)

What kind of data visualization can you do?

Unknown

Reactivity

No reactivity, you decide the execution order

Reactivity

No reactivity, you decide the execution order

Notebook scheduling

Notebook scheduling with additional tools

Notebook scheduling

Notebook scheduling is built in

Management

Reproducibility

There is no support for reproducibility

Reproducibility

With effort, you can make reproducible environments

Version history

File-based (use Git)

Version history

File-based (use Git)

Collaborative editing

No support for collaborative editors

Collaborative editing

File-based (use Git)
Multiple editors, one at a time (asynchronous)

Comments

File-based (use GitHub)

Comments

File-based (use GitHub)

Notebook organization

View notebooks in a list

Notebook organization

File-based

Licensing

License

Proprietary

License

Open-source (Apache 2.0)

Price

Free tier
Pay for compute

Price

Free

Alternatives to Amazon Sagemaker and Zeppelin

Compare Amazon Sagemaker and Zeppelin with other data science notebook tools.

ToolSetupJupyter compatibilityProgramming languagesData visualizationCollaborative editingPricingLicense
Amazon Sagemaker
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFree and paid optionsProprietary
Zeppelin
Self-hostedNonePython, SQL, SparkUnknownFile-based or asynchronous collaborationFreeOpen source
Jupyter
Compare with
Amazon Sagemaker, Zeppelin
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Google Colab
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFree and paid optionsProprietary
Deepnote
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Hex
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Databricks Notebooks
Self-hosted or fully managedJupyter-compatibleJupyterVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
DataCamp Workspace
Fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
JupyterLab
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
CoCalc
Self-hosted or fully managedJupyter-compatibleJupyterVisualize with codeRealtime collaborationFree and paid optionsOpen source
Hyperquery
Fully managedNonePython, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Jetbrains Datalore
Fully managedJupyter-compatibleJupyterVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
Kaggle
Fully managedJupyter-compatibleJupyterVisualize with codeLimited collaborationFreeProprietary
Nextjournal
Fully managedJupyter-compatibleJupyterVisualize with codeRealtime collaborationFree and paid optionsProprietary
Noteable
Self-hosted or fully managedJupyter-compatibleJupyter, SQLVisualize with code or UIRealtime collaborationFree and paid optionsProprietary
nteract
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Observable
Fully managedNoneJSVisualize with codeRealtime collaborationFree and paid optionsProprietary
Query.me
Fully managedNoneSQLVisualization UI builderFile-based or asynchronous collaborationFree and paid optionsProprietary
VS Code
Self-hostedJupyter-compatibleJupyterVisualize with codeFile-based or asynchronous collaborationFreeOpen source
Mode Notebooks
Fully managedNoneSQL, R, PythonVisualization UI builderFile-based or asynchronous collaborationUnknownProprietary
Querybook
Self-hostedNonePythonVisualization UI builderRealtime collaborationFreeOpen source
Count
Fully managedNoneSQLVisualization UI builderRealtime collaborationFree and paid optionsProprietary
Husprey
Fully managedNoneSQLVisualization UI builderFile-based or asynchronous collaborationFree and paid optionsProprietary
Pluto.jl
Self-hostedNoneJuliaUnknownFile-based or asynchronous collaborationFreeOpen source
Polynote
Self-hostedNoneScala, PythonUnknownFile-based or asynchronous collaborationFreeOpen source
Zepl
Fully managedUnknownPython, RUnknownRealtime collaborationUnknownProprietary

Data Science Notebooks

Popular notebook tools

DeepnoteGoogle ColabHexJupyterJetbrains DataloreSagemaker