Altair is a declarative statistical visualization library for Python. With Altair, you can spend more time understanding your data and its meaning. Altair's API is simple, friendly and consistent and built on top of the powerful Vega-Lite JSON specification. This elegant simplicity produces beautiful and effective visualizations with a minimal amount of code. Altair is developed by Jake Vanderplas and Brian Granger in close collaboration with the UW Interactive Data Lab.
Note: Altair's documentation is currently in a very incomplete form; we are in the process of creating more comprehensive documentation. Stay tuned!
See Altair's Documentation Site, as well as Altair's Tutorial Notebooks.
Here is an example using Altair to quickly visualize and display a dataset with the native Vega-Lite renderer in the JupyterLab:
import altair as alt
# to use with Jupyter notebook (not JupyterLab) run the following
# alt.renderers.enable('notebook')
# load a simple dataset as a pandas DataFrame
from vega_datasets import data
cars = data.cars()
alt.Chart(cars).mark_point().encode(
x='Horsepower',
y='Miles_per_Gallon',
color='Origin',
)
Altair provides a Python API for building statistical visualizations in a declarative manner. By statistical visualization we mean:
DataFrame
that consists of columns of different data types (quantitative, ordinal, nominal and date/time).DataFrame
is in a tidy format
where the rows correspond to samples and the columns correspond to the observed variables.The Altair API contains no actual visualization rendering code but instead emits JSON data structures following the Vega-Lite specification. The resulting Vega-Lite JSON data can be rendered in the following user-interfaces:
To use Altair for visualization, you need to install two sets of tools
The core Altair Package and its dependencies
The renderer for the frontend you wish to use (i.e. Jupyter Notebook
,
JupyterLab
, or nteract
)
Altair can be installed with either pip
or with conda
.
For full installation instructions, please see
https://altair-viz.github.io/getting_started/installation.html
We maintain a separate Github repository of Jupyter Notebooks that contain an interactive tutorial and examples:
https://github.com/altair-viz/altair_notebooks
To launch a live notebook server with those notebook using binder, click on the following badge:
Many excellent plotting libraries exist in Python, including the main ones:
Each library does a particular set of things well.
However, such a proliferation of options creates great difficulty for users as they have to wade through all of these APIs to find which of them is the best for the task at hand. None of these libraries are optimized for high-level statistical visualization, so users have to assemble their own using a mishmash of APIs. For individuals just learning data science, this forces them to focus on learning APIs rather than exploring their data.
Another challenge is current plotting APIs require the user to write code, even for incidental details of a visualization. This results in unfortunate and unnecessary cognitive burden as the visualization type (histogram, scatterplot, etc.) can often be inferred using basic information such as the columns of interest and the data types of those columns.
For example, if you are interested in a visualization of two numerical columns, a scatterplot is almost certainly a good starting point. If you add a categorical column to that, you probably want to encode that column using colors or facets. If inferring the visualization proves difficult at times, a simple user interface can construct a visualization without any coding. Tableau and the Interactive Data Lab's Polestar and Voyager are excellent examples of such UIs.
We believe that these challenges can be addressed without the creation of yet another visualization library that has a programmatic API and built-in rendering. Altair's approach to building visualizations uses a layered design that leverages the full capabilities of existing visualization libraries:
This approach enables users to perform exploratory visualizations with a much simpler API initially, pick an appropriate renderer for their usage case, and then leverage the full capabilities of that renderer for more advanced plot customization.
We realize that a declarative API will necessarily be limited compared to the full programmatic APIs of Matplotlib, Bokeh, etc. That is a deliberate design choice we feel is needed to simplify the user experience of exploratory visualization.
Altair requires the following dependencies:
If you have cloned the repository, run the following command from the root of the repository:
pip install -e .[dev]
If you do not wish to clone the repository, you can install using:
pip install git+https://github.com/altair-viz/altair
To run the test suite you must have py.test installed. To run the tests, use
py.test --pyargs altair
(you can omit the --pyargs
flag if you are running the tests from a source checkout).
We welcome any input, feedback, bug reports, and contributions via Altair's GitHub Repository. In particular, we welcome companion efforts from other visualization libraries to render the Vega-Lite specifications output by Altair. We see this portion of the effort as much bigger than Altair itself: the Vega and Vega-Lite specifications are perhaps the best existing candidates for a principled lingua franca of data visualization.
We are also seeking contributions of additional Jupyter notebook-based examples in our separate GitHub repository: https://github.com/altair-viz/altair_notebooks.
The altair users mailing list can be found at https://groups.google.com/forum/#!forum/altair-viz
Altair is the brightest star in the constellation Aquila, and along with Deneb and Vega forms the northern-hemisphere asterism known as the Summer Triangle.
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。