Navigating the magical data visualisation forest
Speaker: Dr Margriet Groenendijk from IBM
Margriet is a tech enthusiast at IBM and gave a lightning talk at the Django Bristol Bath meetup as well.
This talk is about using Jupyter notebooks, https://jupyter.org/, for data analysis and visualisations.
NB: Notebooks can be run on the desktop and are available in the cloud as well.
Libraries Margriet has used:
- matplotlib with pandas.
- seaborn – https://seaborn.pydata.org/
- PixieDust is an open source package developed by David Taieb and Margriet Groenendijk. It is a wrapper around various libraries, which it exposes as GUI options in the Jupyter notebook. This is an amazingly useful tool for data scientists and others who would like to explore their data without learning as much code or the differences between each library.
It can load pandas and Spark data frames. It can also load data via URLs, which is very helpful for cloud-based notebooks.
PixieDust provides a serious number of options with less code, which suits busy people, those exploring data, and data science newbies.
seaborn has a nice map-based visualisation.
PixieDust integrates with Google, Mapbox, and seaborn.
- PixieDust can be used to build interactive apps in Jupyter. Who knew? For example, you can make the visualisation of your data more interactive, e.g. by changing which column is mapped. Jinja2 templates are used.
- Apps can be published using a PixieGateway server, https://www.ibm.com/support/knowledgecenter/DSXDOC/pixiedust/pixiegateway.html – NB: This is a particular server you need access to. It is also open source here: https://github.com/pixiedust/pixiegateway
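To give a feel for the "matplotlib with pandas" starting point mentioned in the list above, here is a minimal sketch of loading tabular data into a pandas data frame and plotting it. It was not shown in the talk: the sample data, the column names, and the helper names `load_visitors` and `plot_visitors` are all made up for illustration. The data is read from an in-memory string so the example runs offline, though `pd.read_csv` also accepts a URL. PixieDust is left out of the sketch because its GUI only renders inside a notebook.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so this also runs outside a notebook
import pandas as pd

# Hypothetical sample data, invented for this sketch.
CSV_TEXT = """city,year,visitors
Bristol,2017,120
Bristol,2018,150
Bath,2017,80
Bath,2018,95
"""


def load_visitors(source):
    """Read CSV data from a file-like object, a path, or a URL into a DataFrame."""
    return pd.read_csv(source)


def plot_visitors(df):
    """Draw one line per city and return the matplotlib Axes."""
    # Reshape so each city becomes a column indexed by year.
    pivoted = df.pivot(index="year", columns="city", values="visitors")
    ax = pivoted.plot(marker="o")
    ax.set_xlabel("year")
    ax.set_ylabel("visitors")
    return ax


df = load_visitors(io.StringIO(CSV_TEXT))
ax = plot_visitors(df)
```

Even this small plot needs a reshape step and a few labelling calls, which is the kind of per-library detail that a GUI wrapper such as PixieDust aims to hide.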
The technical lead for PixieDust, David Taieb https://twitter.com/dtaieb55, has also published this book: Thoughtful Data Science – https://www.safaribooksonline.com/library/view/thoughtful-data-science/9781788839969/