Big data visualization overview

UX for Big Data

What is Big Data?

Lots of different definitions can be found for "big data," but they all have one aspect in common: big data is inconvenient. It's too big to fit on screen, or in memory, or on disk. There are more fields than are easy to articulate.  And it is so ill-organized and messy that it will take a fair bit of nursing to get it into usable shape.

How is it different?

Big data requires both rethinking the front-end and the back-end. Slow database queries mean that users cannot take advantage of the massive speeds of a big database; the wrong visual presentations means that visual clutter overwhelms the data.

Million lines visualization

Back-End Solutions

These challenges are about tradeoffs.

Indexes and pre-aggregated data get precision, at the cost of flexibility. Approximations gain flexibility, but lose precision. Distributed systems can keep both, at the cost of reliability

Each of these calls for new visual representations.

Front-End Approaches

A big-data scatterplot can turn into a formless blob; a big-data line chart can become an overlapping crowd. An approximate query should have its imprecision reflected in the user interface.

We can choose points in the data space to address these ideas.

This talk from the OpenVis Conference 2015, Boston, provides an overview of my approach.

Get involved

This is an exciting conversation, and a huge opportunity for researchers from the human-computer interaction (HCI), database, and data visualization communities.

Some of the places this has been discussed:

We also focused on this topic at two different Dagstuhl workshops

Dive Deeper

These papers explore some of the technical challenges around big data analytics.

Overview of Challenges

User Experiences with Approximations