Editor’s Note: This article has been lightly edited and updated from the original post that ran on Medium.
Like most of what surrounds us, data is a human-made fiction. Which means that it, and whatever we understand it to be, has been made up by us (and by us I mean humans). So whether you understand data to be neutral or truthful or biased or reliable, it is regardless made up by us. But, we still live in the world and produce data and gather it and make things with it. Things like, of course, data visualization.
So you know the old joke about fish who swim in water but don’t know what water is? There are these two young fish swimming along, and they happen to meet an older fish swimming the other way, who nods at them and says, “Morning, boys. How’s the water?” The two young fish swim on for a bit, and then eventually one of them looks over at the other and goes, “What the hell is water?” The point of the fish story is merely that the most obvious, important realities are often the ones that are hardest to see and talk about… And in our case, I think, that is data.
As professionals working in data visualization, we often take data for granted. And it could be argued that the general public has come to think of data and its visualizations as something neutral and indisputable. But the data, the designer, and even the readers can and will influence the understanding and nuance of a piece.
Data visualization is often described as incorporating elements from many disciplines: map design principles from cartography, conventions of displaying numerical data from statistics, best practices for the use of type, layout and color from graphic design, and principles of writing and storytelling from journalism. While this allows for a lot of creative freedom, it is also a double-edged sword: It imbues data visualization with certain predetermined expectations. Predominantly, these are the common perceptions that data visualization is impartial and portrays “the truth.”
However, there’s interpretation at every turn—from the nature of the data itself, to the role of the designer, to the perceptions of the audience.
It’s often thought that the data being used is neutral, unbiased, and objective, and that in visually mapping, rendering and displaying information the graphic devices used do not obscure the content they display.
The problematic assumption that data is neutral comes from it being described more often than not as existing in a “raw” state that represents collected phenomena with a disembodied nature from its human agent. Data is thought to be pre-existing or “out there,” ready to be harvested and represented as transparently as possible. Data invokes ideas of a resource to be exploited or a material to be manipulated (so much so that we even come up with ridiculous terms such as “data mining”).
But the increasing relevance of data science and data visualization has also prompted the interrogation of what we even understand data itself to be, as well as how we formulate claims of truth. And, borrowing Lisa Gitelman’s expression, “There is no such thing as raw data: it’s an oxymoron.” Data is anything but raw, and we shouldn’t think of data as a natural resource but as a cultural one that needs to be generated, protected, and interpreted. Data—and its collection, manipulation and presentation—both empowers and disempowers the people and the interests that are represented (or misrepresented) by it.
Turning data into structured information means giving shape to the data. By designing visualizations, the designer is an agent in the whole process, and, inevitably, another piece of data. Just as numbers and data don’t simply exist, data visualizations don’t, either: They are created for specific reasons, with a purpose, and by someone. The role of the designer includes either all or part of the thinking, collecting, documenting, mapping, analyzing, reflecting, translating, synthesizing, and concluding around the data and the graphics. And any graphic representation of information is phenomenologically distinct to the information itself.
But it is rather the understanding of how this someone shapes the piece that is most relevant. Whether they are the generators or facilitators of content, the designer’s view of the world shapes their design.
As humans, we are intimately and physically connected to the work we produce, and it is inevitable that our work bears our stamp. Every dataset can be visualized in different ways, and visualizations can be more or less effective depending on the shape they take. Moreover, individual designs, as well as ideas about designing, are related to wider sets of values; at its best, graphic design is merely reflecting the sentiments that a culture at large is already feeling. The designer is a communicator who has agency in the message they are translating into visual form.
Despite our best efforts, what we do is never truly neutral, nor purely objective. And it shouldn’t be. We say that visualizations are successful when they communicate “what the data has to say” with efficiency. But graphics do not transparently reveal data — really, they create a new interpretation of it. Perhaps the question is not one of truthfulness but of reliability, so that designers can reject the discourse of objectivity and instead actively adopt a reflexive position where they are able to understand the relationship between their own and others’ ideological assumptions in the acts of communication and meaning-making.
Lastly, it is important to place greater emphasis on the responsibility that the reader has as a consumer of information. Rather than demanding transparency on the side of the data visualizer, they should acknowledge that the choice to engage or not with a data design piece is ultimately down to them, and that they must do so knowing that what they are consuming is not only the product of someone else’s mind, but also that of a broader discourse-at-large. Their challenge as readers is in understanding the multiplicity of methods that can comprise a design language.
Data is everywhere: in every headline, and at the center of every conversation. Only a few months after the Covid pandemic began, data became an essential language to understand and make sense of a rapidly changing world. But how fluent in this language is the general public? Just as we learn how to read, we need to learn how to read data visualizations: how to spot bad statistics, understand context; how uncertainty can and should be visualized, and how missing data is also part of the story.
Data visualization is increasingly a way in which we translate the information we produce, so it is crucial that we recognise how much of our lives is mediated and shaped by it. Seeing Data, for example, is a group of research projects that aim to understand the place of data visualizations in society. It also publishes resources to help non-experts develop their ability to make sense of data visualizations. In a data-driven world, we need to learn how to critically engage with the graphic devices that surround us.
The amount of data we produce, collect, and process is only going to grow. Add to this purposely erroneous information produced by humans (read: fake news), as well as the inevitable proliferation of machine, AI-produced content yet to come. The world we live in is increasingly shaped by datasets, algorithms, and trained models. As well as ensuring we train critical data designers, it is imperative that as readers we become ever more critical and skeptical of the content we consume, and equipped to judge its veracity and credibility.
Closing thoughts: nothing changes if nothing changes
So what is our role as data designers? How do we avoid the catastrophe? For starters, we can see ourselves more as storytellers than as truth-revealers—as experts endowed with the power to create form, solve problems, pass judgment, and convey meaning. Data visualization is the result of knowledge translation, just as any other technology—a clock, a compass, an abacus, a map, a book—is.
We need to use our expertise and knowledge of design, the power of data, and the available technologies to enable the exploration of (complex) information with intelligent, meaningful pieces. Below is a proposed framework for data designers:
1. Don’t trust data
By using data and representing it visually in our projects, we confirm its authenticity unless we explicitly undermine its validity. Always ask: Where does the data come from? Who collected it, organized it, categorized it? When did they make it? Why did they make it? What social/cultural conditions was it made under? Just because we use data as our material doesn’t mean we should blindly believe in it.
2. Go on a date with the data
Enquire, be curious, spend some time with the data: it is where you pave the grounds, set the tone, understand the material you are working with. Gathering, understanding, and organizing the data is like shopping for paint and brushes. Things which seem quite simple like a data inventory are actually a key part of the process. Before we get on to designing, we need to fully understand what it is we are designing.
3. Look around the data
More often than not, the data lacks data (or we lack data completely). Be resourceful: A lot of information is available out there. Make connections, complement the data. Consider bias in data collection, inclusions/exclusions, gaps or omission in the data, provenance and data quality. The raw data is not always needed, but how you made sense of it and where it came from can expose the decisions that contributed to the creation of data visualizations.
4. Don’t be afraid of the missing data
Keep in mind that data visualization design prioritizes certainty, yet uncertainty is just as important to visualize as certainty. It often tells you more than that which is there. Design is more about the message you are transmitting with the data, than about the data itself.
5. Be transparent (about the data and about the design)
Data visualizations need to be situated in time and space to counter the visual rhetoric that connotes omniscience that suggests data has been gathered, and is being viewed, from nowhere — a position often confused with objectivity. Subjects who acknowledge the contingency of their own position in the world produce greater objectivity than those who claim to be neutral. In this way, we can slowly shift from the current emphasis on objectivity in favor of designs that facilitate pathways to multiple truths.
Neither data, its visualization, nor its visualizer represent a single truth, but rather offer a constructed, fallible, and subjective view of the world. It is crucial that as readers we recognize how much of our lives are mediated and shaped by data visualization, and that this calls for greater comprehension of the graphic realm we constantly engage with. Visualization has a disruptive, revolutionary potential, and the agency of designers needs to be recognized as a force to shape the world they aim to represent. If as designers we can create visuals that encourage careful reading and personal engagement, people will find more and more real value in data and in what it represents. So, whatever you do, don’t forget: data is water.
Elena is a graphic designer trained in London and currently based in the Netherlands. After exploring a passion for typography, and as way through information design, maps, and signage systems, she found her place in data visualisation, graduating in 2021 from the MA in Data Visualisation at London College of Communication. She subsequently joined data design studio CLEVER°FRANKE in Utrecht, where she works as a visual designer responsible for creating beautiful and engaging designs that effectively communicate complex data. She has a deep understanding of how to design with data, and is adept at creating designs that are both aesthetically attractive and highly functional.