|
Hey folks, As 2025 is winding down, I want to encourage you to think about your goals for 2026! For many people designing an effective visualization and then implementing it with the tool of their choice is too much to take on at once. I think this is why many researchers recycle approaches that they see in the literature or that their mentors insist they use. Of course, this perpetuates problematic design practices. What if you could break out of these practices? What if you could tell your mentors, colleagues, reviewers, and anyone else what the strengths and weaknesses are of what you are trying to do versus what they are advising you to do? I have spent a lot of time creating my own plots, critiquing those of others, and reading the ideas of leaders in the field of data visualization. As you know, I’ve shared many of these ideas in this newsletter and in my YouTube videos. I’m excited to work with you more directly. On January 9th (1-4 PM Eastern), I will be offering a 3-hour Zoom workshop introducing you to the principles that drive effective data visualizations in science. There will be no coding in this workshop. Aside from Zoom to watch along, all you’ll need is some paper and a pen - if you have different colored pens you’ll be in even better shape. What will I talk about? I’ll tell you the importance of aligning your audience and the format with your data visualization. I’ll give you fancy language like pre-attentive attributes to help you talk with your colleagues about your visualizations. You’ll be (re)introduced to the grammar of graphics framework, which will enable you to dissect any data visualization. Finally, I’ll describe strategies to align the form and function of your visualizations. Data visualization is hard! This interactive workshop will give you greater confidence to design your own visualizations that effectively convey your science to your audience. I’ll lead you through the material by sharing numerous examples from the popular media and scientific literature. You’re also encouraged to bring your favorite visualization to share with other participants and any visualizations you are already working on. If this sounds like how you want to start your 2026, click the button below to learn more
Because a single workshop isn’t enough to put the ideas into practice, I will also be making myself available for one-on-one and group coaching sessions. If you are interested in these sessions, please reply to this email. Last week I introduced you to a cool microbial ecology paper recently published in Nature Microbiology by Bakkeren and colleagues, “Strain displacement in microbiomes via ecological competition”. On Monday I provided a critique of Figure 2 from this paper. As you may recall, in last week’s newsletter I discussed panels f and g from the figure. I also recreated these panels in Wednesday’s livestream. Today I want to talk about how I’d make panels b through e: They’re all the same basic panel each describing a different type of competition. What stands out to me about these panels is that they have a “cartoon” embedded in them to explain the panel’s experiment. I really thought this was slick. I especially liked how they used the same colors in the cartoon that they use for the points and the lines. It’s crystal clear to me that the red data is from the invading strain and the blue data is the resident strain. How would we make this in R? For the sake of conversation, let’s just think about panel c. This is actually a scatter plot. We could use How about the line through the points? One approach would be to create a separate data frame that has the median density at each time point and then use that as the data for a call to Looking at the y-axis you might notice some interesting numbers. The title and the text both have numbers in superscripts. We can get that using the Now that I’ve mentioned
The height value will scale the height of the image in pixels, so finding the right size will require some fiddling. Of course, we’ll need to use I’m planning on building this out in Wednesday’s livestream (9AM Eastern), so be on the lookout for that video. While I’m talking about livestreams… Can I tell you how much I’m learning by doing these? In each of these a viewer will make a comment like, “Why don’t you do it this way?” or “If you do it this way, then you can do this”. In each of these cases, “this way” never occurred to me. I never would have tried it “this way” if I had been recording and editing videos like I was a year ago. If I’m learning, then I’m sure others are too!
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...