|
Hey folks! I’m appreciating the positive feedback on Monday critique videos. They’re a lot of fun to think through and make. I think I might start looking at figures that are drawn from the scientific literature since many of you found out about me from my science work. Let me know if there are plots or practices that you’d like to see me talk about. I’ll see if I can work them into the queue. Also, if you’re working on developing figures for a presentation, poster, or paper and would like to work with me to come up with more effective designs, please don’t hesitate to get in touch. This week I’m returning to Our World in Data for a visualization. Someone sent me a similar plot from OWID from a story about vaping. I’m starting to notice that news outlets tend to have a set of visualization approaches that they use over and over. It becomes a bit like “if your only tool is a hammer, all your problems look like nails”. Something to think about… I’m probably biting off more than I can chew with this plot. I think I probably picked it because I’m already thinking ahead to how I might refactor it :) This is a set of stacked bar plots showing the causes of death in 2023 and how three media outlets covered different types of deaths in the same year. Perhaps you’ve heard the adage, “if it bleeds, it leads”. I think that is what this plot is trying to show. I’m not sure that I see a consistent partisan bias here. I was pleased to find the code used to collect the data and data used in the story available elsewhere on their site. They have a nice Python notebook that walks through their data collection and curation steps. The CSV file is already in a tidy format with columns for the Let’s start with stacked bar plots. These are straightforward to create using We’ll need to use facets to get the separation of the actual and reported causes of death. I’d add another variable to my data frame to indicate whether the data were actual percentage or media-reported percentages. I would then use Now for the titles. It might be easiest to make the top line of the title the actual Those arrows might be a bit funky. I’d likely add them using Ok, I still think this is a lot of work. But, by breaking it down for you, I’m starting to believe that I can implement this figure in a two-hour livestream. What do you think? Be sure to tune in on Monday to see what I like about this visual and then again on Wednesday morning to see me implement it. Let me know if you’d like to see a better way of representing the same data using a dot plot. Have I mentioned how I hate stacked bar plots? :)
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...