|
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like Illustrator to compose their figures. I don’t hope to convince everyone to uninstall Illustrator, but hopefully people can be more careful about moving axes independently of the data! Yikes. Also, in the radial volcano plot we saw that they were plotting the raw P-values rather than the adjusted P-values. None of these observations have resulted in fundamental changes to the interpretation of the data, but they do show signs of carelessness in working with data. If I’m honest, it reminds me of the very real problems caused by image splicing and manipulation that was rampant in the scientific literature. Again, I don’t think I’m finding data manipulation that warrants retractions or even corrections, but it does cause me some concern. What do you think? Is this a real problem or a nothing burger? This week, I have another figure from Nature Microbiology to share with you. It’s Figure 1b from the paper, “A drug–microbiome–drug interaction impacts co-prescribed medications for Parkinson’s disease”, which was authored by Andrew Verdegaal and colleagues in Andy Goodman’s lab at Yale University. Before talking about this panel, go check out panel 2c for another volcano plot. Similar to last week’s volcano plot, they seem to have used y-axis labels that don’t fit with the y-axis title. I like this label format, but the axis title needs some help. Interestingly, in panels 1j and 3j they use the traditional formatting of the y-axis. Anyway, I’ve talked enough about volcano plots. Let’s talk about heat maps :) I’ll have plenty to say about this plot in Monday’s critique video. For now, let’s think about how we’d go about composing this panel. When I look at this panel, I instinctively break it down into 5 parts. The first is the flow diagram from tolcapone to tolcapone M2 in the left-hand column. I’d create this “plot” using With these five images, how do we compose them together? You might already have caught on that the answer isn’t Illustrator :) Let’s try to use In the past we’ve talked about gathering together scales/legends with The “trick” here is that the labels on the x-axis will need to be the same in each of the four plots. That should be easy enough to pull off. I would likely use Finally, as a bit of foreshadowing to the critique video, the “rules” I mentioned a few weeks ago included putting the most important variables on the x and y-axis and to put things next to each other that you want me to compare. With these rules in mind, I’m becoming convinced that 90% of what is displayed as a heatmap should not be a heatmap. Can you sketch a version of this panel that doesn’t use heatmaps?
|
Hey folks, If you’ve watched any of my livestreams when someone asks why I don’t get ChatGPT or something to do a task for me, you probably saw a pained expression come across my face. Part of me dies every time someone tells me that they used some LLM chatbot to solve a problem. I have many reasons for despising the fascination with AI-based tools. I even wrote a commentary that I submitted to mBio in the fall of 2024. Yes, I wrote it. By hand. Then I typed it. No really, I typed it on a...
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...