|
Hey folks, If you missed Wednesday’s livestream, I encourage you to go back and check it out. I recreated a panel from a paper published in Nature that is pretty typical. It was made up entirely of photographs. Sometimes I feel like I’m the only PI that doesn’t merge panels into figures using Illustrator or Powerpoint. I prefer to use R with some help from The benefit for reproduciblity wasn’t what I loved most about this livestream. I was all set to use the This week I’m going to share panels i and j from Figure 1 of a recently published paper in Nature, “A mechanism to initiate emergency type 2 myelopoiesis”. This paper has a lot of panels like this one. Based on the text of the paper and the appearance of the panels, I’m pretty sure these were made with R’s These panels have different data but are the same basic design. There’s a lot going on in these plots that I could comment on (wait for Monday’s video!). But for today, I’d like to make a couple of general points. First, check out where the x-axis crosses the y-axis in both panels. By default, I consider the expansion below bar plots one of the few “bad practices” that
I think that the easiest way to get rid of it would be to add
The Second, I think these bar plots are an example of “chartjunk”. They are so common in the figures I look at that I think I must be missing something. Yet, I have yet to find an instructions to authors document for a journal that indicates a preference for these plots. I also don’t see examples of published reviewer comments asking for them. Why do I think they’re chartjunk? They don’t add anything to the plot and can actually hinder the interpretation of the data. The bars extend to the mean of the data. That’s a lot of ink to represent something that could have been indicated by a horizontal line. Although it’s not the case here, I often see bars drawn when there are only 2 or 3 points (for an example see last week’s critique video). How can they hinder interpretation? Without thinking of it, we compare the lengths of things. We compare the lengths of those bars. Because of that it is critical that the bars start at zero. If they don’t very little differences between bars can look quite large. I showed several examples of plots violating this “rule” in scientific literature in last week’s critique video. If one were to only show the points then we go from comparing the length of things to comparing their relative position. Because of this, zooming in on the y-axis isn’t such a problem once the bars are removed. Finally, I’ll roll out this paper that gives compelling reasons why bar plots are typically never the right answer. I encourage you to give it a read. I think it would be an interesting paper to talk about at a future journal club or lab meeting. If you do, please let me know how your discussion goes
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...