|
Hey folks, I’ve now produced three livestream videos. What do you think? Do you watch them live or watch them later? Or are they too long? I’m looking for honest feedback! I have to admit that if I hadn’t livestreamed these videos, they would not have been produced. It’s nice that I can more or less record and post without any editing. This is still a bit of an experiment. I think fewer people are watching the episodes which makes me worry that this might be an overall step backwards for you all. I want what I do to have maximum benefit, so please don’t hesitate to respond to this email and let me know what you think. Yesterday morning, I received a newsletter from Philip Bump who writes a column for the The Washington Post. He has a couple of newsletters, but this one is an “add on” to his columns where he shares more of the data behind what goes into his columns. Although not overly complicated, I thought this would be a fun “basic” plot for beginners but enough ornamentation for more advanced R users. This plot was an add on to his column on a generational rift in the Democratic Party in the aftermath of the New York City mayoral primary election. In this plot he uses March 2025 data from Gallup to compare how the two parties differ in their support for Israelis versus Palestinans. So, how would I go about making this plot? We need the data. If you go to the Gallup article, the second plot has three tabs. One each for Democrats, Republicans, and Independents. The plots show the percent, by party, who support Israelis or Palestinians. In the lower left corner of the plot is a link to “Get the data”, which downloads a CSV-formatted file for the data in each plot. We’ll need to get both the Democrat and Republican datasets. Also, we’ll need to go back to the first plot and get the data for “All Americans”. For each of these files, we’ll need to read them in and join them into a single tibble. We can read the three files in to a single tibble using Again, at the fundamental level, this is a line plot with three groups. We can do this in Now for the ornamentation. First, the axes will need some help. There are no axis titles or ticks. Those can be removed with Second, the gridline choices are “interesting”. The y-axis gridlines look fairly standard. However, we’ll have to add a thicker black line at zero. For the x-axis gridlines he has one at 2016 and October 7, 2023. We’ll have to make those x-axis gridlines and the zero line using Finally, there is text in the right hand margin indicating what each line represents. We can place the text using All in all, this should be a less intense plot than what I’ve been making lately. At the same time, we get to practice some fun stuff with text. I think it will also give an opportunity to compare how we use
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...