|
Hey folks, I hope you all are doing well! Before digging into today’s figure, I wanted to remind you that I’m offering a new interactive training opportunity. The goal is to provide greater opportunities to hone your skills in a social setting. My experience with leading this approach has been excellent. I can’t wait to have you give it a try with me. Please let me know if you have any questions. This week, I’m grateful to Georg Andersson for sharing an interesting figure with me from Sweden. Thank you, Georg! If you have a plot you’d like to share, please let me know. He included the following commentary: Here is a figure on what different values people get from agricultural landscape from a survey. The values are plotted in how frequent it was reported from the persons in the survey and distinguished between two groups, farmers and non-farmers. The colour of the dots are according to how big the difference were between the group.
Sorry for the values being in Swedish. :-)
Biologisk mångfald = Biodiversity
Öppna landskap = Open landscapes Rekreation/Naturupplevelse = Recreation / nature experience
Kultur/Historiska värden = Cultural and historical values
Estetiska värden = Aesthetical values
Lugn och ro = Peace and quiet
Looking at a broad overview of the figure, I first tried to interpret the visual myself. It appears that both farmers and non-farmers value biodiversity highly and ecosystem services (Ekosystemtjanster) quite low. Farmers tend to prefer open landscapes higher than non-farmers while non-farmers tend to prefer recreation and nature experience more than farmers. Living in a rural area on a farm, my experience resonates with these data! Georg mentioned that there was a survey that was given to farmers and non-farmers. I can imagine there’s a data frame somewhere with a As I mentioned, for all the extra ornamentation, this is a scatter plot. Whenever I see a scatter plot, my mind immediately thinks, Beyond moving on to the elements of the figure that could be driven by the To overcome these challenges, the {ggrepel} package is a great tool. I has some great tools to move the labels so they don’t overlap and it can add arrows to indicate which point each label corresponds to. Long time followers may recall two videos I made using {ggrepel} regarding opinions by country on a vaccine for COVID-19. One video was for a scatter plot similar to this one and another was for a slope plot version of the same data. This package also has features to allow you to only show labels for specific points. Beyond paired survey data like we have in this example, I’ve seen {ggrepel} also used in scatter plots commonly called “volcano plots” where an investigator wants to highlight specific genes. This is a great package to be familiar with! Let’s move to thinking about some of the theming options that could get a default {ggplot2} figure to more closely resemble this figure’s appearance. First, I notice that the x-axis and legend labels are tilted. The bottom axis could be modified using the For a relatively “simple” figure, this actually has a lot going on. If you’d like to play around with generating your own version, here’s some data to play around with
Finally, I’d love to see what types of figures that interest you. Please be like Georg and send me some examples of things that catch your eye. Also, as I come to the end of the current YouTube channel series building an R package, let me know whether you’d like me to take this verbal analysis of figures and translate it to real R code that I develop in video form.
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...