|
Hey folks, I hope you have enjoyed the current series of newsletters and videos recreating “data portraits” from the WEB DuBois collection of visuals he showed at the 1900 Paris Exhibition. You can find the entire collection of “data portraits” in a book assembled by Whitney Battle-Baptiste and Britt Rusert (here) or as a collection of plates through the Library of Congress (here). I’ve really appreciated the positive feedback! These figures are pretty different from what we do in modern data visualizations. But, I have found tremendous value in recreating them. Of course, I am strengthening my R skills in terms of getting more dexterous using ggplot2 and R in general - hopefully you are too! But, I am also learning to think differently about how to solve problems. I think this latter point is far more important than the mechanics of how to generate a figure. After discussing the “fan plot” with you last week, I went about making it for the video that will be posted on Monday. I was able to make it using a stacked bar plot made with This week, I want you to consider an iconic visual from the collection that is actually on the cover of the (Battle-Baptiste and Rusert book). This is Plate 25 This plot is a bit beyond a simple description. It reminded me of Dorothoy’s trip out of Munchkin Land where there are multiple roads, but she has to follow the “yellow brick road”. Except those roads go out from the Munchkins. These roads all circle inward. Regardless, we might call this a circular bar plot. There were a few strategies that stood out to me. First, I could represent each bar as a line. I’d make the lines thick so that they resembled bars. With this approach, I would start by making 5 diagonal lines in Cartesian coordinates. Then I’d use Second, I could take the same strategy as the line plot, but use Third, to avoid going back and forth between coordinate schemes, we could do everything in Cartesian coordinates. We would still need to use angles and radii to draw the spirals. The outer and inner edge of each spiral would have their own radius and the beginning and end of each spiral its own angle. Using the There are three strategies. See if you can get one of these to work! Let me know what you find ;) The legend at the start of the spiral is deceptively simple! If the font were monospaced, it would be somewhat straightforward to get everything line up. But it isn’t. Also, the line connecting the year and the amount of money isn’t merely a series of dashes, but actually a short line segment. Perhaps dashes would be fine if you weren’t so concerned with matching the original. Now that I think about it, I wonder how to generate straight lines with Again, I’m not sure that I would ever create a plot like this for my own data. But, it has definitely gotten me thinking far differently than I normally do with my data visualizations. That’s what make it so much fun for me. Here’s some code to help you get going.
If you thought this was fun, I’d encourage you to check out Plate 17. How would you go about generating that final serpentine bar for the “Industrial” category?
|
Hey folks, It has been great to see the high level of engagement with my weekly critique videos on YouTube. I have really enjoyed making them and have learned a lot about current practices in data visualization. The one problem with these videos is that they’re a bit like an autopsy. We can figure out what went well or what didn’t work in a published figure. But we can’t do much to improve the published figure. What if we could do critiques before submitting our papers, preparing a...
Hey folks, This week I want to share with you a figure that resembles many a type of figure that I see in a lot of genomics papers. I’d consider it a data visualization meme - kind of like how you’re “required” to have a stacked bar plot if you’re doing microbiome research or a dynamite plot if you’re publishing in Nature :) This figure was included in the paper, “Impact of intensive control on malaria population genomics under elimination settings in Southeast Asia” that was published...
Hey folks! I hope you enjoyed last week’s series on the radial volcano plot (newsletter, critique video, livestream). I think it did a good job of illustrating the various reasons I think it’s valuable to recreate figures, even if we don’t like how they display the data. Something I didn’t really emphasize in last week’s newsletter was that by recreating a figure, we can make sure that the data are legit. I’m surprised by the number of signals I’ve been finding where authors using tools like...