|
Hey folks, Long time friends of Riffomonas know that I’ve been teaching data science classes for close to 20 years. The hallmark of my teaching has been three-day workshops where I either teach R (here and here) or the mothur software package. I’ve gotten feedback that three days is just too much time for people to carve out of their busy schedules. So, I’m excited to be offering a 1-day (6 hours) data visualization workshop on May 9th. This will cover an introduction to the ggplot2 package. My goal is to help you develop that mental model so that you can leave the workshop understanding the ggplot2 framework and add to your understanding of the model as you go off on your own journey learning more advanced topics. You can learn more and register by clicking the button below. Feel free to email me if you have any questions.
If a full day is still too much time, let me know. I could schedule a 6-hour workshop over two days. I can also make an even shorter workshop! I’m sorry, not sorry, for all the plots I share from the New York Times. The reality is that they’re about the best in data journalism. What they don’t give away for free is often available through our universities’ academic subscriptions. If you know of another source of great, consistent, data journalism, please send it my way. In the US there has been a lot of discussion about the federal budget. Cuts to NIH, NSF, CDC, and everything else seems to be about shoring up the budget to remove $1,000,000,000,000 - $1 trillion - from an $8.3 trillion budget (this doesn’t include interest on the debt). The NYT has been using “waffle charts” to depict the budget in their reporting. Here’s an example from an April 13th article in the UpShot titled, “Why Elon Musk Hasn’t Come Close to Finding $1 Trillion in Cuts”. My impression has been that going after the NIH and NSF has been like trying to find $100 by searching for lost coins in my couch. Throughout the article they provide a series of aggregated waffle charts to show what different types of cuts might look like. I can imagine another article with an interactive visualization that would allow the user to find the $1 trillion. A waffle chart is also called a “square pie chart”. That’s fair. Each square represents a consistent value. In this case, it’s $5 billion. One problem with pie charts is that it’s really difficult to interpret the relative area of a wedge from a circle. In contrast, a waffle chart deals in squares, which can be counted to make comparisons between groups easier. What do you think of this alternative? Anyway, how would we make this in R? One thought would be that these are effectively heat maps with a few specific fill colors for each group. It would take some logic to figure out how to make specific tiles in the heat map the same group and preferably to have those tiles be arranged next to each other. Thankfully, there’s a package for that Looking through the How would I adapt this code for my use? First, I notice that the NYT version has partial squares on the bottom row of the charts. The area of those are difficult to interpret, so I’d forgo those and instead use full squares in my version of the waffle chart. This would result in the bottom row not having all 50 squares across. Second, each waffle chart has its own title and in some cases, subtitle. I’d likely use Third, each waffle chart also has text indicating the percent cut on the right side of the title. I’d likely do this with Finally, I’ve had a bit of a journey to find the data, but I think this PDF from the Congressional Budget Office (CBO) has the data we want in the 2026 column.
|
Hey folks! Before launching into this week’s visualization, I’m looking for a bit of feedback. Since November, I’ve settled into a new routine with this newsletter and the YouTube channel. Each week this newsletter introduces a visualization at a 30,000 ft view or discusses a specific topic in some depth (example). The following Monday I post a video critiquing the visualization (example). Then on Wednesday (or Tuesday like this past week), I livestream a video where I recreate the...
Hey folks! I just got back from a seminar. I’m still trying to stretch out my eyes from straining to see the small text on each slide! If you don’t know why I’m brining this up, then you must have missed the videos I posted earlier this week. I was discussing the factors we should consider when converting figures designed for papers to figures designed to a slide deck. You can see me critique a figure from my own lab here and the livestream where I refactor the figure can be found here. I’d...
Hey folks, I was a student-invited speaker at the Syracuse University Biology department this week. It was great to meet with them and hear how they are benefiting from these newsletters and my videos. As much as I love posting newsletters and videos, seeing people light up at ideas, laugh at my jokes, and tell me how they are using what I teach them is like jet fuel. I actually gave two talks. One talk covered what I’ve learned about data visualization by critiquing, recreating, and remaking...