Hey folks, Long time friends of Riffomonas know that I’ve been teaching data science classes for close to 20 years. The hallmark of my teaching has been three-day workshops where I either teach R (here and here) or the mothur software package. I’ve gotten feedback that three days is just too much time for people to carve out of their busy schedules. So, I’m excited to be offering a 1-day (6 hours) data visualization workshop on May 9th. This will cover an introduction to the ggplot2 package. My goal is to help you develop that mental model so that you can leave the workshop understanding the ggplot2 framework and add to your understanding of the model as you go off on your own journey learning more advanced topics. You can learn more and register by clicking the button below. Feel free to email me if you have any questions.
If a full day is still too much time, let me know. I could schedule a 6-hour workshop over two days. I can also make an even shorter workshop! I’m sorry, not sorry, for all the plots I share from the New York Times. The reality is that they’re about the best in data journalism. What they don’t give away for free is often available through our universities’ academic subscriptions. If you know of another source of great, consistent, data journalism, please send it my way. In the US there has been a lot of discussion about the federal budget. Cuts to NIH, NSF, CDC, and everything else seems to be about shoring up the budget to remove $1,000,000,000,000 - $1 trillion - from an $8.3 trillion budget (this doesn’t include interest on the debt). The NYT has been using “waffle charts” to depict the budget in their reporting. Here’s an example from an April 13th article in the UpShot titled, “Why Elon Musk Hasn’t Come Close to Finding $1 Trillion in Cuts”. My impression has been that going after the NIH and NSF has been like trying to find $100 by searching for lost coins in my couch. Throughout the article they provide a series of aggregated waffle charts to show what different types of cuts might look like. I can imagine another article with an interactive visualization that would allow the user to find the $1 trillion. A waffle chart is also called a “square pie chart”. That’s fair. Each square represents a consistent value. In this case, it’s $5 billion. One problem with pie charts is that it’s really difficult to interpret the relative area of a wedge from a circle. In contrast, a waffle chart deals in squares, which can be counted to make comparisons between groups easier. What do you think of this alternative? Anyway, how would we make this in R? One thought would be that these are effectively heat maps with a few specific fill colors for each group. It would take some logic to figure out how to make specific tiles in the heat map the same group and preferably to have those tiles be arranged next to each other. Thankfully, there’s a package for that Looking through the
How would I adapt this code for my use? First, I notice that the NYT version has partial squares on the bottom row of the charts. The area of those are difficult to interpret, so I’d forgo those and instead use full squares in my version of the waffle chart. This would result in the bottom row not having all 50 squares across. Second, each waffle chart has its own title and in some cases, subtitle. I’d likely use Third, each waffle chart also has text indicating the percent cut on the right side of the title. I’d likely do this with Finally, I’ve had a bit of a journey to find the data, but I think this PDF from the Congressional Budget Office (CBO) has the data we want in the 2026 column.
|
Hey folks, I really enjoyed teaching a one-day, introduction to ggplot2 workshop last week. It was a lot of fun - I enjoyed teaching the principles behind ggplot2. I’ve been noticing many learners (and teachers) focusing on making templates that they can recycle to make variations on a common plot type. This is how I often teach ggplot2 and the rest of the tidyverse - it’s also how I learned R. In the most recent workshop I was testing a hypothesis that teaching concepts would yield more long...
Hey folks, If you’re interested in participating in a 1-day (6 hours) data visualization workshop, you’re running out of time to register. I’ll be teaching this workshop on May 9th. I will cover an introduction to the ggplot2 package and will assume no prior R knowledge. My goal is to help you to understand the ggplot2 framework and begin to apply it to make some interesting and compelling visualizations. After this workshop, you should be able to learn more advanced topics on your own. You...
Hey folks, I’m gearing up to teach a 1-day (6 hours) data visualization workshop on May 9th. This workshop will cover an introduction to the ggplot2 package and will assume no prior R knowledge. My goal is to help you to understand the ggplot2 framework and begin to apply it to make some interesting and compelling visualizations. From this workshop, I hope that you would be able to go off on your own journey learning more advanced topics. You can learn more and register by clicking the button...