|
Hey folks, I really hope you enjoyed the series of newsletters and videos of me recreating the visualizations presented by W.E.B. DuBois at the 1900 Paris Exposition. I can’t express how much I enjoyed making them. Some of them were pretty tricky and required a lot of work. But I think it was worth it! It definitely forced me to use some new-to-me tools like This week, I want to highlight a story in the NY Times’s “TheUpshot” section from December 2024, “How Drug Overdose Deaths Have Plagued One Generation of Black Men for Decades”. The story describes how drug overdoses has had a disproportionate effect on Black men through the 1980s, 1990s, and now - these men are all from the same generation. One quote that stood out to me in the article was from Tracie Gardner, who said, “They were resilient enough to live through a bunch of other epidemics — H.I.V., crack, Covid, multi-drug-resistant tuberculosis — only to be killed by fentanyl.” Oof. In this article, the team of journalists and data scientists present heatmaps for different diseases with the year across the x-axis, the age across the y-axis, and the intensity of each cell colored by the number of deaths per 100,000 people. They make plots for different diseases and U.S. cities. Here’s the heatmap for drug deaths among Black men in Chicago: A few things stand out to me about this figure that I’d enjoy taking on in R with ggplot2. First, it’s obviously a heatmap. I need to aggregate the data from the mortality data from NCHS, but let’s assume that we can get a CSV or TSV with columns for the year, age, and number of deaths in each region. We should be able to map the year to the y, age to the x, and deaths to the fill aesthetics. Then we’d use Second, I’m intrigued that they also included the dashed lines indicating the ages of a cohort of men born between 1951 and 1970 across the last 40 years. I’d likely create this using Third, the legend is a gradient going from a pale color to a dark purple color. This is reminiscent of one of the viridis color scales. Even if it isn’t exactly one of the built-in color scales, we could use Fourth, there’s no x-axis title, but there is a y-axis title - “AGE”. It’s rotated 90 degrees and located outside the top let corner of the heatmap. I’d likely put that there with some Finally, the plot has a title and caption. The title has two font faces - bold and regular sans serif font. Likely a Libre Franklin-related font that we can get from google fonts. The caption at the bottom is small and gray. What else catches your eye about this visual? Let me know! Reading the comments of a NY Times article is rarely a good idea. But nestled in there are other plots that I’d be interested in seeing. For example, what do the data look like for Black women? Hispanics? Whites? What do they look like in Detroit? How do you think we’d need to alter the R code to look at these questions and compare them to Black men? I suspect the WONDER NCHS Data is a treasure trove for answering these and other questions.
|
Hey folks, Happy 2026! It’s great to be joining you on another trip around the sun as we explore data visualization, R, and reproducible research. Later today I’ll be hosting a workshop on the design of data visualizations. If you register ASAP, I can probably still get you in. If you missed this one, but would like to be notified when I run this workshop again, reply to this email and let me know! This week I found a pretty unique plot type in a paper published in the journal Nature This is...
Hey folks, What a year! This will be the last newsletter of 2025 and so it’s a natural break point to think back on the year and to look forward to the next. Some highlights for me have been recreating a number of panels from the collection of WEB DuBois visualizations on YouTube, recreating plots from the popular media, and modifying and recreating figures from the scientific literature. I guess you could say 2025 was a year of “recreating”! I have found this approach to making...
Hey folks, As 2025 is winding down, I want to encourage you to think about your goals for 2026! For many people designing an effective visualization and then implementing it with the tool of their choice is too much to take on at once. I think this is why many researchers recycle approaches that they see in the literature or that their mentors insist they use. Of course, this perpetuates problematic design practices. What if you could break out of these practices? What if you could tell your...