|
Hey folks! The summer is nearly over - where did it go?! Many of us are getting ready to send our kids off to school and start a new academic year. If you’re subscribed to this newsletter, I suspect you are interested in improving your data visualization skills. You can certainly continue to receive this newsletter and watch my weekly livestreams on YouTube for free to help increase those skills. If you want a more concentrated or personalized opportunity to develop your data visualization chops, I want to remind you of a few opportunities. First, starting in September I am going to be teaching a 5-part workshop that meets weekly to discuss and apply concepts of data visualization. Second, I have pre-recorded workshops teaching the fundamentals of the tidyverse using microbiome data and data of interest to a more general audience. Finally, I would love to work one-on-one with you or your research team to develop custom learning solutions. If any of these opportunities interest you, please click on the links above or reply to this email and let’s start taking. This week reader and livestream viewer Mike Parrott from the UK forwarded a plot to me from the Pew Research Center. The plot was part of Pew’s overall effort to look at US media consumption by sex, age, race, politics, and education. Mike was happy to see that The Guardian and BBC News are relatively popular among college educated people living in the US. This plot reports the survey of 9,482 US adults that Pew surveyed back on March 2025. Part of the survey was to ask the people being surveyed where they get their news from and their level of education. One of my first questions when trying to recreate the data is whether I can get the data from somewhere that will allow me to bring it into R easily. Yes, all of the names and numbers are in the plot, but manually typing that would be a pain. I did find the data, but sadly, the data are embedded in a PDF. Why do people do this? It seems they want to be perceived as being transparent without actually having to be transparent. Someone on a recent livestream mentioned that there are R packages to extract tables from PDFs. I forget which package they mentioned. A quick google search found a few options. First is Back to the plot… Clearly this is a bar plot with the axes switched from what we traditionally see. This is helpful because it allows us to more easily read the name of the news outlet than if the names were along the x-axis and the names rotated to prevent them from overlapping. I would use The next notable element of the plot is the percentages of college graduates. I’d use Let’s think about the text elements for a moment. There are two bits of text that help orient the reader to the plot. The first is the “62% of people regularly…” blurb that helps us interpret the first bar. I think that’s pretty helpful. There’s a downward pointing triangle there to connect the text to the bar for “The Atlantic”. I’d probably put the text an the triangle in with Thinking about that second blurb, we see that the authors put a pink point at 36% on the x-axis for each media outlet. We could place that with Now that I’ve started thinking about things I would change, let’s think more about the data being displayed. The story makes a point that the visual is basically flipped for people with a high school diploma or less. For example, “Univision” and “Telemundo” are most popular among these folks and “The Atlantic” is not popular with them. I could imagine changing the plot to be a dot plot instead of a bar plot. For each media outlet, I’d place a different colored point for each of the three education categories across the x-axis. I’d like to put a vertical line to show the total percentage of US adults in each category where the color matches the color of the point. Maybe that would be too busy? If so, we could drop the “some college” population to focus on the extremes. What do you think?
|
Hey folks! I continue to get positive feedback about my critique videos. This has me quite excited that I’ve perhaps scratched an itch that people have been struggling with. Would you like to meet with a group of other people who are committed to making their data visualizations better? I’m forming groups now that would meet once a week or every other week to give each other constructive feedback on the visualizations they are making for their work. Alternatively, if you have ever thought, “I...
Hey folks! I posted two videos last week! On Monday I posted a video critiquing the diverging bar plot that I described in this newsletter last Friday. My goal in this video was to think through a “constructive” approach to interpreting and critiquing data visualizations. As scientists, I think we are too worried about hurting each other’s feelings. So we don’t critique each other. At the same time, many of us think before we speak and can come off overly harsh. My goal is to create a...
Hey folks! As I’m writing this newsletter the US government is in shutdown mode with no clear signs that things will get going anytime soon. I’ll withhold my own political take except to say that my family has been running without an official budget for about 25 years. I don’t recommend it, but we know basically how much money goes to our mortgage, insurance, groceries, charities, etc. and how much money we generally have left over. Somehow we still are able to spend money on living a pretty...