|
Hey folks, It’s March! That means the days are getting longer, the weather is pretty bonkers, the Cubs season has already started, and it’s time for March Madness. For the uninitiated, that’s the roughly month-long period starting last week when men’s and women’s college basketball teams compete for their conference championship and then the National Championship. After falling apart at the end of the regular season the University of Michigan Men’s team won their conference tournament and received one of four 5 seeds. Our women’s team also had a strong season falling just short in the conference tournament in the semifinals. They received one of four 6 seeds. To be completely honest and while I’d be happy to be wrong, I doubt either team will get far in the NCAA tournament. Regardless, it’s a fun time of year when nearly anything happen. Earlier this week the New York Times newsletter recalled that last year was the first time that the Women’s NCAA Championship game had more TV viewers than the men’s game. The women’s game had 18.9 million viewers and the men’s game had 14.8 million viewers. Much of that is being credited to Caitlin Clark. Basketball insiders have noticed that there is greater parity in both the men’s and women’s game. There are more upsets and the traditional powers have lost much of their power. That parity makes for more viewers. Here’s the visualization that the New York Times included in their newsletter: It’s a pretty straightforward line plot. Within A few things about the plot stand out. First, as we’ve seen in previous videos recreating NYT visuals, the y-axis text often sits on the horizontal grid lines and the top most value includes the unit of the y-axis. Second, on the right side of the plot they have a single point for the 2024 data along with a text annotation indicating the number of 2024 viewers for both finals broadcasts. The point and text are the same color as the line. I’d likely add this by taking the full data frame and filtering for the 2024 data. Then I’d add a Third, I really like the use of color. The line for the men’s data is gray and the line for the women’s data is orange. The article was about the rise in popularity of the women’s game so it makes sense to highlight those data. Labelling the 2024 data serves as a legend for the figure. I have a couple of small critiques about the plot. First, I’m not sure that the “2024 finals” annotation was necessary. Instead, the title of the plot could have been “N.C.A.A. basketball championship game viewers” - inserting “game” and removing the years. This would have highlighted what the numbers mean. Also, the years are obvious from the x-axis. Of course, they also could have made the title more declarative. Something about how there has been a downward trend in viewership of the men’s game and a meteoric rise for the women’s game. Finally, the lines make it appear that there was a tournament in 2020. There was not. Ideally, these lines would have a gap in them to indicate that there was a lost year to the pandemic. Finally, I tried to track down the data that the author’s used. Considering this was a big story last year, I figured Nielsen or someone would have the data gathered together. Nope. Nielsen has numbers for the women’s game going back to 1995. Sports Media Watch has numbers for the men’s game going back to 1975. After a few efforts of manually transcribing numbers, I’m going to take a different approach. This looks like a great opportunity to focus on using the Give this a try on your own and let me know how it goes. I hope that both University of Michigan teams are still playing by the time I get a chance to share my approach to recreating this visualization. Do you think the women’s game will have more viewers than the men again this year? We’ll know in a few weeks :)
|
Hey folks, If you missed Wednesday’s livestream, I encourage you to go back and check it out. I recreated a panel from a paper published in Nature that is pretty typical. It was made up entirely of photographs. Sometimes I feel like I’m the only PI that doesn’t merge panels into figures using Illustrator or Powerpoint. I prefer to use R with some help from {cowplot} or {patchwork} to do this for me. That way I can write a single script to generate the entire set of panels. The result is a...
Hey folks, This week I’ve been teaching one of my 3 day R workshops as part of my official teaching duties at the U of Michigan. I really enjoy teaching these classes! I offer recorded versions of these workshops that use microbiome data or other types of data to help motivate my teaching of R’s tidyverse packages. If you would like to purchase your own version of these workshop click on those links! Also, if you would like me to teach a live workshop to your group, reply to this email and...
Hey folks, If you missed it, on Wednesday I did a livestream where I made a stacked barplot and pronounced it good. No, I wasn’t drinking anything! But it’s a reminder to think about the question before finding the best data visualization strategy. I think this highlights the value of the constructive approach I’ve been trying to take to critiquing data visualizations. The first steps are to establish the question and figure out the question. If you aren’t a “regular”, I think you’re really...