Hey folks,

Are you looking for more personalized support and coaching to help you develop your data analysis skills? Are you looking for help in leading a data science team where your folks aren’t super proficient in analyzing data? Let me know what you’re looking for and we can discuss how I might be able to help you. Unfortunately, this wouldn’t be a free service. But I’m confident I can help you get over the challenges that are keeping you from creating data analyses and visualizations that you are proud of. Let me know by replying to this email.

I really hate stacked bar plots. Unfortunately, one of my most popular videos is how to make a stacked bar plot! I even tell people that there are better ways of representing data than with a stacked bar plot. Oh well. Today, I want to share a stacked bar plot that I think would be fun to recreate, and then think about how we could make it better.

This visualization was published online two years ago and comes to us from YouGov. It is a horizontal stacked bar plot showing whether people love, like, dislike, hate, or don’t know if they like each of 30 card games. It also has text annotations to indicate the size of each of the bars.

If you want the data, you can copy and paste it from a PDF with their data. Incidentally, embedding data in a PDF is a sure sign to me that people don’t want you to actually use the data for secondary purposes. Thankfully, this is a nice PDF that we can copy and paste from and, with some regular expressions in RStudio, convert to a tibble.

The data will come in wide format with the different sentiment types across the columns, the games in the rows, and the level of sentiment for each game in the cells. We can tidy the data using […] By default, […] We’d also like to add the level of sentiment for each game to each of the bars. Well, except for those bars with less than 4% support.
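Here is a minimal sketch of the tidying and labelling steps described above. It assumes `pivot_longer()` from tidyr for the reshaping and `geom_col()` plus `geom_text()` from ggplot2 for the bars and labels; those function choices are assumptions rather than something the text above specifies. The game names and percentages are made-up placeholders, not the YouGov values, and the column names are hypothetical.

```r
library(dplyr)
library(tidyr)
library(ggplot2)

# Made-up placeholder percentages in wide format (NOT the YouGov data)
wide <- tibble(
  game      = c("Game A", "Game B", "Game C"),
  love      = c(30, 10, 3),
  like      = c(40, 25, 20),
  dislike   = c(10, 30, 35),
  hate      = c(2, 15, 22),
  dont_know = c(18, 20, 20)
)

sentiments <- c("love", "like", "dislike", "hate", "dont_know")

# Wide to long: one row per game and sentiment
long <- wide %>%
  pivot_longer(-game, names_to = "sentiment", values_to = "percent") %>%
  mutate(
    sentiment = factor(sentiment, levels = sentiments),
    # Blank out the label for bars under 4% instead of dropping the row,
    # so the remaining labels still stack at the right positions
    label = if_else(percent >= 4, as.character(percent), "")
  )

# group = sentiment keeps the text layer stacking in the same order as
# the bars, even though the label aesthetic is a discrete variable
stacked <- ggplot(long, aes(x = percent, y = game,
                            fill = sentiment, group = sentiment)) +
  geom_col(position = position_stack(reverse = TRUE)) +
  geom_text(aes(label = label),
            position = position_stack(vjust = 0.5, reverse = TRUE))
```

Printing `stacked` draws the plot. Keeping the small bars in the data and blanking only their labels is a design choice: filtering those rows out of the text layer would shift every label that follows them in the stack.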
I’d start by making a […] There are a number of interesting stylings that we’ll be able to implement in the […]

Now, how could we improve this figure? The main problem with stacked bar plots is that it is difficult to compare the internal bars across groups. Sure, the numbers are there, but it’s not as efficient as comparing the length of a bar that is anchored on either side. One solution would be to convert this to a dot plot where we’d use the same x and y-axis aesthetic mappings, but we’d use […]

As an aside, I’m struck by the preference for solitaire and the overall dislike of bridge. Solitaire is a single-person game that at one point (perhaps still?) came on every Windows computer. There’s little strategy. Bridge is a very social game that I associate with the “greatest generation”. Couples would get together regularly to play with each other, and there were newspaper columns about bridge strategy alongside columns about chess strategy. It’s hard not to see this as some referendum on our social media world where we think we’re participating in a community, but really we’re growing more and more isolated.

What’s your favorite card game?
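Coming back to the dot plot idea, here is a minimal sketch of what that conversion could look like. It assumes `geom_point()` with `color` mapped to sentiment in place of the stacked bars' `fill`; that is one plausible reading of the suggestion, not a confirmed recipe. The values are made-up placeholders, not the YouGov data.

```r
library(ggplot2)

# Made-up placeholder percentages in long format (NOT the YouGov data)
long <- data.frame(
  game      = rep(c("Game A", "Game B", "Game C"), each = 2),
  sentiment = rep(c("like", "dislike"), times = 3),
  percent   = c(40, 10, 25, 30, 20, 35)
)

# Same x and y mappings as the stacked bars, but every point is measured
# from the shared zero of the x-axis rather than from the end of the
# previous bar
dots <- ggplot(long, aes(x = percent, y = game, color = sentiment)) +
  geom_point(size = 3) +
  scale_x_continuous(limits = c(0, 100))
```

Because each dot sits on a common scale, comparing one sentiment across games becomes a matter of comparing positions along a single axis.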
Hey folks! I’m in proposal writing mode again. Unfortunately, I am finding my weekly search for a data visualization to share with you is leading me down unproductive internet rabbit holes. So, I thought I’d share an idea with you that I hope resonates. If you have any reactions, please send them my way! For the past year or so I have been recreating other people’s data visualizations in an attempt to learn new techniques with R and expand the type of data that I normally visualize. The idea...
Hey folks! Here in the US, vaccines continue to be a hot button issue. I feel like this issue is really an amalgamation of multiple issues including the decline in respect for authority figures, frustration with COVID, inability to assess risk at a personal level, and parents feeling like they are losing rights. Do people really want their kids to get sick unnecessarily? I doubt it. It’s also in the news because the Secretary of Health and Human Services is a vaccine skeptic/denier with many...
Hey folks! Sorry for the hiatus in getting you a newsletter into your inbox. The end of the summer/beginning of the academic year has been pretty chaotic. Actually, I had what I thought would be an interesting plot to recreate, but then I wasn’t able to find the original data and I wasn’t really interested in simulating it. Oh well. I’m also finding it hard to come up with interesting data visualizations from out in the wild. One of my go-to’s, Philip Bump, stopped working for the Washington...