|
Hey folks! I’m hoping to host two workshops in March and April. The first would be a Zoom-based workshop on the principles of data visualization (I taught a version of this last month). This would be a code-free workshop and would run for about 3 hours. I don’t have a date yet. If you are interested, please reply to this email and let me know if there is a date and time in March that would work best for you. The second would be an in person 3 day workshop taught near the Detroit airport. I know it seems bonkers to meet in person, but this was how I got started teaching. Frankly, I miss these interactions. If you are interested in attending a live, in person workshop learning R’s tidyverse applied to a variety of interesting datasets (an in person version of this workshop), reply to this email and let me know when would work best for you. I need at least 15 people to register to make it work. This week I found a paper recently published in Nature Microbiology, “Initial sites of SIV rebound after antiretroviral treatment cessation in rhesus macaques”. Don’t worry if you aren’t sure what most of those words mean. I often get confused over how authors represent a limit of detection. If there is a line indicating the limit of detection and points are on top of the line, were they below or above the limit of detection? I prefer to remove any doubt and put the points below the line. I found a panel in this paper that showed the limit of detection in an interesting manner See what they did there? They actually have two plotting windows. The one on the top has a y-axis on a log-scale going from 10-1 to 105. The one on the bottom has a label indicating “Below threshold”. I thought this was a unique strategy for indicating the number of observations that had densities below the limit of detection. This approach is different from what I typically do and so it caught my eye. My usual approach is to add a horizontal, thin, gray line, which might be dashed just above the limit of detection on the y-axis. Then any points below the line would be “below the limit of detection”. Often this is done as it is here, with a log-scaled y-axis. To pull off my approach, I have to add a small value to all the values so that I don’t have any zeroes (you can’t take the log of zero). I generally pick that value so that it has a good spacing relative to the other data. We can get away with this because if in this case I were to add 10-2 to all the values, it wouldn’t meaningfully change the position on the y-axis of the other data points. If this worried you, you could always just add the value to those that were below the limit of detection. Take a moment and think about how these authors could have created these figures in R. What did you come up with? Facets? Me too! I’d likely create a variable - perhaps Using Now that I think about it, I rarely add the vertical jitter to my points below the limit of detection. Even if I used my approach rather than the faceted version, I could still add a vertical jitter those points below the limit of detection. I’d use a similar approach, albeit in log10 space. Which of these two approaches to indicating a limit of detection do you prefer? Reply to this email and let me know! Also, don’t forget to let me know if you’re interested in one of the upcoming workshops
|
Hey folks, We had a lot of fun last week with my first workshop on the theory of data visualization! If this is something that you’d be interested in participating in let me know. At this point, I don’t have anything scheduled. So, if you have suggestions for days or times, please let me know This week I have a fun figure to share with you from a paper recently published in Nature Microbiology, titled, “Candida auris skin tropism and antifungal resistance are mediated by carbonic anhydrase...
Hey folks, Happy 2026! It’s great to be joining you on another trip around the sun as we explore data visualization, R, and reproducible research. Later today I’ll be hosting a workshop on the design of data visualizations. If you register ASAP, I can probably still get you in. If you missed this one, but would like to be notified when I run this workshop again, reply to this email and let me know! This week I found a pretty unique plot type in a paper published in the journal Nature This is...
Hey folks, What a year! This will be the last newsletter of 2025 and so it’s a natural break point to think back on the year and to look forward to the next. Some highlights for me have been recreating a number of panels from the collection of WEB DuBois visualizations on YouTube, recreating plots from the popular media, and modifying and recreating figures from the scientific literature. I guess you could say 2025 was a year of “recreating”! I have found this approach to making...