Bow River

Visualization


View on Tableau Public

 


Rationale Report


Communication Goals
Upon initially looking over the data, I decided that I wanted to highlight the anti-bacterial effectiveness of each of the three antibiotics: Penicillin, Streptomycin, and Neomycin, especially in regard to the Gram-positive and Gram-negative groups of bacteria. Initially, I had the idea of plotting the minimum inhibitory concentration(MIC) values of the three antibiotics against each of the given bacterial infections into a stacked bar graph with circles as marks. The MIC value represents the necessary concentration of antibiotic required to prevent in vitro growth of the different bacterial strains. This means that the closer the MIC value is to zero, the more effective it is. Due to the large range of MIC values for the three antibiotics, there was a concentration of overlapping circles at the bottom of the graph with a few outliers near the top. To combat these obvoluted marks near 1 MIC unit, I decided to change the scale to a logarithmic one. This fixed the issue and gave me a more robust view of the data set. To get a better view of the effects of Gram-positive versus Gram-negative bacteria, I split the columns of bacteria names into two groups with Gram-Positive bacterial infections in one and Gram-negative in the other. This allowed me to easily see the most effective antibiotic for each strain of bacteria, but I felt that it still didn’t give a clear view regarding the effectiveness of each of the three antibiotics.

In turn, I created Chart 2 to more closely examine the relationship between the three antibiotics and the Gram-positive/Gram-negative bacterial groups by separating the stacked chart into separate rows for each of the three antibiotics. I found this to be much more effective at expressing which antibiotic had the strongest antibacterial effect against the Gram-positive group and the Gram-negative group. As evident from the visualization, Penicillin was extremely effective against Gram-positive but extremely ineffective against a majority of the Gram-negative strains. Streptomycin had similar MIC levels for both Gram-positive and Gram-negative strains, although for the Gram-positive bacteria, Streptomycin had a much higher standard deviation of MIC values and contained both the minimum and maximum Streptomycin MIC values, (0.01, 14). Neomycin showed around an average MIC level of 1 for Gram-negative strains but had very varied results on Gram-positive strains. It had values as low as 0.001 for two bacteria but also values as high as 10 and 40 for other Gram-positive bacteria. Due to the use of a logarithmic scale from 0.001 to 1,000, I believe the true differences between various MIC, especially the outliers on both ends, might be downplayed although I do believe Chart 1 gives a better look into the true differences between each antibiotics effect on the 16 bacterial strains.
Design Rationale
For my initial examination of the data, Chart 1, I used a stacked bar plot with rings as the mark. I chose this because it allowed me to see all three antibiotics compared against one another for each of the 16 bacterial infections. The design has a logarithmic scale centered on 1 MIC unit, with 0 at the very bottom and 1,000 at the top of the graph, since it would otherwise be impossible to discern between the various MIC values near 1. I chose to use rings since they stood out more than dashes were easier to spot visually with my color scheme. For my color encoding, I used the three antibiotics with their MIC level for the key. Originally it had blue, orange, and red as the color encodings but the given color scheme was too perceptually uniform, and I decided to go with green, yellow, and red instead since it caused more pop-out between the circles. I also added a reference line at 1 MIC unit to highlight the fact it’s a logarithmic scale centered on 1 and show which values very clearly fall under 1.

Since the focus of Chart 2 was to highlight the differences in effect between the three strains, I decided to use a bar plot with bars as my mark for Chart 2. I tried using circles and dashes originally, but it came off with the impression that our data was centered around 0 instead of 1 unit. I found that bars made it easier to discern that the very bottom of the graph started at 0, as well as, made it easier to see the most effective antibiotic for each strain. In regards to the color encodings, I used maize and blue to represent the separate Gram-positive and Gram-negative bacterial infection groups, respectively. This helps force the viewer to focus on the differences each of three antibiotics have on the two Gram-strain groups. I added a reference line at 1 MIC unit on these graphs as well, since it helps the viewer recognize the logarithmic scale and it makes it easier to judge values based off that line. I labeled the bars with their MIC values to make it easier to read if the whole visualization was scaled down or squashed if used on another website. Overall, I feel my visualization used effective encoding and marks to clearly indicate each of the antibiotics effects upon Gram-positive and Gram-negative bacterial infections.

Dataset Information


Data Set: Antibiotics
After the World War II, antibiotics were considered as "wonder drugs", since they were easy remedy for what had been intractable ailments. To learn which drug worked most effectively for which bacterial infection, performance of the three most popular antibiotics on 16 bacteria were gathered.

The values in the table below represent the minimum inhibitory concentration (MIC), a measure of the effectiveness of the antibiotic, which represents the concentration of antibiotic required to prevent growth in vitro. The reaction of the bacteria to Gram staining is described by the covariate “gram staining”. Bacteria that are stained dark blue or violet are Gram-positive. Otherwise, they are Gram-negative.
BacteriaPenicilinStreptomycinNeomycinGram Staining
Aerobacter aerogenes87011.6negative
Brucella abortus120.02negative
Brucella anthracis0.0010.010.007positive
Diplococcus pneumoniae0.0051110positive
Escherichia coli1000.40.1negative
Klebsiella pneumoniae8501.21negative
Mycobacterium tuberculosis80052negative
Proteus vulgaris30.10.1negative
Pseudomonas aeruginosa 85020.4negative
Salmonella (Eberthella) typhosa 10.40.008negative
Salmonella schottmuelleri 100.80.09negative
Staphylococcus albus 0.0070.10.001positive
Staphylococcus aureus 0.030.030.001positive
Streptococcus fecalis 110.1positive
Streptococcus hemolyticus 0.0011410positive
Streptococcus viridans0.0051040positive

Assignment


SI 649 W 2018 Individual Assignment
Your task is to design a visualization that you believe effectively communicates the data and provide a short write-up (no more than 4 paragraphs) describing your design. While you must use the data set given, note that you are free to transform the data as you see fit. You are also free to incorporate external data as you see fit. Your visualization should be interpretable without recourse to your short write-up. Do not forget to include title, axis labels or legends as needed!

As different visualizations can emphasize different aspects of a data set, you should document what aspects of the data you are attempting to most effectively communicate. In short, what story (or stories) are you trying to tell? Just as important, also note which aspects of the data might be obscured or down-played due to your visualization design.

In your write-up, you should provide a rigorous rationale for your design decisions. Document the visual encodings you used and why they are appropriate for the data. These decisions include the choice of visualization type, size, color, scale, and other visual elements, as well as the use of sorting or other data transformations. How do these decisions facilitate effective communication?