Activity

Exploring Advanced Data Visualizations for Research Projects

Objective:

Create advanced data visualizations that effectively communicate complex data insights, moving beyond basic line or scatter plots.

Overview:

Explore one of the following resources for five minutes

  1. Data Visualization Catalogue
  2. Seaborn examples
  3. How to Choose the Right Data Visualization
  4. NCHS Data Visualization Gallery
  5. Data Viz Project
  6. NCHS-CS website on visualizations

Now consider that we have two data set examples below.

  1. Join into a group of 3 or 4 people.
  2. Choose one of these data sets (or you can work with your own research project idea).
    • Analyze the data set and research question to identify key insights.
    • Select an advanced visualization type (do not choose a line or scatter plot) that best represents the data and supports the research question.
    • Sketch a draft of the chosen visualization on paper.
    • Prepare a brief explanation why you chose this view and why the selected visualization is more effective than basic plots.

Data Set 1 - Global Renewable Energy Consumption

  • Research Question: How has the share of different renewable energy sources evolved globally over the past two decades, and what trends can be observed?
  • Data Description: This data set includes annual global consumption figures for various renewable energy sources (e.g., solar, wind, hydro, geothermal) from 2000 to 2020.

Data Set 2: World Population Age Distribution

  • Research Question: What are the comparative age distributions across different continents, and how do they relate to regional socio-economic factors?
  • Data Description: This data set provides population counts segmented by age groups (e.g., 0-14, 15-24, 25-54, 55-64, 65+) for each continent as of 2020.
Possible answers (view after completing above activity)

Data Set 1: Global Renewable Energy Consumption

Research Question: How has the share of different renewable energy sources evolved globally over the past two decades, and what trends can be observed?

Stacked Area Chart:

  • Purpose: Illustrates the cumulative contribution of various renewable energy sources over time, highlighting both individual and collective trends.
  • Example: A chart displaying the global energy consumption from 2000 to 2020, with each segment representing a different renewable source (e.g., solar, wind, hydro). This visualization emphasizes the growth patterns and shifts in energy source prominence.

Stacked area

Sankey Diagram:

  • Purpose: Demonstrates the flow and proportion of energy consumption from different renewable sources, providing insight into the distribution and transition of energy usage.
  • Example: A diagram mapping the journey from various renewable energy sources to their respective consumption sectors, effectively showcasing the magnitude and direction of energy flows.

Sankey diagram

Treemap:

  • Purpose: Offers a hierarchical view of the proportionate contributions of each renewable energy source within the total energy consumption.
  • Example: A treemap where each rectangle’s size corresponds to the consumption level of a specific renewable source, allowing for quick comparison of their relative magnitudes.

Treemap

Data Set 2: World Population Age Distribution

Research Question: What are the comparative age distributions across different continents, and how do they relate to regional socio-economic factors?

Population Pyramid:

  • Purpose: Visualizes the age and gender distribution within a population, facilitating comparisons between different regions.
  • Example: Side-by-side pyramids for continents like Africa, Asia, and Europe, highlighting differences in age structures and potential socio-economic implications.

Population pyramid

Heat Map:

  • Purpose: Represents the density of specific age groups across various continents using color gradients, making patterns and concentrations easily identifiable.
  • Example: A heat map where age groups are plotted against continents, with color intensity indicating the population size within each age bracket.

Heat map

Bubble Chart:

  • Purpose: Combines age distribution data with another variable, such as GDP per capita, to explore potential correlations.
  • Example: A chart where each bubble represents a continent; the bubble’s position indicates the median age, its size reflects the population under 15, and the color denotes GDP per capita, revealing insights into the relationship between age distribution and economic factors.

Bubble Chart

These visualization techniques can provide deeper insights into this data and move beyond basic plots to more effectively communicate complex relationships and trends.