Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Identification of Proximal Tubule Cells in Kidney Tissue

In this data visualization, I explored the gene expression patterns of Cluster 2 from a Visium spatial transcriptomics dataset of kidney tissue. The visualization consists of five integrated panels that...

HW3

Instructions: Create a multi-panel data visualization that includes at minimum the following components:​ (1) A panel visualizing your one cluster of interest in reduced dimensional space (PCA, tSNE, etc)​, (2)...

HW 3

Figure Description and Interpretation This figure integrates dimensionality reduction, spatial mapping, and differential expression analysis to characterize an Aqp2-positive cell population. In PCA space, cells form distinct clusters, with Cluster...

HW2

## In this homework, I wanted to explore the following question: How do the genes with high versus low loadings relate to each other? How are they patterned relative to...

HW3

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

HW 2

1. How do the gene loadings on the first PC relate to features of the genes such as its mean or variance? For context, the data was log-transformed, but not...

HW2 Submission

2. How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Genes with high loadings on...

A multipanel data visualization of PC1 gene loadings against gene features

1. What data types are you visualizing? Categorical- PCA unscaled and PCA scaled Quantitative data- PC1 gene loading Gene expression variability (variance) Mean gene expression

HW2

Question explored: “How do tSNE coordinates change as you increase or decrease the perplexity?”

Kidney PCA vs Loading Size Analysis(Prompt 2)

1. What data types are you visualizing?

Comparing high vs. low PC1 loading genes

Aim: How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Note: Panel numbers reference the...