Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

HW3

Description I’m depicting the identification and characterization of Cluster 2 in the Visium spatial transcriptomics data from a mouse kidney sample. The top row shows the discovery and validation of...

Identification of kidney collecting duct principal cells through principal component analysis, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis, k-means clustering, and differential expression analysis to characterize a cluster of interest based on gene expression patterns. In the...

HW3

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Identifying a cluster of Proximal Tubule Epithelial Cells

Description To identify and characterize a transcriptionally distinct cell cluster from the Xenium dataset, I first normalized the raw counts and did PCA for dimensionality reduction. Based on the scree...

Visualization of Proximal Tubule Cells in Kidney Tissue Sample

Description of Data Visualization: The raw Xenium dataset was normalized according to library size and log normalization before having its dimensionality reduced using principal component analysis.

Identification of Proximal Tubule Cells in Kidney Tissue

In this data visualization, I explored the gene expression of Cluster 1 from a single-cell resolution spatial kidney tissue sample. The two uppermost plots highlight this cluster of interest by...

Identification of Thick Ascending Limb Cells in Visium Spatial Transcriptomics of Mouse Kidney

1. Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description...

Identification of Proximal Tubule Cells in Kidney Tissue

In this data visualization, I explored the gene expression patterns of Cluster 2 from a Visium spatial transcriptomics dataset of kidney tissue. The visualization consists of five integrated panels that...

HW3

Instructions: Create a multi-panel data visualization that includes at minimum the following components:​ (1) A panel visualizing your one cluster of interest in reduced dimensional space (PCA, tSNE, etc)​, (2)...

HW 3

Figure Description and Interpretation This figure integrates dimensionality reduction, spatial mapping, and differential expression analysis to characterize an Aqp2-positive cell population. In PCA space, cells form distinct clusters, with Cluster...

HW2

## In this homework, I wanted to explore the following question: How do the genes with high versus low loadings relate to each other? How are they patterned relative to...