Home | Genomic Data Visualization

Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details

☞ see Course tab

Featured Visualizations

Using clustering and deconvolution to visualize cell types and upregulated genes in different data sets

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, deconvolution, and differential expression analysis to...

Jamie L
02 Mar 2026

tSNE on varying PC numbers

Description This animation adresses the question: “If I perform non-linear dimensionality reduction on PCs, what happens when I vary how many PCs I use?”

Isabella G
02 Mar 2026

Effect of Varying PC Count on tSNE Space - Visium

Write a a brief description of your figure so we know what you are visualizing.

Emma Meihofer
02 Mar 2026

Deconvolution and Multi-Modal Comparison of the Renal S3 Segment

Note, the png is named “EC2_ooni5.png”, as a desired name was not specified in the HW powerpoint.

Oluwadurotimi O.
01 Mar 2026

HW5

1. Figure Description I created a multipanel figure to show the distribution of B cells and T cells in thhe spleen. Throughout, I used the...

Will Li
20 Feb 2026

Identification of CODEX data as White Pulp

Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure...

Emma Meihofer
19 Feb 2026

HW 5

###Summary To identify the tissue structure represented in this CODEX dataset, I performed quality control, dimensionality reduction, k means clustering, differential expression analysis, and cell-type...

Aarna Sanghai
19 Feb 2026

Identification of kidney collecting duct principal cells through dimensionality reduction, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, and differential expression analysis to characterize...

Jamie L
17 Feb 2026

hw4: cortical tubule area in Xenium data

I’ve been analyzing Visium data so far, and this time I switched to Xenium data to try to identify the same cell type I found...

Tiya Z
16 Feb 2026

HW3: Multi-Panel Data Visualization of a Transcriptionally Distinct Proximal Tubule Epithelial Cell Cluster in the Xenium Dataset

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been...

Yuki H
11 Feb 2026

A multipanel data visualization distinguishing the ascending loop of henle in mouse kidney tissue

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been...

Saadia J
11 Feb 2026

Visualization of Proximal Tubule Cells in Kidney Tissue Sample

Description of Data Visualization: The raw Xenium dataset was normalized according to library size and log normalization before having its dimensionality reduced using principal component...

Henry Aceves
11 Feb 2026

HW2

Question explored: “How do tSNE coordinates change as you increase or decrease the perplexity?”

Sofia A
03 Feb 2026

Comparing high vs. low PC1 loading genes

Aim: How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue?...

Maanya Bajaj
03 Feb 2026

Spatial Organization of Genes with Extreme PCA Loadings

1. What data types are you visualizing? I’m visualizing both quantitative and categorical data. The dataset has quantitative spatial information of x and y coordinates...

Isabella G
03 Feb 2026

Spatial Expression of Avpr2, Inmt, and Rnf24

1. What data types are you visualizing? I am visualizing 3 data types. First, categorical data of 3 genes: Avpr2, Inmt, and Rnf24. Second, spatial...

Maanya Bajaj
28 Jan 2026

HW1 Submission

1. What data types are you visualizing? I am visualizing quantitative data of the gene expression counts of the Cyp2e1, Cyp4b1, and Slc22a6 genes for...

Yuki H
26 Jan 2026

HW1

1. What about the data would you like to make salient through this data visualization? Since I am working with Visium 10x geneomics data, every...

Lillian L
25 Jan 2026

All Visualizations

Validating Identity of Splenic White Pulp with B-cell and T-cell Markers

The CODEX dataset for spleen tissue was analyzed in this visualization. To identify a tissue structure in the data, a combination of methods were utilized such as normalization, PCA and...

Maanya Bajaj
19 Feb 2026

HW5: Identifying White Pulp and Red Pulp in CODEX Data

Description The White Pulp (Clusters 1 & 2):

Lillian L
19 Feb 2026

Identification of splenic white pulp through dimensionality reduction, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, and differential expression analysis to characterize clusters of interest based on...

Jamie L
19 Feb 2026

Identifying Tissue Type in Spleen

Description My full analysis followed a similar pipeline to the previous homework assignments. I began by performing quality control by removing cells in the bottom 1% of total protein expression...

Isabella G
19 Feb 2026

Identification of Human Splenic White Pulp from CODEX Spatial Proteomics

This data visualization of a spleen CODEX dataset highlights two cell clusters, B cells and T cells, identifying the tissue as white pulp based on protein markers and physical organization...

Grace X
19 Feb 2026

Identification of CODEX data as White Pulp

Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure is represented in the CODEX...

Emma Meihofer
19 Feb 2026

title

Description In this analysis, I used CODEX spatial proteomics data from spleen tissue to identify what tissue structure was present. After filtering out low-quality cells by area, I performed tSNE...

Catherine Cheng
19 Feb 2026

HW 5

###Summary To identify the tissue structure represented in this CODEX dataset, I performed quality control, dimensionality reduction, k means clustering, differential expression analysis, and cell-type signature scoring. Proteins with low...

Aarna Sanghai
19 Feb 2026

HW4

In HW3, I worked with the Visium dataset and identified a cluster corresponding to thick ascending limb (TAL) / distal tubule epithelial cells. That cluster stood out because of its...

John-Paul Akinbami (JP)
18 Feb 2026

HW4

Discussion For the previous homework assignments, I was using the Xenium dataset. For this assignment, I switched to the Visium dataset. To identify the same cell type that I found...

Tavishi
17 Feb 2026

Visium Dataset: Finding PT epithelial cells

#Use/adapt your code from HW3 to identify the same cell-type in the other dataset. Create a multi-panel visualization and write a description to convince me you found the same cell-type...

Sakshi Singhal
17 Feb 2026

A multipanel data visualization distinguishing the ascending loop of henle in mouse kidney tissue

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Saadia J
17 Feb 2026