Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

HW4

Changes made to code from HW3: Switched from Visium to Xenium and had to cut down data frame to first 10,000 cells in order to make the dataset more manageable...

Validating Identity of Proximal Convoluted Tubule Segments

For this visualization, the Visium dataset is analyzed. The previous visualization with the Xenium dataset analysis brought to light the PCT cell-type, particularly the early segments of proximal tubules located...

Llucero3

In HW3, I identified a transcriptionally distinct cell type using an unsupervised workflow consisting of library-size normalization (CP10K + log1p), highly variable gene selection, PCA, tSNE embedding, and k-means clustering....

HW4: Identifying Straight Proximal Tubule in Visium and Xenium Datasets

Description To identify the Straight Proximal Tubule (SPT) in the Xenium dataset using my HW3 Visium code, I increased the number of k-means clusters from 5 to 7 centers =...

Identification of kidney collecting duct principal cells through dimensionality reduction, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, and differential expression analysis to characterize a cluster of interest based...

Identifying a cluster of Proximal Tubule Epithelial Cells

Description To identify the same group of cells in the Visium dataset, I adapted the same analysis I used previously for the Xenium dataset. Like before, I normalized the raw...

Identifying Proximal Tubule Cell Populations in Spot-Resolution Spatial Transcriptomics

For these visualizations, I switched over to the spot-resolution Visium dataset from the single-cell Xenium dataset. I was able to successfully apply the same pre-processing pipeline and maintain parameters from...

Identification of Thick Ascending Limb Cells in Xenium Dataset

Write a description to convince me you found the same cell-type. You will likely need to change your code from HW3. Write a description of what you changed and why...

Adapting Cell Type Characterization from Visium Dataset to Xenium Dataset

Visium –> Xenium I made three main code changes when switching from Visium to Xenium. First, I updated the data source to Xenium-IRI-ShamR_matrix.csv.gz. Second, I reduced point size from 0.6...

HW 4

Write a description to convince me you found the same cell-type. Cluster 2 most likely represents proximal tubule S3 epithelial cells. This is supported by its inner stripe/outer medulla location...

HW4: Multi-Panel Data Visualization of a Transcriptionally Distinct Proximal Tubule Epithelial Cell Cluster in the Visium Dataset

Use/adapt your code from HW3 to identify the same cell-type in the other dataset. Create a multi-panel data visualization and write a description to convince me you found the same...

HW4

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...