Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Caleb Hallinan
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Validating Efficacy of Spatial Capture Spot Transcriptomic Deconvolution in Interrogating Cell Type

In the previous analysis (HW3), I depicted a visium dataset in embedded space and clustered to identify a B-cell-related cell type. Here, I perform STdeconvolution to parse the cell type...

Exploring t-SNE Embeddings with Varying Numbers of Principal Components

Write a brief description of your figure so we know what you are visualizing.

Using Clustering and Deconvolution to Identify B-Cell Populations in Spatial Transcriptomics

In HW4, I identified a B-cell population within the Eevee dataset, based on the upregulation of CD79A gene expression. I used K-means clustering with K=4 to classify spatial transcriptomics spots...

Linear vs Nonlinear Dimension Reduction

I used the Pikachu dataset, an imaging based dataset. Therefore, I did not normalize the gene expression, as it was already for each cell. I then used a linear technique...

Deconvolution

The data was normalized by counts and then log transformed. Deconvolution was performed on the raw data.

PCA, tSNE, and Spatial Distribution Animation of Pikachu Dataset

This visualization explores the differences between linear and nonlinear dimensionality reduction techniques for analyzing spatial transcriptomics data. Specifically, the animation compares the spatial distribution of cells, their representation in t-SNE...

Effect of Varying Number of Principal Components on t-SNE Visualization of Spatial Transcriptomics Data

Description visualization This figure visualizes the effect of varying the number of principal components (PCs) used in t-SNE for dimensionality reduction on a spatial transcriptomics dataset. The animation transitions smoothly...

Visualizing the Impact of the Number of PCs used to perform Nonlinear Dimensionality Reduction using tSNE

Write a brief description of your figure so we know what you are visualizing.

EC1- tSNE on genes vs on PCs

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing).