Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Rafael dos Santos Peixoto
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and Friday. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Visualizing Tighter Clustering Using More PCs

Description This gif is a visualization of what happens when you run nonlinear dimensionality reduction (in this case tSNE) on an increasing number of principal components (PCs) after principal component...

Locating glandular epithelial cells within the Pikachu data

Write a description of what you changed and why you think you had to change it. For Homework 5, I am switching from the Eevee dataset to the Pikachu dataset....

Analysis of most expressed gene (POSTN): PCA, t-SNE, and UMAP Plots

gganimate animation from hw3 Animation of the expression of the top gene POSTN can be expressed across different dimensionality reduction plots (PCA, t-sne, and UMAP).

Analysis of Sequencing Dataset Clustering, AGR3 Expression, and cell-typing

Description For this assignment, I switched from the Pikachu to the Eevee dataset. I previously found that most of the variation was captured by about 20 PCs following PCA. With...

ANXA3 spatial expression

Code changes from pikachu to eevee dataset

Multi-Panel Data Visualization

We are visualizing clusters in reduced dimensional space and identified cell cluster 1. We found this cell cluster to have several upregulated genes, including CXCL12, which plays a role in...

Differentially expressed gene (DSC2) in cell clusters by k-means

Write a description to convince me that your cluster interpretation is correct. Your description may reference papers and content that allowed you to interpret your cell cluster as a particular...

Identification of Epithelial Cells in Breast Cancer Tissue

Figure Description: Looking at the data in the pikachu data set, we can see that there are various cells, each with their own gene expression and cell type. However, our...

Differential Expression of APOD Gene in Barcode Data

Figure Description Figure 1 shows an elbow plot using points to display the withiness based on the k value. Figure 2 shows the 4 clusters created via kmeans clustering of...

Interpreting cell cluster through dimensionality reduction and differential gene expression analysis

Things to know about my visualization The visualization aims to provide evidence for characterizing a specific cluster of cells and understanding its gene expression profile. For data pre-processing, I first...

Multi-panel visualization of an immune cell cluster with differentially expressed genes

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). This figure is a...

Multi-Panel Data Visualization of Breast Cancer Cell Cluster and Genes

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing).