Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Caleb Hallinan
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

PCA, tSNE, and Spatial Distribution Animation of Pikachu Dataset

This visualization explores the differences between linear and nonlinear dimensionality reduction techniques for analyzing spatial transcriptomics data. Specifically, the animation compares the spatial distribution of cells, their representation in t-SNE...

Effect of Varying Number of Principal Components on t-SNE Visualization of Spatial Transcriptomics Data

Description visualization This figure visualizes the effect of varying the number of principal components (PCs) used in t-SNE for dimensionality reduction on a spatial transcriptomics dataset. The animation transitions smoothly...

Visualizing the Impact of the Number of PCs used to perform Nonlinear Dimensionality Reduction using tSNE

Write a brief description of your figure so we know what you are visualizing.

EC1- tSNE on genes vs on PCs

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing).

HW5: Identifying Cell Types and Tissue Structures in CODEX data

1. Figure Description. Figure A: 6 clusters in physical space. The axes represent x and y position. Figure B: 6 clusters in t-SNE space. The axes represent X1 and X2....

Identifying White Pulp Tissue Structure in CODEX Data

1. Figure Description and Interpretation I have performed quality control, dimensionality reduction using t-SNE, k-means clustering with optimal k=9 (from an elbow plot), and differential expression analysis on the CODEX...

Analysis of CODEX dataset

Based on the CODEX data, I hypothesize this tissue sample is taken from the white pulp region of the spleen, which is surrounded by red pulp. Some evidence/reasoning is outlined...

Analyzing Immune Cell Clusters in CODEX Dataset

Visualization Summary In this visualization, I analyzed two cell types within the CODEX dataset: T cells and B cells. First, the genes in the dataset were normalized, log-transformed, and clustered...

Identifying tissue structure in spleen tissue sample

1. Written Answer I decided to use techniques such as t-SNE and dimensionality reduction as well as normalizing the protein expression data to figure out the tissue structure in the...