Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Suki
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

HW EC1

Instructions Make a new data visualization of your spatial transcriptomics dataset to explore one of the following questions. Selected question: “(4) If I perform non-linear dimensionality reduction on PCs, what...

Deconvolution and Multi-Modal Comparison of the Renal S3 Segment

Note, the png is named “EC2_ooni5.png”, as a desired name was not specified in the HW powerpoint.

Xenium Dimensionality Reduction with or without normalization and transformation

This animation depicts how normalization and log-transformation steps are essential in accurate dimensionality reduction that allows biological interpretation rather than being obscured in noise. Without normalization, the PCA space appears...

Using animation to visualize the importance of data normalization and log-transformation for quality control

1. Figure description This data visualization uses animation to visualize the effect of data normalization and log-transformation before performing principal component analysis (PCA) and k-means clustering. The data being analyzed...

HW5

1. Figure Description I created a multipanel figure to show the distribution of B cells and T cells in thhe spleen. Throughout, I used the gestalt principle of similarity to...

HW5

Tissue Structure Identified: White Pulp

Characterizing Tissue Structure Identity of CODEX Dataset using K-means Clustering and Differential Expression Analysis

Description of Data Visualization: The CODEX dataset was normalized (log10(CPM+1)) by library size and its dimensionality was reduced using tSNE (seed(123)). The normalized dataset was also reduced using PCA, from...

HW 5

Write a description to convince me you found the same cell-type. Cluster 4 most likely represents white pulp due to its association with CD20 expression, a B-cell marker found in...

CODEX Spleen Data: White Pulp

##Description I filtered out the bottom and top 1% of cells by total signal and cell area to remove dead cells, doublets, and extreme outliers, retaining 9,599 out of 10,000...

HW5: Multi-Panel Data Visualization of the White Pulp Tissue Structure in the CODEX Data

Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure is represented in the CODEX...