Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Caleb Hallinan
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Discovering Epithelial cells

[description] Figures A, B, and C share a common legend and analyze the dataset at the cluster level, where green highlights the cluster of interest and gray represents all other...

hw 3 DEG analysis

Description of analysis This differential gene expression analysis explores a transcriptionally distinct cluster of cells related to the cardiac conduction system, with a focus on the GJA1 gene, which encodes...

HW3: Identifying and analysing cluters via K-means and dimensionality reduction

<!– Create a multi-panel data visualization that includes at minimum the following components: A panel visualizing your one cluster of interest in reduced dimensional space (PCA, tSNE, etc) A panel...

Homework 3: Differentially Expressed Genes analysis

[description] Those panels present a comprehensive visualization of Cluster 0 and its association with the gene SFRP4 through a combination of UMAP, spatial, and gene expression analysis. The top-left UMAP...

Spatial Transcriptomics Reveals a Distinct Epithelial Cell Population Defined by ELF3 Expression: A Multi-Dimensional Analysis of the Cluster in Interest

1. Describe your figure briefly so we know what you are depicting. Write a description to convince me that your cluster interpretation is correct.

Comparing PCA and t-SNE Dimensionality Reduction on Spatial Transcriptomics Dataset

In many tissues, cells with similar gene expression patterns tend to cluster together both in a dimensionality-reduced “gene expression space” (like the PCA or t-SNE plots) and in their actual...

HW2: Exploring PC1 Loading Vs. Gene Expression Variance Before and After Normalization

1. How do the gene loadings on the first PC relate to features of the genes such as its variance? Using the raw data, when the gene expression variance is...

Homework 2 submission

[description] In my visualization, I use points as the geometric primitive, angle and color for visual channel. The x-axis represents the PCA loadings for each gene, while the y-axis shows...

Making a Multi-Panel Data Visualization

The visualization effectively conveys relationships between gene expression and spatial organization by utilizing dimensionality reduction (PCA) to simplify high-dimensional gene expression data. The PCA scatter plot helps distinguish patterns in...

Comparison of Scaled and Unscaled PCA: Gene Mean Expression, Variance, and PC1 Loadings

1. What data types are you visualizing? I am visualizing quantitative data, which includes log-transformed mean expression (x-axis), log-transformed variance (y-axis), and PC1 loading values (color hue).