
Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis-generating and analytical technique for facilitating new discoveries in data-driven research. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant to data visualization and apply these principles to genomic data, including large-scale single-cell and spatially resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Caleb Hallinan
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday and Wednesday, and by request. See Canvas for location details.

Course Details
☞ see Course tab

All Visualizations

Spatial Distribution of Cells by Nucleus-to-Cell Ratio Write Up

1. What data types are you visualizing? I am visualizing quantitative data representing the nucleus-to-cell area ratio for each cell, quantitative data of ERBB2 expression levels to indicate gene activity,...

Homework 1 Submission


Mean Gene Expression of Top Genes in COL Gene Group

1. What data types are you visualizing? I wanted to visualize the expression of genes related to collagen in the sequencing dataset. More specifically, I looked at the mean expression...

The Top 10 Genes Expressed in the Eevee Dataset

1. What data types are you visualizing? I am visualizing sequencing-derived gene expression data for the top 10 genes expressed in the Eevee dataset.

Scatter Plot of POSTN vs LUM Expression

1. What data types are you visualizing? I am visualizing quantitative data for the expression levels of the POSTN and LUM genes in all the individual cells and I am...

Relationship between expression of LUM and POSTN

1. What data types are you visualizing? I am visualizing quantitative data of the expression counts of the POSTN and LUM genes for each cell, and the quantitative data of...

Correlation Between COL1A1 and COL1A2 Gene Expression Levels in the Eevee Dataset

1. What data types are you visualizing? I am visualizing quantitative data of the expression levels of the COL1A1 and COL1A2 genes for each cell in the dataset.

Spatial Distribution of Total Genes Expressed

1. What data types are you visualizing?

Spatial Visualization of POSTN Expression in Tissue

1. What data types are you visualizing? The data visualized represents the spatial distribution of MS4A1 expression. This is a gene encoding CD20, a well-known surface marker expressed on B-cells, which...

Comparison of Spatial Gene Expression of ESR1 and PGR

1. What data types are you visualizing? I am visualizing quantitative data representing the expression levels of the ESR1 and PGR genes for each cell. Additionally, I am visualizing spatial...

Expression Level Amongst Top 3 Genes

1. What data types are you visualizing? I am visualizing the expression levels of the top 3 genes (COL1A1, IGHG1, and IGKC). These are all categorical data types. As in they represent...