Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Kalen Clifton
Office Hours: 10:00am-10:50am Monday, Wednesday, and Friday. See Slack for location details.
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Slack for location details.

Course Details
☞ see Course tab


All Visualizations

Description of HW5

In order to determine cell type from cell cluster, I aimed to find particular genes which are both highly specific to one type of breast cell and also present in...

Identification of a Cluster Associated with CD8+ T cells

Description of my multi-panel plot Here, I identified a cluster that seems to be CD8+ T cells. In order to generate the plot above, I normalized the raw gene expression...

Determining Cell Type with Kmeans Approach

I used kmeans clustering to identify different cell types by looking at clusters in my data. I preproceessed my data by normalizing by total gene count and putting everything on...

Identifying an Epithelial Cell Population within the Breast Tissue Dataset

After performing kmeans clustering on my dataset, I randomly decided on investigating cluster 5 of my kmeans clustering. After a thorough analysis, I have concluded that this cluster is likely...

Homework 5

Description

Identification of the Breast Glandular Cells

The visualization presented above comprises eight panels, all of which provide evidence to support the hypothesis that cluster 1 corresponds to breast glandular cells, a type of epithelial cell (1)....

Validation of cell type clustering via differential gene expression

The purpose of this visualization to present the usage of differential gene expression to validate cell type identification in k-means and tsne analysis of the dataset. The quantitative data of...

Comparison of Dimensionality Reduction on Normal, Log10 Transformed and ScaleD Gene Expression

Should I normalize and/or transform the gene expression data (e.g. log and/or scale) prior to dimensionality reduction?

Comparison of using normalized and unnormalized data on gene expression clustering in t-SNE graphs

What data types are you visualizing? I am visualizing quantitative data of the comparative gene expression of two genes KRT7 and PTPRC. I am also visualizing the quantitative data of...

Effects of normalizing by gene count in the reduced dimension visualization

What data types are you visualizing? I present quantitative data of the PCA and tSNE reduced dimension applied to the raw gene expression data and the normalized by gene count...

The effects of log transformation, scaling and normalization prior to PCA and non-linear dimensionality reduction (tSNE) on PDGFRB gene expression data

What data types are you visualizing? I am visualizing quantitative data of 2 dimensional reduction through PCA and tSNE of original PDGFRB expression for each cell, quantitative data of 2...