Welcome

Welcome to the Course Website for EN.580.428 Genomic Data Visualization!

As the primary mode through which analysts and audience members alike consume data, data visualization remains an important hypothesis generating and analytical technique in data-driven research to facilitate new discoveries. However, if done poorly, data visualization can also mislead, bias, and slow down progress. This hands-on course will cover the principles of perception and cognition relevant for data visualization and apply these principles to genomic data, including large-scale single-cell and spatially-resolved omics datasets, using the R statistical programming language. Students will be expected to complete class readings, create weekly data visualizations as homework assignments, and make a major class presentation.

Course Information

Course Staff: Prof. Jean Fan and Rafael dos Santos Peixoto
Lectures: 8:00am-9:50am Monday, Wednesday, and Friday. See Canvas for location details.
Office Hours: 10:00am-10:50am Monday, Wednesday, and Friday. See Canvas for location details.

Course Details
☞ see Course tab


All Visualizations

Gganimate on HW3 of Varying PC's Influence on tSNEs

If I perform non-linear dimensionality reduction on PCs, what happens when I vary how many PCs should I use?​ If I perform non-linear dimensionality reduction on varying PCs the clustering...

Spatial Distribution of Gene Expression

What data types are you visualizing? I am visualizing quantitative data of the log-10-transformed expression level of the TP53 gene, quantitative data of the total gene expression, and spatial data...

Using CRAWDAD on pikachu dataset

Apply SEraster, STalign, or CRAWDAD to a spatial omics dataset from the class I applied CRAWDAD to the pikachu dataset. After normalizing the dataset, I performed Kmeans clustering using the...

Comparison of Loading Values on PC1 in Raw and Log Transformed Data

Write a a brief description of your figure so we know what you are visualizing. You do not need to use the vocabulary terms from Lesson 1.

Dimensionality Reduction using GGanimate

Description of figure In the gif, I visualized the pca (linear) dimensionality reduction in 2D space which transitions into the tSNE (nonlinear) dimensionality reduciton in 2D space. The axes for...

Comparing the effect of tSNE on varying number of PCs:KRT7 expression (using gganimate)

If I perform non-linear dimensionality reduction on PCs, what happens when I vary how many PCs should I use?

Normalizing and Transforming Gene Expression

Description of my visualization:

gganimate: Visualizing IGKC in tSNE Space with Non-linear Dimensionality Reduction on Varying Numbers of PCs

What data types are you visualizing? For the plots on the left side of the animation, I am visualizing the quantitative data of the X1 and X2 tSNE embedding values,...

Visualizing Top Genes Driving Reduced Dimensionality Components

What are you visualizing? What genes are driving my reduced dimensionality components? I used gganimate to visualize the gene expression levels of genes that are driving the principal components (PC1...

Using SEraster on Xenium Breast Cancer Data

Brief Description I decided to utilize SEraster on a Xenium breast cancer dataset (https://www.10xgenomics.com/products/xenium-in-situ/preview-dataset-human-breast) to rasterize gene expression. Plotted below is the non-rasterized gene expression compared to the rasterized gene...

Linear Dimensionality Reduction on original, normalized, and log transformed normalized data of gene expression

What happens if I do or do not normalize and/or transform the gene expression data (e.g. log and/or scale) prior to dimensionality reduction?