Category HW1

HW4

Discussion For the previous homework assignments, I was using the Xenium dataset. For this assignment, I switched to the Visium dataset. To identify the same cell type that I found...

HW3

Discussion

HW1

1. What data types are you visualizing? I am plotting 2D spatial coordinates and quantitative data through ACOX1 expression counts.

A data visualization for sequencing spatial transcriptomics data

1. What data types are you visualizing? Categorical- Gene type Aqp1, Slc34a1 (in the second plot) Quantitative data- Expression levels of genes Aqp1 and Slc34a1 in each cell The euclidean...

HW1

1. What about the data would you like to make salient through this data visualization? I wanted to make the total level of fibrosis-related gene expression for each spot apparent...

HW1

1. What about the data would you like to make salient? The visualization is designed to make spatial differences in gene expression patterns salient. By placing Aqp1 and Nphs2 in...

Kidney Oxygen Depletion Visualization

1. What data types are you visualizing?

Spatial Expression of Avpr2, Inmt, and Rnf24

1. What data types are you visualizing? I am visualizing 3 data types. First, categorical data of 3 genes: Avpr2, Inmt, and Rnf24. Second, spatial data regarding the x and...

Spatial Visualization of Gene Expression Across Tissue

1. What data types are you visualizing? I would like to make salient the spatial distribution of gene expression within the sham tissue, emphasizing how expression of biologically relevant genes...

Spatial distribution of transcription factor Krüppel‐like factor 5 (KLF5) gene expression in coronal kidney section

1. What data types are you visualizing? The represented data type is quantitative (x and y spatial positions of cells as well as levels of KLF5 gene expression on the...

HW1

1. What data types are you visualizing? This visualization comprises categorical data for the gene marker and quantitative data for the expression values.

Spatial Gene Expression Patterns of Umod and Cyp4b1 (Xenium Dataset)

1. What data types are you visualizing? I’m visualizing both quantitative and categorical data. The original dataset contains quantitative data (the x and y coordinates) for each spot in the...

HW 1 Spatial Distribution of Aqp1 and Aqp2 Expressing Cells

1. What are the data types represented? I am visualizing the cells in a kidney tissue cross section that contain the genes for two water channels, Aqp1 and Aqp2. I...

HW1

1. What data types are you visualizing? I am visualizing quantitative gene expression count data (non-negative integer counts), specifically the total counts per gene aggregated across all spatial locations/cells. 2....

HW1

1. What data types are you visualizing? I am visualizing quantitative data representing the expression levels of two genes, Aqp1 and Aqp2, for individual cells. Categorical data is the 2...

Visualizing canonical markers and spatially variable genes in Visium data

1. What data types are you visualizing? I’m visualizing quantitative gene expression values for multiple genes across quantitative spatial coordinates (x, y) for each visium spot. The gene expression values...

HW1

1. What data types are you visualizing? I am visualizing spatial data and quantitative data in this plot. The spatial data consists of the X and Y coordinates in micrometers...

Abundance of and spatial relationship between co-expression of genes within the Kynurenine Pathway

1. What data types are you visualizing? Categorical - each of the three genes and their respective overlaps are split into categories. Quantitative - the spot count for each gene...

Spatial Expression of Cyp4b1 in Kidney

1. What data types are you visualizing? I am visualizing quantitative data in terms of the gene expression levels of Cyp4b1 in each cell across the kidney sample and spatial...

Spatial Expression of Slc12a1

1. What data types are you visualizing?

HW1 Submission

1. What data types are you visualizing? I am visualizing quantitative data of the gene expression counts of the Cyp2e1, Cyp4b1, and Slc22a6 genes for each cell.

A descriptive title

1. What about the data would you like to make salient? I would like to make salient the spatial expression pattern of the gene Acox1 across individual cells in the...

HW1

1. What data types are you visualizing?

HW1

1. What about the data are you trying to make salient through this data visualization? The goal of this visualization was to make salient the spatial location of the highest...

HW1

1. What about the data would you like to make salient through this data visualization? Since I am working with Visium 10x geneomics data, every row is a spatial spot...

HW1

1. What data types are you visualizing? I’m plotting the quantitative data of the expression count of genes for each cell.

A descriptive title

1. What data types are you visualizing? I am visualizing quantitative data of the expression count of the ERBB2 gene for each cell, quantitative data of the area for each...

Category HW3

HW 3

Figure description Panel A describes the PCA plot between cluster 7 and all other cells, depicting the distinct separation of cluster 7 in the reduced dimensional space. Panel B visualizes...

HW3: Multi-Panel Data Visualization of a Transcriptionally Distinct Proximal Tubule Epithelial Cell Cluster in the Xenium Dataset

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Identifying a kidney cortical tubule region using marker genes

The data were normalized and log-transformed. I then ran PCA on the normalized matrix, used the scree plot of PC standard deviations to pick a safe cutoff (PC = 10)...

A multipanel data visualization distinguishing the ascending loop of henle in mouse kidney tissue

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Highlighting Proximal Convoluted Tubule Segments with SLC gene family

Description This multi-panel visualization combines a multitude of concepts essential in spatial transcriptomic data analysis and visualization, including normalization/log-transformation, dimensionality reduction, k-means clustering, and differential expression. By combining these methods,...

Identification and Spatial Characterization of a Transcriptionally Distinct Cell Cluster

This figure explores a transcriptionally distinct cluster of Visium spots identified using PCA, t-SNE, and k-means clustering. In the t-SNE plot (top left), the cluster of interest appears as a...

HW3

Description I’m depicting the identification and characterization of Cluster 2 in the Visium spatial transcriptomics data from a mouse kidney sample. The top row shows the discovery and validation of...

Identification of kidney collecting duct principal cells through principal component analysis, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis, k-means clustering, and differential expression analysis to characterize a cluster of interest based on gene expression patterns. In the...

HW3

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Identifying a cluster of Proximal Tubule Epithelial Cells

Description To identify and characterize a transcriptionally distinct cell cluster from the Xenium dataset, I first normalized the raw counts and did PCA for dimensionality reduction. Based on the scree...

Visualization of Proximal Tubule Cells in Kidney Tissue Sample

Description of Data Visualization: The raw Xenium dataset was normalized according to library size and log normalization before having its dimensionality reduced using principal component analysis.

Identification of Proximal Tubule Cells in Kidney Tissue

In this data visualization, I explored the gene expression of Cluster 1 from a single-cell resolution spatial kidney tissue sample. The two uppermost plots highlight this cluster of interest by...

Identification of Thick Ascending Limb Cells in Visium Spatial Transcriptomics of Mouse Kidney

1. Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description...

Identification of Proximal Tubule Cells in Kidney Tissue

In this data visualization, I explored the gene expression patterns of Cluster 2 from a Visium spatial transcriptomics dataset of kidney tissue. The visualization consists of five integrated panels that...

HW3

Instructions: Create a multi-panel data visualization that includes at minimum the following components:​ (1) A panel visualizing your one cluster of interest in reduced dimensional space (PCA, tSNE, etc)​, (2)...

HW 3

Figure Description and Interpretation This figure integrates dimensionality reduction, spatial mapping, and differential expression analysis to characterize an Aqp2-positive cell population. In PCA space, cells form distinct clusters, with Cluster...

HW3

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

Xenium t-SNE and Cyp7b1

Panel 1 represents t-SNE clustering, Panel 2 represents cluster 2 spatial mapping. Panel 3 is top DE genes in cluster 2. Panel 4 is t-SNE mapping of the top cluster...

Category HW4

HW4

In HW3, I worked with the Visium dataset and identified a cluster corresponding to thick ascending limb (TAL) / distal tubule epithelial cells. That cluster stood out because of its...

Visium Dataset: Finding PT epithelial cells

#Use/adapt your code from HW3 to identify the same cell-type in the other dataset. Create a multi-panel visualization and write a description to convince me you found the same cell-type...

A multipanel data visualization distinguishing the ascending loop of henle in mouse kidney tissue

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

HW4

Changes made to code from HW3: Switched from Visium to Xenium and had to cut down data frame to first 10,000 cells in order to make the dataset more manageable...

Validating Identity of Proximal Convoluted Tubule Segments

For this visualization, the Visium dataset is analyzed. The previous visualization with the Xenium dataset analysis brought to light the PCT cell-type, particularly the early segments of proximal tubules located...

HW4: Identifying Straight Proximal Tubule in Visium and Xenium Datasets

Description To identify the Straight Proximal Tubule (SPT) in the Xenium dataset using my HW3 Visium code, I increased the number of k-means clusters from 5 to 7 centers =...

Identification of kidney collecting duct principal cells through dimensionality reduction, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, and differential expression analysis to characterize a cluster of interest based...

Identifying a cluster of Proximal Tubule Epithelial Cells

Description To identify the same group of cells in the Visium dataset, I adapted the same analysis I used previously for the Xenium dataset. Like before, I normalized the raw...

Identifying Proximal Tubule Cell Populations in Spot-Resolution Spatial Transcriptomics

For these visualizations, I switched over to the spot-resolution Visium dataset from the single-cell Xenium dataset. I was able to successfully apply the same pre-processing pipeline and maintain parameters from...

Identification of Thick Ascending Limb Cells in Xenium Dataset

Write a description to convince me you found the same cell-type. You will likely need to change your code from HW3. Write a description of what you changed and why...

Adapting Cell Type Characterization from Visium Dataset to Xenium Dataset

Visium –> Xenium I made three main code changes when switching from Visium to Xenium. First, I updated the data source to Xenium-IRI-ShamR_matrix.csv.gz. Second, I reduced point size from 0.6...

HW 4

Write a description to convince me you found the same cell-type. Cluster 2 most likely represents proximal tubule S3 epithelial cells. This is supported by its inner stripe/outer medulla location...

HW4: Multi-Panel Data Visualization of a Transcriptionally Distinct Proximal Tubule Epithelial Cell Cluster in the Visium Dataset

Use/adapt your code from HW3 to identify the same cell-type in the other dataset. Create a multi-panel data visualization and write a description to convince me you found the same...

HW4

Describe your figure briefly so we know what you are depicting (you no longer need to use precise data visualization terms as you have been doing). Write a description to...

hw4: cortical tubule area in Xenium data

I’ve been analyzing Visium data so far, and this time I switched to Xenium data to try to identify the same cell type I found in HW3 using Visium. However,...

HW 4

Figure Description and Interpretation In Homework 3, I analyzed the Xenium dataset using k-means clustering with k = 10 and identified cluster 3 as an Aqp2-enriched population corresponding to collecting...

Characterizing Cell Type in Cluster 2 of Visium Dataset Using Dimensionality Reduction and Differential Expression Analysis

Description of Data Visualization: The Visium dataset was normalized (CPM) and its dimensionality was reduced using principal component analysis. The data was then plotted in 2 dimensional (PC1, PC2) space...

Visium t-SNE and Cyp7b1

Panel 1 represents t-SNE clustering, Panel 2 represents cluster 4 spatial mapping. Panel 3 is top DE genes in cluster 4. Panel 4 is t-SNE mapping of the top cluster...

Category HW5

HW5

1. Figure Description I created a multipanel figure to show the distribution of B cells and T cells in thhe spleen. Throughout, I used the gestalt principle of similarity to...

HW5

Tissue Structure Identified: White Pulp

Characterizing Tissue Structure Identity of CODEX Dataset using K-means Clustering and Differential Expression Analysis

Description of Data Visualization: The CODEX dataset was normalized (log10(CPM+1)) by library size and its dimensionality was reduced using tSNE (seed(123)). The normalized dataset was also reduced using PCA, from...

HW 5

Write a description to convince me you found the same cell-type. Cluster 4 most likely represents white pulp due to its association with CD20 expression, a B-cell marker found in...

CODEX Spleen Data: White Pulp

##Description I filtered out the bottom and top 1% of cells by total signal and cell area to remove dead cells, doublets, and extreme outliers, retaining 9,599 out of 10,000...

HW5: Multi-Panel Data Visualization of the White Pulp Tissue Structure in the CODEX Data

Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure is represented in the CODEX...

B and T compartments in white pulp

I checked total protein signal and its relationship with cell area, then log-transformed the protein intensities (log1p), ran PCA, embedded cells with tSNE using the top PCs, and applied k-means...

HW5

Discussion

A multipanel data visualization identifying cell types in splenic white pulp

Figure out what tissue structure is represented in the CODEX data. Options include: (1) Artery/Vein, (2) White pulp, (3) Red pulp, (4) Capsule/Trabecula. You will need to visualize and interpret...

HW5

Instructions Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure is represented in the...

Validating Identity of Splenic White Pulp with B-cell and T-cell Markers

The CODEX dataset for spleen tissue was analyzed in this visualization. To identify a tissue structure in the data, a combination of methods were utilized such as normalization, PCA and...

HW5: Identifying White Pulp and Red Pulp in CODEX Data

Description The White Pulp (Clusters 1 & 2):

Identification of splenic white pulp through dimensionality reduction, k-means clustering, and differential expression analysis

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, and differential expression analysis to characterize clusters of interest based on...

Identifying Tissue Type in Spleen

Description My full analysis followed a similar pipeline to the previous homework assignments. I began by performing quality control by removing cells in the bottom 1% of total protein expression...

Identification of Human Splenic White Pulp from CODEX Spatial Proteomics

This data visualization of a spleen CODEX dataset highlights two cell clusters, B cells and T cells, identifying the tissue as white pulp based on protein markers and physical organization...

Identification of CODEX data as White Pulp

Perform a full analysis (quality control, dimensionality reduction, kmeans clustering, differential expression analysis) on your data. Your goal is to figure out what tissue structure is represented in the CODEX...

title

Description In this analysis, I used CODEX spatial proteomics data from spleen tissue to identify what tissue structure was present. After filtering out low-quality cells by area, I performed tSNE...

HW 5

###Summary To identify the tissue structure represented in this CODEX dataset, I performed quality control, dimensionality reduction, k means clustering, differential expression analysis, and cell-type signature scoring. Proteins with low...

Codex Spleen Analysis

Visualized is a UMAP Cluster graph(A), a Global Spatial Map based on cluster labels(B), a Cluster Expression Profile heatmap(C), a Spatial Map of higlighted cell type(B cells—CD20, T cells—CD3e, Vessels—)(D),...

Category HW2

HW2

## In this homework, I wanted to explore the following question: How do the genes with high versus low loadings relate to each other? How are they patterned relative to...

HW 2

1. How do the gene loadings on the first PC relate to features of the genes such as its mean or variance? For context, the data was log-transformed, but not...

HW2 Submission

2. How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Genes with high loadings on...

A multipanel data visualization of PC1 gene loadings against gene features

1. What data types are you visualizing? Categorical- PCA unscaled and PCA scaled Quantitative data- PC1 gene loading Gene expression variability (variance) Mean gene expression

HW2

Question explored: “How do tSNE coordinates change as you increase or decrease the perplexity?”

Kidney PCA vs Loading Size Analysis(Prompt 2)

1. What data types are you visualizing?

Comparing high vs. low PC1 loading genes

Aim: How do the genes with high versus low loadings relate to each other? How are they patterned relative to each other in the tissue? Note: Panel numbers reference the...

Visualizing Spatial Gene Expression with PCA and tSNE

1. How do the gene loadings on the first PC relate to features of the genes such as its mean or variance? The first principal component captures the direction of...

HW2: How Distance Metric in tSNE affects Spatial Tissue Structure

1. What data types are you visualizing? Spatial data: X and Y coordinates representing physical tissue location Quantitative data: t-SNE embeddings (continuous numerical values) and euclidean distances to centroids Categorical...

Using PCA to visualize spatial patterns in high-dimensional gene expression within coronal kidney section

1. What data types are you visualizing? The represented data type is quantitative. I am visualizing the x and y spatial positions of cells in the coronal kidney section (all...

HW2

1. Write a description explaining what you are trying to make salient This visualization shows the expression of the five genes that contribute most positively and most negatively to PC1....

Spatial Organization of Genes with Extreme PCA Loadings

1. What data types are you visualizing? I’m visualizing both quantitative and categorical data. The dataset has quantitative spatial information of x and y coordinates for each spot in the...

Influence of Gene Mean and Gene Variance on PC1

1. What data types are you visualizing? I am visualizing quantitative data of: 1) the mean expression of each gene, averaged across all of all spatial spots within the data...

Comparing High and Low Loading Genes Across Spatial and PCA Spaces

Write a description explaining what you are trying to make salient and why you believe your data visualization is effective, using vocabulary terms from Lesson 1. (Question 2: How do...

HW 2 How Gene Loadings on the First PC Relates to Mean and Variance

1. What data types are you visualizing? I am visualizing quantitative data for the genes, including the PC1 loadings, mean expression per gene, and variance per gene. I also visualized...

HW 2

###Summary PC1 loadings correlate positively with both mean expression (r = 0.52-0.58) and variance (r = 0.44-0.48). This indicates PC1 primarily captures overall expression magnitude - highly expressed genes dominate...

Impact of Principal Component Selection on t-SNE Coordinates

1. What about the data would you like to make salient?

HW2

1. Write a description explaining what you are trying to make salient and why you believe your data visualization is effective using vocabulary terms from Lesson 1.

How do tSNE coordinates change as increasing the number of PCs?

1. What data types are you visualizing? I am answering how do tSNE coordinates change as increasing the number of PCs. I computed PCA on the log-transformed, normalized gene expression...

How PC1 Gene Loadings Relate to Gene Mean Expression and Variance

1. What data types are you visualizing? I am visualizing both categorical and quantitative data. The categorical data is the set of gene names in the left panel, since each...

HW2

external resources: prompt: make setwd as output path, improve the layout structure. 1. What data types are you visualizing? I visualize (1) quantitative gene-level data: PC1 loadings, mean expression, and...

Category HW EC1

An Animation to View the Impact of Normalization on Gene Expression

What happens if I do or not not normalize and/or transform the gene expression data (e.g. log and/or scale) prior to dimensionality reduction?

Xenium Spatial Transcriptomics: Data Normalization Workflow

Note, the gif is named “oluwadurotimioni.gif” instead of “hwEC1_ooni5.gif” as this is what the assignment powerpoint instructed, that we use “names.gif”.

How the Number of PCs Influences t-SNE Embeddings in Visium Data

If I perform non-linear dimensionality reduction on PCs, what happens when I vary how many PCs should I use?

HW EC1: PC1 Loadings of Top 5 Log-Normalized Genes Across PCA Preprocessing Methods

Description Question: What happens if I do or not not normalize and/or transform the gene expression data (e.g. log and/or scale) prior to dimensionality reduction?

Effect of Varying PC Count on tSNE Space - Visium

Write a a brief description of your figure so we know what you are visualizing.

Animation of Non-Linear Dimensionality Reduction (tSNE) on Varied number of PCs

4. If I perform non-linear dimensionality reduction on PCs, what happens when I vary how many PCs I use? Write a brief description of your figure so we know what...

HW EC1

Instructions Make a new data visualization of your spatial transcriptomics dataset to explore one of the following questions. Selected question: “(4) If I perform non-linear dimensionality reduction on PCs, what...

Xenium Dimensionality Reduction with or without normalization and transformation

This animation depicts how normalization and log-transformation steps are essential in accurate dimensionality reduction that allows biological interpretation rather than being obscured in noise. Without normalization, the PCA space appears...

Using animation to visualize the importance of data normalization and log-transformation for quality control

1. Figure description This data visualization uses animation to visualize the effect of data normalization and log-transformation before performing principal component analysis (PCA) and k-means clustering. The data being analyzed...

Category HW EC2

Cross-Platform Comparison of Spatial Transcriptomic Methods for Proximal Tubule Cell Detection

##Description This figure compares three different spatial transcriptomic analysis strategies to identify proximal tubule (PT) cells in mouse kidney tissue using both Visium and Xenium data. From HW3 Research I...

HW EC2

Instructions Make a new data visualization of the multi-cellular spot resolution spatial transcriptomics sequencing dataset. Compare your result with the clustering and differential expression analysis you did previously in HW3/4....

A deconvolution approach to identify Proximal Convoluted Tubule segments

This visualization assesses a coronal kidney tissue section using deconvolution and clustering across Xenium and Visium Platforms. Using STdeconvolve, Panel A depicts 7 distinct cell types in a scatterpie visualization....

Using clustering and deconvolution to visualize cell types and upregulated genes in different data sets

1. Figure description This multi-panel data visualization uses principal component analysis (PCA), t-distributed stochastic neighbor embedding (tSNE), k-means clustering, deconvolution, and differential expression analysis to compare the analyses of different...

Identification of TAL cells using Deconvolution

Compare your result with the clustering and differential expression analysis you did previously in HW3/4. Explain how your results are similar or different. Create a data visualization comparing all three...

Deconvolution and Multi-Modal Comparison of the Renal S3 Segment

Note, the png is named “EC2_ooni5.png”, as a desired name was not specified in the HW powerpoint.