Code
Statistical Methods in Computational Biology | Spring 2024
scRNA-seq Data Normalization Methods Comparison [real data] [simulated data], and Simulation and Statistical Derivation of a Negative Binomial scRNA-seq Model
Dimension Reduction Methods: PCA, GLM-PCA, MDS, NMF, ICA, PNMF, t-SNE, and UMAP
Clustering Algorithms: k-means, k-means++, hierarchical clustering, and Seurat clustering; Evaluation: ARI and Silhouette Score
Generalized Linear Models [course link] | Spring 2024
Comparison of Four GLM Models for Microbiome Sequencing Data
- In this project, I compared four GLMs (Quasi-Poisson, Negative Binomial, Zero-Inflated Poisson, and Zero-Inflated Negative Binomial) for differential abundance analysis of microbiome sequencing data at the genus and species levels, and assessed their type I error rates using simulated data.
Linear Models, GLM, Survival Analysis, and Longitudinal Data Analysis
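The type I error assessment mentioned in the microbiome project above can be illustrated with a minimal sketch. It is not the project's code. It assumes a two-group design with no true difference, negative binomial counts, and hypothetical settings (`mean=20`, `size=2`, 30 samples per group). It covers only the Poisson and Quasi-Poisson Wald tests, which have a closed form for a two-group comparison, and it uses a normal reference distribution instead of the t distribution that some software uses for Quasi-Poisson. All function names are illustrative.

```python
import math
import random


def sample_negative_binomial(rng, mean, size):
    """Draw one NB count as a gamma-Poisson mixture (variance = mean + mean**2 / size)."""
    lam = rng.gammavariate(size, mean / size)  # gamma-distributed Poisson rate
    # Knuth's multiplication method for a Poisson(lam) draw; fine for moderate lam.
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def wald_p_value(estimate, se):
    """Two-sided p-value from a normal approximation (an assumption of this sketch)."""
    if se == 0.0:
        return 1.0 if estimate == 0.0 else 0.0
    return math.erfc(abs(estimate) / se / math.sqrt(2.0))


def two_group_p_values(group_a, group_b):
    """Return (Poisson, Quasi-Poisson) p-values for the log rate ratio between groups."""
    sum_a, sum_b = sum(group_a), sum(group_b)
    if sum_a == 0 or sum_b == 0:
        return 1.0, 1.0  # log rate ratio undefined; count as a non-rejection
    mean_a, mean_b = sum_a / len(group_a), sum_b / len(group_b)
    log_ratio = math.log(mean_b / mean_a)
    # Standard error of the group coefficient in a two-group Poisson log-linear model.
    se_poisson = math.sqrt(1.0 / sum_a + 1.0 / sum_b)
    # Pearson dispersion estimate with n - 2 residual degrees of freedom.
    pearson = sum((y - mean_a) ** 2 / mean_a for y in group_a)
    pearson += sum((y - mean_b) ** 2 / mean_b for y in group_b)
    dispersion = pearson / (len(group_a) + len(group_b) - 2)
    se_quasi = se_poisson * math.sqrt(dispersion)
    return wald_p_value(log_ratio, se_poisson), wald_p_value(log_ratio, se_quasi)


def estimate_type1_error(n_sims=500, n_per_group=30, mean=20.0, size=2.0,
                         alpha=0.05, seed=1):
    """Simulate under the null (same distribution in both groups) and
    return the fraction of rejections for (Poisson, Quasi-Poisson)."""
    rng = random.Random(seed)
    rejections = [0, 0]
    for _ in range(n_sims):
        group_a = [sample_negative_binomial(rng, mean, size) for _ in range(n_per_group)]
        group_b = [sample_negative_binomial(rng, mean, size) for _ in range(n_per_group)]
        for i, p in enumerate(two_group_p_values(group_a, group_b)):
            if p < alpha:
                rejections[i] += 1
    return rejections[0] / n_sims, rejections[1] / n_sims


if __name__ == "__main__":
    poisson_rate, quasi_rate = estimate_type1_error()
    print(f"Poisson type I error:       {poisson_rate:.3f}")
    print(f"Quasi-Poisson type I error: {quasi_rate:.3f}")
```

Because the simulated counts are overdispersed, the plain Poisson test should reject far more often than the nominal 5% level, while the Quasi-Poisson test should stay much closer to it. A fuller comparison would fit the negative binomial and zero-inflated models with a dedicated GLM library.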
Data Science [course link] | Winter 2024
- Linux Shell Commands and HPC
- Ingesting Big Data Files, and Querying and Analyzing Databases
- Big Data Visualization, and Developing Shiny Apps for Interactive Graphics
- Statistical Learning in R
Statistical Simulation | Fall 2024
Linear Models | Winter 2024
- [ Poster ] Assessing Key Factors Associated with Depression before Adjuvant Therapy in Women with Breast Cancer
Data Management | Fall 2023
- [ Report ] Modeling Depression Risk among Noninstitutionalized US Citizens: A Retrospective Study by SAS