Streamlining Data Science

Exploring the Frontiers of Data Analysis

Data Science Portfolio

As a data scientist, I believe in the power of open source, open science, and open data. My recent focus has been on productionizing machine learning and deep learning tools for versatile use across various datasets.

Current Projects

A. Data Science for Bioinformatics (OmixHub)

Vision

  • Build well-documented, user-friendly modules for omics datasets
  • Provide tutorials for early-stage data scientists in bioinformatics
  • Standardize ML/DL and bioinformatics tools

Implemented Projects

Data source: Genomic Data Commons

Codebase: OmixHub

Code RTD: readthedocs

Website omixhub.com

B. Data Science for E-Commerce

Vision

  • Develop generalizable tools for e-commerce analysis
  • Demonstrate advanced Python programming (OOP) with real-world data

Projects

Data source: Kaggle

Codebase: E-Commerce Projects

C. Data Science for Medical Diagnosis

Vision

  • Assemble deep learning algorithms for medical imaging analysis
  • Create unified modules for 2D and 3D image processing

Projects

Data source: Contact via email

Codebase: AI for Medicine

D. Grad School Projects