Data, AI, and Infrastructure for Healthcare and Life Sciences

Building production-ready ML systems for genomics, clinical decision-making, and discovery.

Intelligent Data Systems

Adaptive data systems that unify clinical, genomic, and operational data while supporting traceability, versioning, and auditability. Built to power AI-driven reasoning and decision workflows, not just analytics.

Agentic AI & Decision Intelligence

AI systems that go beyond prediction. We design agentic workflows that combine models, tools, and domain logic to reason, plan, and adapt over time. Inspired by expert systems, built for high-stakes biomedical and clinical deployment.

Production AI Platforms

Cloud-native AI platforms designed for reliability, governance, and continuous improvement. We enable secure orchestration of AI services, outcome-driven learning, and clear separation between experimentation and production. GCP-first.

Selected Work

Protein Language Models for Biological Representation Learning

Built scalable ML pipelines leveraging transformer-based protein language models for structure-aware representation learning and exploratory biological analysis. Implemented distributed training, model evaluation, and deployment workflows on GCP using modern MLOps practices.

PyTorch, Transformers

Transformer-Based Cancer Survival Prediction

Adapted Geneformer (a transformer pre-trained on single-cell RNA-seq data) for bulk tumor survival prediction using TCGA data. Demonstrated strong correlation with clinical outcomes and improved patient stratification across tumor stages.

PyTorch, Transformers, Geneformer
View Publication →

Transformer-Based Tissue-of-Origin Classifier

Pre-trained transformer model for multi-class cancer tissue-of-origin prediction across diverse tumor types, including metastatic disease. Supports diagnostic precision and downstream treatment decisions.

PyTorch, Transformers, Scikit-Learn
View Publication →

Biomedical Knowledge Graph & RAG Pipeline

Graph-based learning and retrieval-augmented generation for knowledge extraction, search, and summarization across structured and unstructured biomedical data.

GCNs, GraphRAG, BigQuery

Clinical Decision Support Platform

Production ML platform deployed on Vertex AI for personalized rheumatoid arthritis therapy. Integrated EMR, claims, genomic, and clinical trial data in a Medicare-covered AI product.

Vertex AI, CI/CD, Feature Store

TB Diagnosis and Disease Severity Monitoring

Applied ML research in GC-MS metabolomics for tuberculosis diagnosis and disease severity monitoring through multi-institution clinical collaboration.

Statistical ML

Engagements