Hi, I'm Glenn Dalbey

AI Engineer & Data Scientist specializing in Data Analytics, multi-modal systems, and production ML deployments

Kaggle Bronze Medalist
93.8% Healthcare AI Accuracy
Live Production Systems
WGU Data Science Graduate
Kaggle Bronze Medal - NFL Big Data Bowl 2026
Kaggle Bronze Medal

NFL Big Data Bowl 2026 -- Top 8% of 1,134 teams

Live Production AI

Apollo Healthcare Connect -- Multi-modal AI triage system serving real patients with 93.8% accuracy and sub-second response times.

DistilBERTComputer VisionFlaskAWS
  • 5-model ensemble (NLP + CV)
  • 98% burn classification accuracy
  • Handles 29.7:1 class imbalance

About Glenn

Passionate AI engineer with a proven track record of building and deploying production machine learning systems that make a real-world impact.

My Journey

My journey into AI and data science began with a fascination for solving complex problems through technology. What started as curiosity about machine learning algorithms evolved into a passion for building systems that can genuinely improve people's lives.

The turning point came when I developed Apollo Healthcare Connect - a multi-modal AI system that combines natural language processing with computer vision to provide medical triage with 93.8% accuracy. Seeing this system develop into a tool that could help real patients find appropriate care reinforced my commitment to practical, impactful AI applications.

Today, I specialize in bridging the gap between cutting-edge AI research and real-world deployment, with particular expertise in data analytics, multi-modal systems, and production ML pipelines.

Education

Master of Science, Data Science and Data Analytics
Western Governors University
2025

Advanced graduate program focused on machine learning, statistical analysis, and practical AI applications. Completed advanced capstone projects in healthcare AI and sports analytics with production deployment.

Bachelor of Science, Data Analytics
Western Governors University
Associate of Science, IT Programming
Clinton Community College

Key Achievements

Kaggle Bronze Medal
NFL Big Data Bowl 2026 -- Top 8% of 1,134 teams
93.8% AI Accuracy
Live healthcare triage system serving real patients
Multi-modal AI
Combined text and image analysis expertise
25.4% Cost Reduction
Supply chain optimization through linear programming

By the Numbers

0+
Years Experience
0+
Projects Completed
0
Technologies Mastered
0
Production Systems

Featured Projects

Explore my portfolio of AI and data science projects, from healthcare applications to sports analytics and optimization solutions.

NFL_Big_Data_Bowl_2026

Kaggle Bronze Medal (Top 8% of 1,134 teams) - Deep learning solution for predicting NFL player trajectories from tracking data. Explored 15+ architectures across 847+ experiments with systematic hyperparameter optimization.

Python
Python
PyTorch
Spatial-Temporal Transformers
GRU/RNN
Multi-scale CNN
Geometric Attention
+5 more

missing-persons-outlier-detection

Multi-method statistical and ML pipeline analyzing 41,200 NamUs cases across 55 jurisdictions and 101 years. Applies 7 outlier detection methods, Isolation Forest and LOF ensemble, spatial autocorrelation (Moran's I), and ARIMA forecasting. Identified the I-35 trafficking corridor (+170% acceleration) and validated against known serial killers. Live Streamlit dashboard.

Python
Python
Streamlit
Plotly
Statistical Analysis
Geospatial Analysis
Time Series Forecasting
+2 more

Apollo_Healthcare_Connect

Production-deployed multi-modal AI healthcare triage system achieving 93.8% combined accuracy. Analyzes text symptoms (DistilBERT, 94%) and medical images (5-model CNN ensemble, 98% burn classification) across 8,085 images with 29.7:1 class imbalance handling. WGU MS Data Science Capstone. Live at apollohealthcareconnect.com.

HTML
HTML
Python
PyTorch
TensorFlow
DistilBERT
Flask
+3 more

RSNA_Intracranial_Aneurysm_Detection

3D deep learning solution for detecting intracranial aneurysms from CT angiography. Trained 105 models (21 architectures x 5 folds), tested 51 ensemble configurations achieving best AUC 0.8624. Key finding: smaller models significantly outperform larger ones on limited medical data (r=-0.42, p<0.01).

Python
Python
PyTorch
3D CNNs
SE-ResNet
DenseNet
EfficientNet
+4 more

OceanEterna
In Progress

Multi-month engineering effort building a high-performance local RAG system. Originally prototyped in Python but rewritten from scratch in C++17 for performance — evolved through 4 major versions with systematic optimization. Search engine core indexes 2.45 billion tokens across 5M+ chunks with 0-42ms search latency (avg 12ms, down from 500ms in v1) and 12-second cold startup (down from 41s). Runs entirely on CPU with minimal RAM. Dual LZ4/Zstd compression with auto-format detection. Conversations and queries continuously indexed with intelligent tagging; supports ingesting any file type to grow the corpus. 47 tests at near 100% accuracy, 15 REST API endpoints, zero per-query costs — no GPU, Docker, database, or cloud required. Next: LLM chat interface and MCP tool integration for terminal-based AI workflows and project knowledge management.

C++
C++17
Information Retrieval
RAG
Search Engine
LZ4/Zstd Compression
Memory-Mapped I/O
+3 more

OE-OS
In Progress

Distributed AI orchestration platform for a private multi-node GPU cluster. Features three-tier LLM routing (local Ollama to cheap API to Claude Opus) reducing costs by routing ~80% of requests to free local models, triple-layer RAG memory (BM25 over 5M+ chunks, ChromaDB semantic search, Redis session cache), 18 MCP-compatible tools, and a multi-agent sandbox where 4 LLM personas deliberate at zero API cost. 4,200+ lines of async Python on FastAPI.

Python
Python
FastAPI
ChromaDB
Redis
Ollama
Pydantic
+3 more

opportunity-intelligence

AI-powered market analysis assistant for senior living opportunity evaluation. Agentic 3-call LLM pipeline that pulls public data from Census Bureau and CMS Care Compare, runs it through a reusable analytics library of 22 statistical methods, then uses an LLM to direct the analysis and synthesize findings into an executive briefing with full citations. 14 analyses executed across two LLM-directed passes, 622-word briefing with source tags on every claim, total cost 6 cents.

Python
Python
OpenAI API
Pandas
scipy
scikit-learn
matplotlib
+4 more

Blue-Zones-Longevity-Analysis

Longitudinal statistical analysis of life expectancy trends across Blue Zone countries vs. 88-nation global baseline, 1960-2023. Formal hypothesis testing with bootstrap confidence intervals, GDP-controlled partial correlations, sigma and beta convergence testing, and COVID impact analysis. Live Streamlit dashboard.

Jupyter Notebook
Jupyter Notebook
Statistical Hypothesis Testing
Bootstrap
Partial Correlations
Convergence Analysis
Streamlit

Kaggle_House_Prices_Advanced_Regression

Comprehensive machine learning solution for Kaggle House Prices competition featuring advanced feature engineering, ensemble methods, and modern AutoML techniques

Jupyter Notebook
Python
Jupyter Notebook
XGBoost
LightGBM
CatBoost
TabPFN
+6 more

business-analytics-AI-platform

AI-powered Excel analytics platform for Thompson Parking & Mobility Consultants. Upload Excel files, get instant business insights, generate professional charts, and chat with your data using natural language queries.

TypeScript
TypeScript
Next.js
AI
Excel Processing
Business Intelligence
Natural Language Processing

TandemAI

Local-First LLM Ensemble Orchestration Platform. Let your AI models work together: relay, debate, verify. Support for Ollama, LM Studio, OpenAI, Anthropic, Groq and more.

TypeScript
TypeScript
React
Node.js
AI Orchestration
LLM Integration
Privacy-First

Ml-Model-Recommender

Automated ML model selection for tabular data. Analyzes datasets, compares preprocessing strategies, trains baselines, and provides actionable recommendations.

Jupyter Notebook
Jupyter Notebook
Python
Machine Learning
AutoML
Data Analysis
scikit-learn
+1 more

NFL_Rookie_WR_1K_Analysis

Capstone project for Bachelor's in Data Analytics - using previous years rookie statistics to determine future WR production. Achieved 90.9% ROC AUC.

Jupyter Notebook
Jupyter Notebook
Python
scikit-learn
XGBoost
Sports Analytics
Machine Learning
+1 more

Computer-Vision-Portfolio

CNN-based landmark classification using PyTorch with custom 4-block convolutional architecture, batch normalization, dropout regularization, and transfer learning comparison studies.

HTML
HTML
Computer Vision
CNNs
Deep Learning
PyTorch

YouTube_Video_Analyzer

GPU-accelerated multimodal AI for YouTube video analysis with transcription, visual frame analysis, and comprehensive summaries

Python
Python
FastAPI
Streamlit
PyTorch
Whisper
BLIP
+5 more
Projects automatically synced fromGitHub

Technical Skills

A comprehensive overview of my technical expertise across AI/ML, data science, and software development technologies.

Expert
Advanced
Intermediate

Programming Languages
9

JavaScript
Python
TypeScript
R
HTML/CSS
SQL
C#
C++
Java

AI/ML Frameworks
11

CatBoost
LightGBM
PyTorch
scikit-learn
TensorFlow
Transformers
XGBoost
GRU/RNN Architectures
OpenAI API
AutoGluon
TabPFN

Web Frameworks
5

Flask
React
FastAPI
Node.js
Streamlit

Cloud & Deployment
4

AWS
Weights & Biases
CI/CD
Docker

Data & Analytics
7

Data Visualization
pandas
Tableau
MongoDB
numpy
PostgreSQL
Statistical Analysis

AI Specializations
12

3D Medical Imaging
Computer Vision
Ensemble Learning
Feature Engineering
Healthcare AI
Multi-modal AI
Natural Language Processing
Spatial-Temporal Transformers
Sports Analytics
Trajectory Prediction
Geometric Deep Learning
Statistical Anomaly Detection
48
Total Skills
27
Featured
6
Categories

Resume & CV

Complete professional resume showcasing my experience in AI, data science, and machine learning.

One Page
Detailed

Glenn Dalbey

Data Science & Analytics Professional

319-233-4445
dalbeyglenn@gmail.com
LinkedIn ProfileGitHub Portfolio

Professional Summary

Data Scientist with an MS in Data Science who builds things that actually work. I enjoy building and training models on my own multi-GPU homelab, deploying new techniques, and discovering unique ways to solve problems. Whether it's analysis, research, or projections, I dig into how something ticks, figure out why, and extract information that drives better outcomes. My multi-node homelab stack lets me run production applications, train models, and run MLflow, all on my own hardware. 20+ projects on GitHub covering deep learning, medical imaging, NLP, computer vision, and analytics.

Core Technical Skills

Programming & Deep Learning

Python
PyTorch
TensorFlow
SQL
TypeScript

Neural Network Architectures

Spatial-Temporal Transformers
GRU/RNN
3D CNNs
Geometric Attention

ML & Infrastructure

scikit-learn
XGBoost
Multi-GPU Training
Mixed Precision
AWS

Specializations

Trajectory Prediction
3D Medical Imaging
Multi-modal AI
Ensemble Methods
Distributed AI Systems

Selected Projects

NFL Big Data Bowl 2026 - Kaggle Bronze Medal

GitHub

Player Trajectory Prediction | Bronze Medal - Top 8% of 1,134 teams

  • Bronze Medal in prestigious Kaggle competition predicting NFL player trajectories
  • • 847+ experiments across 15+ architectures (ST Transformers, GRU, CNN, Perceiver IO)
  • • Engineered 167 features with Voronoi tessellation and geometric attention
PyTorch
Transformers
GRU
Multi-GPU
TTA

RSNA Intracranial Aneurysm Detection

3D Medical Imaging | 105 Models Trained

  • • Trained 105 deep learning models (21 architectures × 5 folds) for CT angiography
  • • Best ensemble AUC 0.8624; discovered smaller models outperform larger on limited data
  • • Complete pipeline: DICOM→NIfTI→ROI→Training→Ensemble on 4 GPUs
PyTorch
3D CNNs
SE-ResNet
nibabel
Multi-GPU

Apollo Healthcare Connect

Live System

Production Multi-modal AI Healthcare Triage | MS Capstone

  • • Live production system achieving 93.8% accuracy with sub-second response
  • • 5-model ensemble combining DistilBERT and CNNs; handled 29.7:1 class imbalance
PyTorch
TensorFlow
Flask
DistilBERT
AWS S3

Education

Master of Science in Data Science

August 2025

Western Governors University

Capstone: Multi-modal AI healthcare triage system (production deployed)

Bachelor of Science in Data Analytics

Western Governors University

Capstone: NFL Rookie Wide Receiver Performance Prediction Model

Key Accomplishments

  • Kaggle Bronze Medal - NFL Big Data Bowl 2026 (Top 8% of 1,134 teams)
  • 847+ deep learning experiments across 15+ neural network architectures
  • Trained 105 3D medical imaging models achieving AUC 0.8624 ensemble
  • Production healthcare AI achieving 93.8% accuracy with sub-second response
  • Analyzed 41,200 cases identifying trafficking patterns at up to 46.86σ significance
  • Published 20+ open-source projects on GitHub

Get In Touch

Ready to collaborate on your next AI project? Let's discuss how my expertise in machine learning and data science can help bring your ideas to life.

Let's Connect

I'm always interested in discussing new opportunities, whether you're looking for an AI consultant, data scientist, or full-time team member. Feel free to reach out for:

  • AI & Machine Learning Consulting
  • Healthcare AI Solutions
  • Data Science Projects
  • Full-time Opportunities
  • Technical Collaboration

Connect on Social

Quick Response

I typically respond to all inquiries within 24 hours.

Send a Message

Your message will be securely stored and I'll respond within 24 hours.