Hi, I'm Glenn Dalbey

AI Engineer & Data Scientist specializing in Data Analytics, multi-modal systems, and production ML deployments

Kaggle Bronze Medalist

93.8% Healthcare AI Accuracy

Live Production Systems

WGU Data Science Graduate

Kaggle Bronze Medal

NFL Big Data Bowl 2026 -- Top 8% of 1,134 teams

Live Production AI

Apollo Healthcare Connect -- Multi-modal AI triage system serving real patients with 93.8% accuracy and sub-second response times.

DistilBERTComputer VisionFlaskAWS

5-model ensemble (NLP + CV)
98% burn classification accuracy
Handles 29.7:1 class imbalance

About Glenn

Passionate AI engineer with a proven track record of building and deploying production machine learning systems that make a real-world impact.

My Journey

My journey into AI and data science began with a fascination for solving complex problems through technology. What started as curiosity about machine learning algorithms evolved into a passion for building systems that can genuinely improve people's lives.

The turning point came when I developed Apollo Healthcare Connect - a multi-modal AI system that combines natural language processing with computer vision to provide medical triage with 93.8% accuracy. Seeing this system develop into a tool that could help real patients find appropriate care reinforced my commitment to practical, impactful AI applications.

Today, I specialize in bridging the gap between cutting-edge AI research and real-world deployment, with particular expertise in data analytics, multi-modal systems, and production ML pipelines.

Education

Master of Science, Data Science and Data Analytics

Western Governors University

2025

Advanced graduate program focused on machine learning, statistical analysis, and practical AI applications. Completed advanced capstone projects in healthcare AI and sports analytics with production deployment.

Bachelor of Science, Data Analytics

Western Governors University

Associate of Science, IT Programming

Clinton Community College

Key Achievements

Kaggle Bronze Medal

NFL Big Data Bowl 2026 -- Top 8% of 1,134 teams

93.8% AI Accuracy

Live healthcare triage system serving real patients

Multi-modal AI

Combined text and image analysis expertise

25.4% Cost Reduction

Supply chain optimization through linear programming

By the Numbers

Years Experience

Projects Completed

Technologies Mastered

Production Systems

Connect & Explore

Featured Projects

Explore my portfolio of AI and data science projects, from healthcare applications to sports analytics and optimization solutions.

NFL_Big_Data_Bowl_2026

Kaggle Bronze Medal (Top 8% of 1,134 teams) - Deep learning solution for predicting NFL player trajectories from tracking data. Explored 15+ architectures across 847+ experiments with systematic hyperparameter optimization.

Python

PyTorch

Spatial-Temporal Transformers

GRU/RNN

Multi-scale CNN

Geometric Attention

+5 more

missing-persons-outlier-detection

Multi-method statistical and ML pipeline analyzing 41,200 NamUs cases across 55 jurisdictions and 101 years. Applies 7 outlier detection methods, Isolation Forest and LOF ensemble, spatial autocorrelation (Moran's I), and ARIMA forecasting. Identified the I-35 trafficking corridor (+170% acceleration) and validated against known serial killers. Live Streamlit dashboard.

Python

Streamlit

Plotly

Statistical Analysis

Geospatial Analysis

Time Series Forecasting

+2 more

Apollo_Healthcare_Connect

Production-deployed multi-modal AI healthcare triage system achieving 93.8% combined accuracy. Analyzes text symptoms (DistilBERT, 94%) and medical images (5-model CNN ensemble, 98% burn classification) across 8,085 images with 29.7:1 class imbalance handling. WGU MS Data Science Capstone. Live at apollohealthcareconnect.com.

HTML

Python

PyTorch

TensorFlow

DistilBERT

Flask

+3 more

RSNA_Intracranial_Aneurysm_Detection

3D deep learning solution for detecting intracranial aneurysms from CT angiography. Trained 105 models (21 architectures x 5 folds), tested 51 ensemble configurations achieving best AUC 0.8624. Key finding: smaller models significantly outperform larger ones on limited medical data (r=-0.42, p<0.01).

Python

PyTorch

3D CNNs

SE-ResNet

DenseNet

EfficientNet

+4 more

OceanEterna
In Progress

Multi-month engineering effort building a high-performance local RAG system. Originally prototyped in Python but rewritten from scratch in C++17 for performance — evolved through 4 major versions with systematic optimization. Search engine core indexes 2.45 billion tokens across 5M+ chunks with 0-42ms search latency (avg 12ms, down from 500ms in v1) and 12-second cold startup (down from 41s). Runs entirely on CPU with minimal RAM. Dual LZ4/Zstd compression with auto-format detection. Conversations and queries continuously indexed with intelligent tagging; supports ingesting any file type to grow the corpus. 47 tests at near 100% accuracy, 15 REST API endpoints, zero per-query costs — no GPU, Docker, database, or cloud required. Next: LLM chat interface and MCP tool integration for terminal-based AI workflows and project knowledge management.

C++

C++17

Information Retrieval

RAG

Search Engine

LZ4/Zstd Compression

Memory-Mapped I/O

+3 more

OE-OS
In Progress

Distributed AI orchestration platform for a private multi-node GPU cluster. Features three-tier LLM routing (local Ollama to cheap API to Claude Opus) reducing costs by routing ~80% of requests to free local models, triple-layer RAG memory (BM25 over 5M+ chunks, ChromaDB semantic search, Redis session cache), 18 MCP-compatible tools, and a multi-agent sandbox where 4 LLM personas deliberate at zero API cost. 4,200+ lines of async Python on FastAPI.

Python

FastAPI

ChromaDB

Redis

Ollama

Pydantic

+3 more

opportunity-intelligence

AI-powered market analysis assistant for senior living opportunity evaluation. Agentic 3-call LLM pipeline that pulls public data from Census Bureau and CMS Care Compare, runs it through a reusable analytics library of 22 statistical methods, then uses an LLM to direct the analysis and synthesize findings into an executive briefing with full citations. 14 analyses executed across two LLM-directed passes, 622-word briefing with source tags on every claim, total cost 6 cents.

Python

OpenAI API

Pandas

scipy

scikit-learn

matplotlib

+4 more

Blue-Zones-Longevity-Analysis

Longitudinal statistical analysis of life expectancy trends across Blue Zone countries vs. 88-nation global baseline, 1960-2023. Formal hypothesis testing with bootstrap confidence intervals, GDP-controlled partial correlations, sigma and beta convergence testing, and COVID impact analysis. Live Streamlit dashboard.

Jupyter Notebook

Statistical Hypothesis Testing

Bootstrap

Partial Correlations

Convergence Analysis

Streamlit

Kaggle_House_Prices_Advanced_Regression

Comprehensive machine learning solution for Kaggle House Prices competition featuring advanced feature engineering, ensemble methods, and modern AutoML techniques

Jupyter Notebook

Python

Jupyter Notebook

XGBoost

LightGBM

CatBoost

TabPFN

+6 more

business-analytics-AI-platform

AI-powered Excel analytics platform for Thompson Parking & Mobility Consultants. Upload Excel files, get instant business insights, generate professional charts, and chat with your data using natural language queries.

TypeScript

Next.js

Excel Processing

Business Intelligence

Natural Language Processing

TandemAI

Local-First LLM Ensemble Orchestration Platform. Let your AI models work together: relay, debate, verify. Support for Ollama, LM Studio, OpenAI, Anthropic, Groq and more.

TypeScript

React

Node.js

AI Orchestration

LLM Integration

Privacy-First

Ml-Model-Recommender

Automated ML model selection for tabular data. Analyzes datasets, compares preprocessing strategies, trains baselines, and provides actionable recommendations.

Jupyter Notebook

Python

Machine Learning

AutoML

Data Analysis

scikit-learn

+1 more

NFL_Rookie_WR_1K_Analysis

Capstone project for Bachelor's in Data Analytics - using previous years rookie statistics to determine future WR production. Achieved 90.9% ROC AUC.

Jupyter Notebook

Python

scikit-learn

XGBoost

Sports Analytics

Machine Learning

+1 more

Computer-Vision-Portfolio

CNN-based landmark classification using PyTorch with custom 4-block convolutional architecture, batch normalization, dropout regularization, and transfer learning comparison studies.

HTML

Computer Vision

CNNs

Deep Learning

PyTorch

YouTube_Video_Analyzer

GPU-accelerated multimodal AI for YouTube video analysis with transcription, visual frame analysis, and comprehensive summaries

Python

FastAPI

Streamlit

PyTorch

Whisper

BLIP

+5 more

Projects automatically synced fromGitHub

Technical Skills

A comprehensive overview of my technical expertise across AI/ML, data science, and software development technologies.

Expert

Advanced

Intermediate

Programming Languages
9

JavaScript

Python

TypeScript

HTML/CSS

SQL

C++

Java

AI/ML Frameworks
11

CatBoost

LightGBM

PyTorch

scikit-learn

TensorFlow

Transformers

XGBoost

GRU/RNN Architectures

OpenAI API

AutoGluon

TabPFN

Web Frameworks
5

Flask

React

FastAPI

Node.js

Streamlit

Cloud & Deployment
4

AWS

Weights & Biases

CI/CD

Docker

Data & Analytics
7

Data Visualization

pandas

Tableau

MongoDB

numpy

PostgreSQL

Statistical Analysis

AI Specializations
12

3D Medical Imaging

Computer Vision

Ensemble Learning

Feature Engineering

Healthcare AI

Multi-modal AI

Natural Language Processing

Spatial-Temporal Transformers

Sports Analytics

Trajectory Prediction

Geometric Deep Learning

Statistical Anomaly Detection

Total Skills

Featured

Resume & CV

Complete professional resume showcasing my experience in AI, data science, and machine learning.

One Page

Detailed

Glenn Dalbey

Data Science & Analytics Professional

319-233-4445

dalbeyglenn@gmail.com

LinkedIn Profile GitHub Portfolio

Professional Summary

Data Scientist with an MS in Data Science who builds things that actually work. I enjoy building and training models on my own multi-GPU homelab, deploying new techniques, and discovering unique ways to solve problems. Whether it's analysis, research, or projections, I dig into how something ticks, figure out why, and extract information that drives better outcomes. My multi-node homelab stack lets me run production applications, train models, and run MLflow, all on my own hardware. 20+ projects on GitHub covering deep learning, medical imaging, NLP, computer vision, and analytics.

Core Technical Skills

Programming & Deep Learning

Python

PyTorch

TensorFlow

SQL

TypeScript

Neural Network Architectures

Spatial-Temporal Transformers

GRU/RNN

3D CNNs

Geometric Attention

ML & Infrastructure

scikit-learn

XGBoost

Multi-GPU Training

Mixed Precision

AWS

Specializations

Trajectory Prediction

3D Medical Imaging

Multi-modal AI

Ensemble Methods

Distributed AI Systems

Selected Projects

NFL Big Data Bowl 2026 - Kaggle Bronze Medal

GitHub

Player Trajectory Prediction | Bronze Medal - Top 8% of 1,134 teams

• Bronze Medal in prestigious Kaggle competition predicting NFL player trajectories
• 847+ experiments across 15+ architectures (ST Transformers, GRU, CNN, Perceiver IO)
• Engineered 167 features with Voronoi tessellation and geometric attention

PyTorch

Transformers

GRU

Multi-GPU

TTA

RSNA Intracranial Aneurysm Detection

3D Medical Imaging | 105 Models Trained

• Trained 105 deep learning models (21 architectures × 5 folds) for CT angiography
• Best ensemble AUC 0.8624; discovered smaller models outperform larger on limited data
• Complete pipeline: DICOM→NIfTI→ROI→Training→Ensemble on 4 GPUs

PyTorch

3D CNNs

SE-ResNet

nibabel

Multi-GPU

Apollo Healthcare Connect

Live System

Production Multi-modal AI Healthcare Triage | MS Capstone

• Live production system achieving 93.8% accuracy with sub-second response
• 5-model ensemble combining DistilBERT and CNNs; handled 29.7:1 class imbalance

PyTorch

TensorFlow

Flask

DistilBERT

AWS S3

Education

Master of Science in Data Science

August 2025

Western Governors University

Capstone: Multi-modal AI healthcare triage system (production deployed)

Bachelor of Science in Data Analytics

Western Governors University

Capstone: NFL Rookie Wide Receiver Performance Prediction Model

Key Accomplishments

Kaggle Bronze Medal - NFL Big Data Bowl 2026 (Top 8% of 1,134 teams)
847+ deep learning experiments across 15+ neural network architectures
Trained 105 3D medical imaging models achieving AUC 0.8624 ensemble

Production healthcare AI achieving 93.8% accuracy with sub-second response
Analyzed 41,200 cases identifying trafficking patterns at up to 46.86σ significance
Published 20+ open-source projects on GitHub

Get In Touch

Ready to collaborate on your next AI project? Let's discuss how my expertise in machine learning and data science can help bring your ideas to life.

Let's Connect

I'm always interested in discussing new opportunities, whether you're looking for an AI consultant, data scientist, or full-time team member. Feel free to reach out for:

AI & Machine Learning Consulting
Healthcare AI Solutions
Data Science Projects
Full-time Opportunities
Technical Collaboration

Connect on Social

GitHub LinkedIn

Quick Response

I typically respond to all inquiries within 24 hours.

Hi, I'm Glenn Dalbey

Live Production AI

About Glenn

My Journey

Education

Key Achievements

By the Numbers

Connect & Explore

Featured Projects

NFL_Big_Data_Bowl_2026

missing-persons-outlier-detection

Apollo_Healthcare_Connect

RSNA_Intracranial_Aneurysm_Detection

OceanEternaIn Progress

OE-OSIn Progress

opportunity-intelligence

Blue-Zones-Longevity-Analysis

Kaggle_House_Prices_Advanced_Regression

business-analytics-AI-platform

TandemAI

Ml-Model-Recommender

NFL_Rookie_WR_1K_Analysis

Computer-Vision-Portfolio

YouTube_Video_Analyzer

Technical Skills

Programming Languages9

AI/ML Frameworks11

Web Frameworks5

Cloud & Deployment4

Data & Analytics7

AI Specializations12

Resume & CV

Glenn Dalbey

Professional Summary

Core Technical Skills

Programming & Deep Learning

Neural Network Architectures

ML & Infrastructure

Specializations

Selected Projects

NFL Big Data Bowl 2026 - Kaggle Bronze Medal

RSNA Intracranial Aneurysm Detection

Apollo Healthcare Connect

Education

Master of Science in Data Science

Bachelor of Science in Data Analytics

Key Accomplishments

Get In Touch

Let's Connect

Connect on Social

Send a Message

OceanEterna
In Progress

OE-OS
In Progress

Programming Languages
9

AI/ML Frameworks
11

Web Frameworks
5

Cloud & Deployment
4

Data & Analytics
7

AI Specializations
12