MedCompress: compression-as-clinical-context
Building MedCompress, a tool for collapsing long medical documents into structured, faithful summaries usable by downstream NLP pipelines and clinical decision-support workflows.
Applied ML · Computer Vision · NLP · Health
Systems Administrator, Denison University · Applied ML Researcher
I work on small, careful machine learning systems — image captioning, clinical text compression, climate pattern classification — and on the surrounding craft of making them reproducible, evaluable, and useful. My projects sit at the seam between modern deep learning and the operational reality of getting models into the hands of people who need them.
Recent
Building MedCompress, a tool for collapsing long medical documents into structured, faithful summaries usable by downstream NLP pipelines and clinical decision-support workflows.
Unsupervised regional clustering of atmospheric data followed by supervised classification of climate-impacted regions, evaluated with F1 and confusion matrices on global maps.
BSc in Computer Science, GPA 3.87, Phi Beta Kappa. Presented Image Caption Generator (CNN + LSTM, CUDA-tuned) at the 47th Annual Beloit Student Symposium.
Comparative study of recurrent and convolutional models on the UCI Heart Failure Clinical Records dataset, with K-Means and SOM clustering used to surface patient groupings.
Focus
Where my IT work and ML work talk to each other.
Featured
End-to-end image captioning pipeline using VGG16 as a CNN encoder paired with an LSTM decoder, trained on Flickr 30k. Optimized CUDA kernel configuration for parallel batch processing, evaluated with BLEU, presented at the 47th Annual Beloit Student Symposium.
A tool for collapsing long medical documents into compact, structured summaries that preserve the clinical signal needed by downstream pipelines and decision support. Built to slot into existing EHR-adjacent workflows rather than replace them.
Trained Self-Organizing Maps on surface temperature, precipitation, and pressure readings to cluster geographic regions, then layered ANN and CNN classifiers to identify areas with the strongest climate change signal. Reported with confusion matrices and regional visualizations on global maps.
Binary classification on the UCI Heart Failure Clinical Records dataset, predicting mortality risk from clinical features. K-Means and SOM clustering surfaced natural patient groupings before supervised training. RNN and CNN performance compared with confusion matrices, precision, recall, and AUC.
Always happy to talk about applied ML, computer vision, sequence models, or ML for health and climate.
Say hello