LLM + RL for agents
Training and evaluating agents that must act, not just answer, by bridging language understanding with sequential decision-making.
policy optimization for language agents • reward design • long-horizon tasks • evaluation
The Ohio State University · Computer Science
PhD student in AI
Research at the boundary of large language models (LLMs) and reinforcement learning (RL), with an emphasis on building systems that reliably reason, decide, and improve with experience.
I work on methods that connect language-based reasoning with decision-making and learning from interaction, aiming to make LLM-driven agents more robust, controllable, and effective on real-world tasks.
Advisor: Andrew Perrault
LLMs · RL · agents · alignment/safety · optimization
I am a PhD student in AI working at the interface of LLMs and RL. My research explores two directions: applying RL-style feedback and objectives to shape LLM behavior, and using LLMs as components inside RL agents for reasoning, planning, tool use, and action selection.
The central theme is turning language models into reliable decision-makers, not just text generators, through clear objectives, rigorous empirical evaluation, and reproducible system design.
Methods for making improvements measurable and verifiable with explicit objectives, robust benchmarks, and careful feedback design.
process supervision • calibration • objective alignment • robust scoring
Building reproducible training and evaluation pipelines that bring practical rigor to large-scale experimentation.
distributed runs • reproducibility • toolchain reliability • experiment ops
Interactive optimization arena and benchmark environment for studying optimizer behavior.
Framework for structuring language-agent reasoning and evaluating reliability across staged decomposition pipelines.
Standalone benchmarking and average@k-style analyses for comparing model reasoning behavior under controlled settings.