
Education
Interests
I am a computational linguist at Amazon. I develop LLM evaluation infrastructure (both with and for LLMs), manage end-to-end prompt optimization pipelines, design model post-training recipes, and build tools for human-in-the-loop workflows.
Before transitioning to industry, I received my PhD in Linguistics from the University of Pennsylvania, using experimental, corpus, and computational approaches to study linguistic meaning, of various flavors. My dissertation investigated the learning problem of acquiring hierarchical noun meanings in early childhood, from the angles of semantics/pragmatics, distributional learning, and conceptual development.
I am also active in the R programming community as a mentor and open-source developer. I maintain and collaborate on several open-source software in statistical computing (jlmerclusterperm), data visualization (ggtrace), data quality assurance (pointblank), and interfaces to data APIs (openalexR). My work on graphics received an award from the American Statistical Association in 2023. I enjoy writing in my free time and have been maintaining a technical blog over the past 5+ years, covering topics including R language internals, software design principles, and practical tutorials for everyday data analysis.