Glenn Matlin
  • Home
  • About
  • Research
  • Publications
  • Blog
  • CV

Publications

Papers, preprints, and project pages.

This is the full publication archive. For the broader research agenda and representative projects, visit the Research page.

Google Scholar Semantic Scholar

Trust by Design: Skill Profiles for Transparent, Cost-Aware LLM Routing

AI
LLM Systems
Interpretability

BELLA (Budget-Efficient LLM Selection via Automated skill-profiling), a framework that recommends optimal LLM selection for tasks through interpretable skill-based model selection.

Feb 2, 2026
Mika Okamoto, Ansel Kaplan Erol, Glenn Matlin

FinForge: Semi-Synthetic Financial Benchmark Generation

NLP
Finance
Benchmarks

A framework for semi-synthetic financial benchmark generation to enable rigorous and scalable evaluation of language models in finance.

Jan 11, 2026
Glenn Matlin, Akhil Theerthala, Avinash Gupta, JM Anirudh, Ricardo Castilla, Yee Man Ng, Sudheer Chava

Financial Instruction Following Evaluation (FIFE)

NLP
Finance
Evaluation

A novel, high-difficulty benchmark designed to assess LM instruction-following capabilities for financial analysis tasks.

Dec 1, 2025
Glenn Matlin, Siddharth , Anirudh JM, Aditya Shukla, Yahya Hassan, Sudheer Chava

Shall We Play a Game? Language Models for Open-ended Wargames

AI
Security
Wargaming

An exploration of language models for open-ended wargames, investigating AI systems’ ability to influence large-scale decisions with implications for safety and interpretability.

Sep 21, 2025
Glenn Matlin, Parv Mahajan, Isaac Song, Yixiong Hao, Ryan Bard, Stu Topp, Evan Montoya, M. Rehan Parwani, Soham Shetty, Mark Riedl

Do Language Models Agree with Human Perceptions of Suspense in Stories?

NLP
Computational Linguistics
Narrative

An investigation into whether language models can distinguish and estimate suspense in text sequences compared to human judgments.

Aug 13, 2025
Glenn Matlin, Devin Zhang, RB Loza, DM Popescu, J Isbell, C Chakraborty, Mark Riedl

Finance Language Model Evaluation (FLaME)

NLP
Finance
ACL

A comprehensive evaluation framework for studying language models against reasoning-reinforced LMs across 20+ core NLP tasks in finance.

Jun 18, 2025
Glenn Matlin, Mika Okamoto, Huzaifa Pardawala, Yang Yang, Sudheer Chava

UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification

Machine Learning
Systems
NeurIPS

A cost-aware and uncertainty-based framework for dynamic 2D prediction in multi-stage classification systems.

Oct 19, 2022
Yanbo Xu, Alind Khare, Glenn Matlin, Monish Ramadoss, Chao Zhang, Alexey Tumanov
No matching items

© 2025-2026 Glenn Matlin