Financial Instruction Following Evaluation (FIFE)
NLP
Finance
Evaluation
A novel, high-difficulty benchmark designed to assess LM instruction-following capabilities for financial analysis tasks.
Publication Details
arXiv: 2512.08965
Abstract
This work introduces FIFE, a novel, high-difficulty benchmark for assessing how well language models follow complex, domain-specific instructions in financial analysis tasks. We evaluate 53 models (proprietary, open-weight, and open-source) in a zero-shot setting, providing a rigorous picture of instruction-following capability in finance.
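To make the evaluation setup concrete, the sketch below shows one common way instruction-following benchmarks score responses: a set of programmatic verifiers, each checking compliance with a single instruction constraint. This is a hypothetical illustration; the constraint types, function names, and scoring rule here are assumptions, not FIFE's actual rubric.

```python
# Hypothetical rule-based instruction-following checks, in the spirit of
# benchmarks like FIFE (the real FIFE scoring rules are not reproduced here).
# Each verifier inspects a model response for one constraint.
import re

def follows_bullet_count(response: str, n: int) -> bool:
    """Check that the response contains exactly n bullet lines."""
    bullets = [line for line in response.splitlines()
               if line.strip().startswith(("-", "*"))]
    return len(bullets) == n

def mentions_required_term(response: str, term: str) -> bool:
    """Check that a required keyword (e.g. a ticker symbol) appears."""
    return re.search(re.escape(term), response, re.IGNORECASE) is not None

def score(response: str, checks) -> float:
    """Fraction of instruction constraints the response satisfies."""
    results = [check(response) for check in checks]
    return sum(results) / len(results)

response = "- Revenue rose 8%\n- Margins held at 21%\n- AAPL guidance unchanged"
checks = [
    lambda r: follows_bullet_count(r, 3),
    lambda r: mentions_required_term(r, "AAPL"),
]
print(score(response, checks))  # → 1.0
```

A zero-shot evaluation would run each model once per prompt with no in-context examples and aggregate these per-constraint scores across the benchmark.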