Glenn Matlin
Siddharth
Anirudh JM
Aditya Shukla
Yahya Hassan
Sudheer Chava
December 1, 2025
Publication
A novel, high-difficulty benchmark designed to assess LM instruction-following capabilities for financial analysis tasks.
Published
December 1, 2025
Authors
Glenn Matlin, Siddharth, Anirudh JM, Aditya Shukla, Yahya Hassan, Sudheer Chava
Venue
GenAI Finance Workshop at the 39th Annual Conference on Neural Information Processing Systems (NeurIPS) 2025
This work introduces FIFE, a novel, high-difficulty benchmark designed to assess LM instruction-following capabilities for financial analysis tasks, and evaluates 53 models (proprietary, open-weight, open-source) in a zero-shot setting. FIFE provides a rigorous evaluation of how well language models can follow complex, domain-specific instructions in finance.
Continue exploring
Return to the publication archive or step back to the broader research agenda.