Advancements in Financial Natural Language Processing

The field of financial natural language processing is rapidly evolving, with a focus on developing innovative methods for extracting insights from financial texts. Recent research has centered on improving the accuracy of financial event entity extraction, question answering, and information retrieval. The use of large language models (LLMs) and retrieval-augmented generation (RAG) systems has shown significant promise in addressing the challenges posed by the complexity and nuance of financial language. Notable developments include the proposal of new datasets, such as FinNLI and FinDER, which provide benchmarks for evaluating the performance of LLMs and RAG systems in financial natural language inference and question answering tasks. Additionally, the introduction of novel models, including FinSage and JurisCTC, has demonstrated improved performance in financial question answering and legal judgment prediction tasks.

Some noteworthy papers include: FinSage, which proposes a multi-aspect RAG system for financial filings question answering, achieving an impressive recall of 92.51% on expert-curated questions. JurisCTC, which employs cross-domain transfer and contrastive learning to improve legal judgment prediction, achieving peak accuracies of 76.59% and 78.83%.

Sources

BASIR: Budget-Assisted Sectoral Impact Ranking -- A Dataset for Sector Identification and Performance Prediction Using Language Models

KFinEval-Pilot: A Comprehensive Benchmark Suite for Korean Financial Language Understanding

Template-Based Financial Report Generation in Agentic and Decomposed Information Retrieval

FinSage: A Multi-aspect RAG System for Financial Filings Question Answering

Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance

Efficient Document Retrieval with G-Retriever

FinTextSim: Enhancing Financial Text Analysis with BERTopic

FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation

LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval

FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking

Transformer-Based Extraction of Statutory Definitions from the U.S. Code

JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning

Built with on top of