Evidence-grounded NLP and LLMs

DTALab develops NLP and LLM-based systems whose outputs are grounded in explicit evidence. The goal is to move beyond answer generation and black-box prediction by making language-based AI systems traceable, inspectable and auditable.

This research line focuses on methods for claim verification, question answering, retrieval, report analysis and evidence-grounded decision support. In fact-checking and verification tasks, the aim is not only to classify a claim as supported or refuted, but also to identify which sources support the decision, which sources contradict it, and which information is irrelevant or insufficient.

A specific focus is on the use of Large Language Models for the integrated analysis of textual and tabular data. This includes Table Question Answering, Multi-table Question Answering, extraction of information from complex reports, and evaluation of how models use evidence distributed across documents, tables and semi-structured sources.

The research includes methods and tools for:

claim verification over textual, tabular and structured evidence;
evidence retrieval and source attribution;
explainable fact-checking and evidence-level explanations;
RAG and LLM pipelines whose outputs can be inspected and audited;
Table QA and Multi-table QA over complex reports;
evaluation of how models use external evidence;
analysis of unsupported, contradictory or insufficient evidence.

This theme supports applications in information integrity, compliance, education, document intelligence, ESG reporting, financial reporting and decision-support workflows where AI outputs must be justified and validated by domain experts.

Selected publications

Michele Luca Contalbo, Sara Pederzoli, Francesco Del Buono, Valeria Venturelli, Francesco Guerra, Matteo Paganelli. GRI-QA: a Comprehensive Benchmark for Table Question Answering over Environmental Data. Findings of ACL, 2025.
Marta Santacroce, Michele Luca Contalbo, Sara Pederzoli, Riccardo Benassi, Valeria Venturelli, Matteo Paganelli, Francesco Guerra. CLARIESG: An End-to-End System for ESG Analysis over Complex Tables in Corporate Reports. EACL System Demonstrations, 2026.
Chiara Magurno, Michele Luca Contalbo, Matteo Paganelli, Enrico Giliberti, Chiara Bertolini, Francesco Guerra. CLAIRE: A Controllable LLM Tutoring Framework for Reading Comprehension. Artificial Intelligence in Education, AIED 2026.