Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

Akyürek, Afra Feyza; Akyürek, Ekin; Choshen, Leshem; Wijaya, Derry; Andreas, Jacob

Computer Science > Computation and Language

arXiv:2401.08574 (cs)

[Submitted on 16 Jan 2024 (v1), last revised 26 Jun 2024 (this version, v2)]

Title:Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

Authors:Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas

View PDF HTML (experimental)

Abstract:While language models (LMs) can sometimes generate factually correct text and estimate truth values of individual claims, these generally do not reflect a globally coherent, manipulable model of the world. As a consequence, current LMs also generate incorrect or nonsensical content, and are difficult to edit and bring up to date. We present a method called Deductive Closure Training (DCT) that uses LMs themselves to identify implications of (and contradictions within) the text that they generate, yielding an efficient self-supervised procedure for improving LM factuality. Given a collection of seed documents, DCT prompts LMs to generate additional text implied by these documents, reason globally about the correctness of this generated text, and finally fine-tune on text inferred to be correct. Given seed documents from a trusted source, DCT provides a tool for supervised model updating; if seed documents are sampled from the LM itself, DCT enables fully unsupervised fine-tuning for improved coherence and accuracy. Across the CREAK, MQUaKE, and Reversal Curse datasets, supervised DCT improves LM fact verification and text generation accuracy by 3-26%; on CREAK fully unsupervised DCT improves verification accuracy by 12%. These results show that LMs' reasoning capabilities during inference can be leveraged during training to improve their reliability.

Comments:	ACL Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.08574 [cs.CL]
	(or arXiv:2401.08574v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.08574

Submission history

From: Afra Feyza Akyürek [view email]
[v1] Tue, 16 Jan 2024 18:58:37 UTC (8,442 KB)
[v2] Wed, 26 Jun 2024 19:52:35 UTC (9,374 KB)

Computer Science > Computation and Language

Title:Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators