KBLaM: Knowledge Base augmented Language Model

Wang, Xi; Isazawa, Taketomo; Mikaelyan, Liana; Hensman, James

arXiv:2410.10450 (cs)

[Submitted on 14 Oct 2024 (v1), last revised 9 Feb 2025 (this version, v2)]

Abstract: In this paper, we propose Knowledge Base augmented Language Model (KBLaM), a new method for augmenting Large Language Models (LLMs) with external knowledge. KBLaM works with a knowledge base (KB) constructed from a corpus of documents. It transforms each piece of knowledge in the KB into a continuous key-value vector pair via pre-trained sentence encoders with linear adapters, and integrates these pairs into a pre-trained LLM via a specialized rectangular attention mechanism. Unlike Retrieval-Augmented Generation, KBLaM eliminates external retrieval modules, and unlike in-context learning, its computational overhead scales linearly with KB size rather than quadratically. Our approach enables integrating a large KB of more than 10K triples into an 8B pre-trained LLM with only an 8K context window on a single A100 80GB GPU, and allows for dynamic updates without model fine-tuning or retraining. Experiments demonstrate KBLaM's effectiveness in various tasks, including question-answering and open-ended reasoning, while providing interpretable insights into its use of the augmented knowledge. Code and datasets are available at this https URL
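
The linear-scaling claim can be illustrated with a small sketch. The abstract does not spell out the rectangular attention mechanism, so the code below rests on an assumption: each of the N prompt tokens attends to all M KB key-value pairs plus the prompt tokens up to its own position, while KB entries do not attend to one another. Under that assumption the score matrix is N x (M + N), which grows linearly in M. The function names (rectangular_attention, score_entries) and the toy dimensions are hypothetical, and this is not the authors' implementation.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def rectangular_attention(queries, kb_keys, kb_values, prompt_keys, prompt_values):
    """Assumed form of rectangular attention (not taken from the paper's code).

    Prompt query i sees every KB entry and prompt positions 0..i (causal).
    Returns (outputs, weights); weights has N rows of length M + N, with
    zeros at the masked future prompt positions.
    """
    n = len(queries)
    scale = 1.0 / math.sqrt(len(queries[0]))
    outputs, weights = [], []
    for i, q in enumerate(queries):
        keys = kb_keys + prompt_keys[: i + 1]
        values = kb_values + prompt_values[: i + 1]
        w = softmax([dot(q, k) * scale for k in keys])
        out = [sum(wj * v[c] for wj, v in zip(w, values))
               for c in range(len(values[0]))]
        outputs.append(out)
        weights.append(w + [0.0] * (n - i - 1))  # pad masked positions
    return outputs, weights


def score_entries(m, n):
    """Count attention-score entries for M KB entries and N prompt tokens.

    Returns (rectangular, in_context). The in-context figure assumes, very
    optimistically, one token per KB entry and full self-attention.
    """
    return n * (m + n), (m + n) ** 2


def random_vectors(count, dim):
    return [[random.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(count)]


# Toy example: 5 KB entries, 3 prompt tokens, 4-dimensional vectors.
random.seed(0)
D, M, N = 4, 5, 3
outputs, weights = rectangular_attention(
    random_vectors(N, D),   # prompt queries
    random_vectors(M, D),   # KB keys (would come from encoder + adapter)
    random_vectors(M, D),   # KB values
    random_vectors(N, D),   # prompt keys
    random_vectors(N, D),   # prompt values
)
```

Calling score_entries with a fixed N and a growing M shows the first count growing in proportion to M and the second in proportion to M squared, which is the contrast the abstract draws with in-context learning.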

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2410.10450 [cs.AI]
	(or arXiv:2410.10450v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2410.10450

Submission history

From: Xi Wang
[v1] Mon, 14 Oct 2024 12:45:10 UTC (1,079 KB)
[v2] Sun, 9 Feb 2025 04:45:43 UTC (1,076 KB)