Chain-of-query prompting pipeline for improving small-scale language models in multi-hop open-domain question answering

Format

Thesis

Abstract

Large Language Models (LLMs) have been shown to exhibit robust performance in multi-hop open-domain question answering (ODQA), which is often attributed to their large number of parameters and extensive training. While smaller-scale language models (LMs) offer a more cost-effective option for real-world applications, they often struggle to maintain factual responses in multi-hop ODQA settings. In this thesis, we introduce a novel prompting approach, Chain-of-Query (CoQ), designed to enhance smaller-scale LMs by decomposing complex queries into context-based subqueries for robust ODQA in multi-hop settings. Our CoQ prompting approach creates an efficient pipeline that integrates with Retrieval-Augmented Generation (RAG) LMs, optimizing the retrieval process through multiple query generation and thereby supplying external knowledge to the LM with a small amount of context. We show that our CoQ approach can substantially boost the performance of small-scale LMs relative to state-of-the-art LLMs on metrics such as Exact Match (EM) and F1 score, making it a valuable advancement for complex QA tasks. Lastly, we discuss future research directions and extensions of our work to generative models, and conclude by discussing applications of our pipeline for improving the open-source availability of powerful small-scale language models.
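The pipeline described above can be sketched as follows. This is a minimal illustration of the decompose-retrieve-answer control flow only; the function names (`chain_of_query`, `decompose_fn`, `retrieve_fn`, `answer_fn`) and the toy stand-ins are hypothetical and do not represent the thesis's actual implementation or prompts.

```python
from typing import Callable, List

def chain_of_query(question: str,
                   decompose_fn: Callable[[str], List[str]],
                   retrieve_fn: Callable[[str], List[str]],
                   answer_fn: Callable[[str, List[str]], str]) -> str:
    """Decompose a multi-hop question into subqueries, retrieve a small
    context for each subquery, and answer using the accumulated evidence."""
    context: List[str] = []
    for subquery in decompose_fn(question):
        # Retrieve per subquery rather than once for the whole question,
        # keeping the total context passed to the LM small.
        context.extend(retrieve_fn(subquery))
    return answer_fn(question, context)

# Toy stand-ins to exercise the control flow (not real models or retrievers).
corpus = {
    "capital of France": "Paris is the capital of France.",
    "river through Paris": "The Seine flows through Paris.",
}

def toy_decompose(question: str) -> List[str]:
    # A real CoQ decomposer would prompt an LM to generate these subqueries.
    return ["capital of France", "river through Paris"]

def toy_retrieve(subquery: str) -> List[str]:
    return [corpus[subquery]] if subquery in corpus else []

def toy_answer(question: str, context: List[str]) -> str:
    # A real reader LM would generate the answer from the gathered context.
    return "The Seine" if any("Seine" in c for c in context) else "unknown"

print(chain_of_query("Which river flows through the capital of France?",
                     toy_decompose, toy_retrieve, toy_answer))
```

In a real RAG setting, `decompose_fn` and `answer_fn` would be calls to the small-scale LM and `retrieve_fn` would query a dense or sparse retriever; the sketch only shows how per-subquery retrieval accumulates a compact multi-hop context.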

Degree

M.S.
