International Journal of Advances in Applied Sciences
Vol 13, No 1: March 2024

Improving the BERT model for long text sequences in question answering domain

Vijayan Ramaraj (Vellore Institute of Technology (VIT))
Mareeswari Venkatachala Appa Swamy (Vellore Institute of Technology (VIT))
Ephzibah Evan Prince (Vellore Institute of Technology (VIT))
Chandhan Kumar (Vellore Institute of Technology (VIT))



Article Info

Publish Date
01 Mar 2024

Abstract

The text-based question-answering (QA) system aims to answer natural language questions by querying the external knowledge base. It can be applied to real-world systems like medical documents, research papers, and crime-related documents. Using this system, users don't have to go through the documents manually the system will understand the knowledge base and find the answer based on the text and question given to the system. Earlier state-of-the-art natural language processing (NLP) was recurrent neural network (RNN) and long short-term memory (LSTM). As a result, these models are hard to parallelize and poor at retaining contextual relationships across long text inputs. Today, bidirectional encoder representations from transformers (BERT) are the contemporary algorithm for NLP. BERT is not capable of handling long text sequences; it can handle 512 tokens at a time which makes it difficult for long context. Smooth inverse frequency (SIF) and the BERT model will be incorporated together to solve this challenge. BERT trained on the Stanford question answering dataset (SQuAD) and SIF model demonstrates robustness and effectiveness on long text sequences from different domains. Experimental results suggest that the proposed approach is a promising solution for QA on long text sequences.

Copyrights © 2024






Journal Info

Abbrev

IJAAS

Publisher

Subject

Earth & Planetary Sciences Environmental Science Materials Science & Nanotechnology Mathematics Physics

Description

International Journal of Advances in Applied Sciences (IJAAS) is a peer-reviewed and open access journal dedicated to publish significant research findings in the field of applied and theoretical sciences. The journal is designed to serve researchers, developers, professionals, graduate students and ...