Indonesian Journal of Electrical Engineering and Computer Science
Vol 28, No 3: December 2022

Topic modelling of legal documents using NLP and bidirectional encoder representations from transformers

Amar Jeet Rawat (Uttaranchal University)
Sunil Ghildiyal (Uttaranchal University)
Anil Kumar Dixit (Uttaranchal University)



Article Info

Publish Date
01 Dec 2022

Abstract

Modeling legal text is a difficult task because of its unique features, such as lengthy texts, complex language structures, and technical terms. During the last decade, there has been a big rise in the number of legislative documents, which makes it hard for law professionals to keep up with legislation like analyzing judgements and implementing acts. The relevancy of topics is heavily influenced by the processing and presentation of legal documents in some contexts. The objective of this work is to understand the legal judgement corpus related to cases under the Hindu Marriage Act of India. The study looked into various methods to generate sentence embeddings from the judgement. This paper employs the power of the BERTopic algorithm for generating significant topics.

Copyrights © 2022