Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/180368
Title: Explore the fine-tuning of pre-trained LLM models to improve the quality of abstracts for long patent descriptions
Authors: Li, Yunong
Keywords: Computer and Information Science
Engineering
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Li, Y. (2024). Explore the fine-tuning of pre-trained LLM models to improve the quality of abstracts for long patent descriptions. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/180368
Project: ISM-DISS-04202 
Abstract: Generating abstracts for long patent descriptions is a challenging task. Many patent descriptions exceed 10,000 words in length, while the maximum token limit for most standard text summary models is typically only 512, 2048, or 4096 tokens. As a result, patent descriptions often surpass the token limits of these models. This project focuses on fine-tuning large language models (LLM) to effectively summarize lengthy patent descriptions with high quality. It focuses on overcoming the challenges of generating concise abstracts for lengthy patent descriptions by selecting and adapting suitable models for extended text processing. It investigates the performance of Qwen, InterLM, LLaMA, and RWKV models. Through fine-tuning and comparison, the project successfully demonstrated that effective long-text summarization can be achieved. Techniques such as LoRA and position encoding extrapolation were employed to address hardware limitations (using a single graphics card). Additionally, enhancements to the models, including improvements in the attention mechanism, have made them better suited for handling long text tasks.
URI: https://hdl.handle.net/10356/180368
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
Li Yunong-Dissertation.pdf
  Restricted Access
2.03 MBAdobe PDFView/Open

Page view(s)

146
Updated on Mar 22, 2025

Download(s)

12
Updated on Mar 22, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.