Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/180892
Title: | Framework to evaluate and test defences against hallucination in large language model | Authors: | Pan, Johnny Shi Han | Keywords: | Computer and Information Science | Issue Date: | 2024 | Publisher: | Nanyang Technological University | Source: | Pan, J. S. H. (2024). Framework to evaluate and test defences against hallucination in large language model. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/180892 | Abstract: | The recent advancement of AI, particularly the large language models (LLMs) has en- abled unprecedented capabilities in natural language processing (NLP) tasks, including things such as content generation, translation, and question answering (QA). However, just like any new technology, LLMs faced some challenges. One of the key issues with LLMs is what’s known as “hallucination.” This happens when the model produces information that is incorrect or made up but still sounds plausible. In this paper, the goal is to outline a framework to help identify and assess these hallucinations through curating new hallucination evaluation methods, datasets and evaluation metrics. The framework is adaptable and can be used with a variety of models and LLMs prod- uct. The main objective is to offer developers and engineers a consistent approach to identifying hallucinations in LLM-based applications before they are released. | URI: | https://hdl.handle.net/10356/180892 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Pan_Shi_Han_Johnny_FYP.pdf Restricted Access | 1.29 MB | Adobe PDF | View/Open |
Page view(s)
106
Updated on Mar 20, 2025
Download(s)
11
Updated on Mar 20, 2025
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.