Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184392
Title: Towards trustworthy and reliable language models
Authors: Zhao, Ruochen
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Zhao, R. (2025). Towards trustworthy and reliable language models. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184392
Abstract: This thesis addresses the critical challenge of developing trustworthy and reliable Natural Language Processing (NLP) systems, specifically the newly emerged Large Language Models (LLMs). As LLMs become increasingly prevalent in various domains, the need for transparent, interpretable, and controllable AI systems has never been more pressing. However, the complexity of LLMs, the compositional nature of language, and the potential for hallucinations pose significant obstacles to achieving these goals. To increase user trust of AI systems in real-life deployment, we hope to enhance the trustworthiness and reliability of LLMs without requiring model revisions or compromising performance. Motivated by this overarching goal, we delve into two main goals that enhance trustworthiness, providing user-friendly explanations of the LLM’s decisions and controlling the LLM’s behaviors. Specifically, we raise three main research questions: How can we disentangle the true reasons behind LLM decisions from the complex architecture and vast number of parameters? How can we provide user-friendly explanations for LLM generations? How can we increase LLM controllability with minimal interventions?
URI: https://hdl.handle.net/10356/184392
DOI: 10.32657/10356/184392
Schools: College of Computing and Data Science 
Rights: This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Theses

Files in This Item:
File Description SizeFormat 
Amended Thesis.pdf5.69 MBAdobe PDFView/Open

Page view(s)

125
Updated on May 7, 2025

Download(s) 50

55
Updated on May 7, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.