Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/181069
Title: Bias in the age of generative AI: a deep dive into autoregressive model fairness
Authors: Ng, Darren Joon Kai
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Ng, D. J. K. (2024). Bias in the age of generative AI: a deep dive into autoregressive model fairness. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181069
Abstract: This study presents a comprehensive evaluation of biases in prominent autoregressive language models, including GPT-2, Llama-7B, and Mistral-7B. The research systematically assesses the models' performance across multiple dimensions of bias, including toxicity, gender, race, religion, and LGBTQIA+ identities. To evaluate toxicity, the study employs the RealToxicityPrompts dataset and the "roberta-hate-speech-dynabench-r4" model. Gender, racial and religious biases are examined using the BOLD dataset and the REGARD metric, while the HONEST benchmark is leveraged to assess biases against LGBTQIA+ identities. Notably, the research explores the effectiveness of structured prompts, particularly zero-shot Chain-of-Thought (CoT)-based implication prompting, as a debiasing technique. The results demonstrate the potential of this approach to mitigate biases across various domains, with Llama-7B exhibiting the most consistent and substantial improvements. However, the study also highlights the challenges in effectively debiasing LGBTQIA+ biases, underscoring the need for more targeted and specialised techniques in this area. Overall, this work provides a comprehensive understanding of the biases present in contemporary autoregressive language models and offers insights into effective strategies for bias mitigation, paving the way for the development of more equitable and inclusive AI systems.
URI: https://hdl.handle.net/10356/181069
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
DarrenNgJoonKai_FYP_Report.pdf
  Restricted Access
3.06 MBAdobe PDFView/Open

Page view(s)

65
Updated on Dec 10, 2024

Download(s)

9
Updated on Dec 10, 2024

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.