Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/181069
Title: | Bias in the age of generative AI: a deep dive into autoregressive model fairness | Authors: | Ng, Darren Joon Kai | Keywords: | Computer and Information Science | Issue Date: | 2024 | Publisher: | Nanyang Technological University | Source: | Ng, D. J. K. (2024). Bias in the age of generative AI: a deep dive into autoregressive model fairness. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181069 | Abstract: | This study presents a comprehensive evaluation of biases in prominent autoregressive language models, including GPT-2, Llama-7B, and Mistral-7B. The research systematically assesses the models' performance across multiple dimensions of bias, including toxicity, gender, race, religion, and LGBTQIA+ identities. To evaluate toxicity, the study employs the RealToxicityPrompts dataset and the "roberta-hate-speech-dynabench-r4" model. Gender, racial and religious biases are examined using the BOLD dataset and the REGARD metric, while the HONEST benchmark is leveraged to assess biases against LGBTQIA+ identities. Notably, the research explores the effectiveness of structured prompts, particularly zero-shot Chain-of-Thought (CoT)-based implication prompting, as a debiasing technique. The results demonstrate the potential of this approach to mitigate biases across various domains, with Llama-7B exhibiting the most consistent and substantial improvements. However, the study also highlights the challenges in effectively debiasing LGBTQIA+ biases, underscoring the need for more targeted and specialised techniques in this area. Overall, this work provides a comprehensive understanding of the biases present in contemporary autoregressive language models and offers insights into effective strategies for bias mitigation, paving the way for the development of more equitable and inclusive AI systems. | URI: | https://hdl.handle.net/10356/181069 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
DarrenNgJoonKai_FYP_Report.pdf Restricted Access | 3.06 MB | Adobe PDF | View/Open |
Page view(s)
65
Updated on Dec 10, 2024
Download(s)
9
Updated on Dec 10, 2024
Google ScholarTM
Check
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.