Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/156860
Title: Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach
Authors: Srinivasan, Niyathi
Keywords: Science::Biological sciences::Molecular biology
Science::Biological sciences::Botany::Plant physiology
Issue Date: 2022
Publisher: Nanyang Technological University
Source: Srinivasan, N. (2022). Investigating the improvement of transcriptome assembly via a consensus-based ensemble approach. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156860
Abstract: This paper investigates the development of a transcriptome assembly pipeline that provides as clean and accurate of a result as possible, for the plant Cycas edentata. De novo transcriptome assembly is a necessary process to understand the gene expression of non-model organisms that may not have a sequenced genome available. It can be quite difficult to obtain transcriptomes free of extraneous material such as splicing isoforms and partially transcribed sequences. Therefore, this paper aims to investigate various methods that may address this issue, by comparing the performance of single assemblers and merged assemblers, as well as the impact of varying k-mer size and introducing post-assembly modifications. Results show that the best pipeline for Cycas edentata involves the program SOAPdenovo-Trans at K35, followed by the program CD-HIT-Est, to obtain the cleanest and most accurate transcriptome. This pipeline may be used in future for other datasets as well.
URI: https://hdl.handle.net/10356/156860
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SBS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP Final Report Niyathi Srinivasan.pdf
  Restricted Access
1.71 MBAdobe PDFView/Open

Page view(s)

34
Updated on May 20, 2022

Download(s)

2
Updated on May 20, 2022

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.