Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTran, Nhat Sang
dc.description.abstractBackground: Genome-wide association studies (GWAS) and epigenome-wide association studies (EWAS) hold the promise to explore the relationships among genetic variants, epigenetic changes and human diseases. The challenges lie in their computational burden due to the number of data returned from epigenetic measures (450k CpGs measured by Illumina Infinium 450k array) and genetic variants (millions of SNPs by sequence technology). As EWAS is a young and emerging topic, comprehensive computational supports are currently far behind the demands. An R package called GEM was created to discover how genetic variants (G) and environment factors (E) influenced methylation changes (M) in EWAS. The first generation of GEM uses linear model to determine the associations, so GEM finds it difficult to go through millions of regressions in large sample size. Solution: In this project, we implement the second generation GEM. We replaced the linear regression in the old GEM package with the newly developed semi-parallel approach. We first simulated pseudo methylation data, SNP data and environment data. Then we benchmark new Gmodel and Emodel by comparing the results with the standard respective functions in the old GEM. We showed the new Emodel can achieve around 500 times of efficiency with 1,000 samples and 10,000 CpGs; Gmodel can greatly improve the efficiency of more than 1,500 times with the same sample and CpG size and 60,000 SNPs. Conclusion: We implemented the new models and reported the computational efficiency of them. We also analysed the quality of accuracy in their results. This quality control process proved that our solution is reliable and should be applied in real study.en_US
dc.format.extent46 p.en_US
dc.rightsNanyang Technological University
dc.subjectDRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciencesen_US
dc.titleImprove the computation efficiency in epigenome-wide and genome-wide association studiesen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.supervisorKwoh Chee Keongen_US
dc.contributor.schoolSchool of Computer Engineeringen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
dc.contributor.researchCentre for Computational Intelligenceen_US
item.fulltextWith Fulltext-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
  Restricted Access
FYP Report SCE14-05001.62 MBAdobe PDFView/Open

Page view(s)

Updated on Jul 18, 2024

Download(s) 50

Updated on Jul 18, 2024

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.