Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/182230
Title: Utilizing chemical domain knowledge and machine learning for nanoparticle and biochemical spectroscopic analysis
Authors: Tan, Emily Xi
Keywords: Chemistry
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Tan, E. X. (2024). Utilizing chemical domain knowledge and machine learning for nanoparticle and biochemical spectroscopic analysis. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/182230
Abstract: Rapid and accurate chemical analysis is desirable in many scientific and technological fields but remains challenging. This thesis demonstrates the integration of domain knowledge-driven feature engineering and machine learning (ML) with UV-vis and SERS spectroscopic analyses for high-throughput characterization of both nanomaterials and biochemicals. Traditional electron microscopy for SERS-active nanoparticles is slow and tedious while existing SERS methods often rely on static spectral matching that only identifies known molecules and struggles with unknown chemical mixtures. To address these challenges, this work introduces a twin-pillar strategy: using ML and UV-vis spectroscopy for rapid nanocharacterization and applying ML-driven SERS to detect unknown biochemicals. Chapter 2 introduces a ML-based UV-vis method for characterizing gold nanospheres, achieving high accuracy over the widest size range through the use of basis spline regression. Chapter 3 extends this approach to more complex nanoshapes, such as nanocubes in mixtures, using feature engineering for unprecedented size, purity, and shape predictions from multiplex extinction spectra with low error rates. Chapter 4 presents a hierarchical ML framework for SERS that identifies and quantifies unknown cerebrosides at various concentrations. This signifies a paradigm shift from passive spectral analysis to active identification of unknown molecules. Chapter 5 develops a transfer learning framework achieving precise SERS identification and quantification of unknown carnitine mixtures. Finally, we discuss the prospects of ML-driven spectroscopic analysis to harness high-dimensional multimodal data and identify new and “unseen” analytes amid various interferences.
URI: https://hdl.handle.net/10356/182230
DOI: 10.32657/10356/182230
Schools: School of Chemistry, Chemical Engineering and Biotechnology 
Rights: This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:CCEB Theses

Files in This Item:
File Description SizeFormat 
Amended PhD Thesis_Emily Tan Xi G2003773C.pdf9.29 MBAdobe PDFThumbnail
View/Open

Page view(s)

182
Updated on May 7, 2025

Download(s) 50

101
Updated on May 7, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.