Please use this identifier to cite or link to this item:
Title: Machine learning for chemical components testing
Authors: Tan, Ashley Zhao Kiat
Keywords: Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Mechanical engineering::Robots
Issue Date: 2021
Publisher: Nanyang Technological University
Source: Tan, A. Z. K. (2021). Machine learning for chemical components testing. Final Year Project (FYP), Nanyang Technological University, Singapore.
Project: C050
Abstract: Terahertz time domain spectroscopy (THz-TDS) involves the use of THz radiation to identify chemicals via their absorption spectra characteristics. Building on the preceding Final Year Project which proved the feasibility of incorporating machine learning with THz-TDS to identify pure chemicals, this report explores the improvement on chemical mixture identification. From data collected in the lab from the industrial partner, Anor Technologies, various new pre-processing approaches are applied. These include the use of Mixture Synthesis to bolster the mixture dataset, as well as a Stacked Area approach to average out the inconsistencies between individual datapoints obtained from the THz-TDS machine. Following this, two new machine learning approaches are taken to evaluate the effectiveness on chemical mixture identification. Multi-label problem transformation techniques and algorithm adaptations such as Binary Relevance, Classifier Chain, Label Powerset and MLkNN are taken to tackle the mixture identification problem, along with the application of a 1D CNN as a new machine learning approach. Results from the training and testing show that while the Stacked Area approach can greatly increase the training and validation recall and precision scores up to 0.99, the drawback is a five times reduction in dataset size, which can affect model generalization performance. Further testing results show that the 1D CNN model has a very good generalization performance on completely unseen data, achieving a recall and precision score of around 0.98. The two novel approaches are shown to be very effective in this field of chemical detection using THz-TDS, with the Mixture Synthesis method effectively able to double the size of the existing datasets, and the Stacked Area leading to trained models with consistently better recall and precision scores compared to the original data. Future considerations to build on this work could involve the incorporation of data augmentation methods by randomising the offset, slope, and multiplication of the original absorption spectra to produce even more datapoints. Further development can be made on the model, switching to a regression model that can quantitatively detect the composition of chemicals in a mixture.
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:MAE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Ash FYP Report Final (Upload) .pdf
  Restricted Access
1.92 MBAdobe PDFView/Open

Page view(s)

Updated on May 21, 2022


Updated on May 21, 2022

Google ScholarTM


Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.