Please use this identifier to cite or link to this item:
|Title:||Machine learning for chemical components testing||Authors:||Tan, Ashley Zhao Kiat||Keywords:||Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
|Issue Date:||2021||Publisher:||Nanyang Technological University||Source:||Tan, A. Z. K. (2021). Machine learning for chemical components testing. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/150216||Project:||C050||Abstract:||Terahertz time domain spectroscopy (THz-TDS) involves the use of THz radiation to identify chemicals via their absorption spectra characteristics. Building on the preceding Final Year Project which proved the feasibility of incorporating machine learning with THz-TDS to identify pure chemicals, this report explores the improvement on chemical mixture identification. From data collected in the lab from the industrial partner, Anor Technologies, various new pre-processing approaches are applied. These include the use of Mixture Synthesis to bolster the mixture dataset, as well as a Stacked Area approach to average out the inconsistencies between individual datapoints obtained from the THz-TDS machine. Following this, two new machine learning approaches are taken to evaluate the effectiveness on chemical mixture identification. Multi-label problem transformation techniques and algorithm adaptations such as Binary Relevance, Classifier Chain, Label Powerset and MLkNN are taken to tackle the mixture identification problem, along with the application of a 1D CNN as a new machine learning approach. Results from the training and testing show that while the Stacked Area approach can greatly increase the training and validation recall and precision scores up to 0.99, the drawback is a five times reduction in dataset size, which can affect model generalization performance. Further testing results show that the 1D CNN model has a very good generalization performance on completely unseen data, achieving a recall and precision score of around 0.98. The two novel approaches are shown to be very effective in this field of chemical detection using THz-TDS, with the Mixture Synthesis method effectively able to double the size of the existing datasets, and the Stacked Area leading to trained models with consistently better recall and precision scores compared to the original data. Future considerations to build on this work could involve the incorporation of data augmentation methods by randomising the offset, slope, and multiplication of the original absorption spectra to produce even more datapoints. Further development can be made on the model, switching to a regression model that can quantitatively detect the composition of chemicals in a mixture.||URI:||https://hdl.handle.net/10356/150216||Fulltext Permission:||restricted||Fulltext Availability:||With Fulltext|
|Appears in Collections:||MAE Student Reports (FYP/IA/PA/PI)|
Updated on May 21, 2022
Updated on May 21, 2022
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.