Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/180288
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Low, Ashton Kin Yun | en_US |
dc.contributor.author | Nimrod, Lilith | en_US |
dc.contributor.author | Alam, Sameer | en_US |
dc.contributor.author | Poh, Leston Choo Kiat | en_US |
dc.date.accessioned | 2024-10-01T08:43:44Z | - |
dc.date.available | 2024-10-01T08:43:44Z | - |
dc.date.issued | 2024 | - |
dc.identifier.citation | Low, A. K. Y., Nimrod, L., Alam, S. & Poh, L. C. K. (2024). Deep neural network-based automatic speech recognition for ATC-pilot audio transcription. 2024 International Conference on Research in Air Transportation (ICRAT). | en_US |
dc.identifier.uri | https://hdl.handle.net/10356/180288 | - |
dc.description.abstract | Artificial Intelligence (AI) has demonstrated ability to manage complex processes highly effectively, and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATM will rely on digital data, such as surveillance, radar, weather, flight plans, for their operation. However, the foundational Air Traffic Control Officer (ATCo)-pilot communication medium is voice, which is a vital source of situational data. Controller Pilot Data Link Communications (CPDLC) has been developed as an alternative, text-based communication delivery method, however ATCo-pilot communications will not be completed transitioned to this framework in the near-term future. Moreover, as CPDLC is a one-to-one communication paradigm, the additional situational awareness of other traffic provided by traditional party-line VHF communications is potentially lost. Therefore, an automated speech to-text translation tool can be seen as a missing link, enabling traditional ATCo-pilot voice communications to be automatically translated and input into a datalink system such as CPDLC. To this end this paper presents a Machine Learning (ML) based Automatic Speech Recognition (ASR) framework that is able to accurately translate ATCo-pilot speech communication to text, achieving a Word Error Rate of only 6.13%. Moreover, the presented model is able to extract seven entities with an accuracy and F1-score of 91.8% and 84.4% respectively, which is similar to previously presented models but can only capable of extracting three. A detailed design of the framework is provided to enable its replication by the wider research community. | en_US |
dc.language.iso | en | en_US |
dc.relation | NTU Ref: 2017-1619 | en_US |
dc.rights | © 2024 ICRAT. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at https://www.icrat.org/upcoming-conference/papers/. | en_US |
dc.subject | Engineering | en_US |
dc.title | Deep neural network-based automatic speech recognition for ATC-pilot audio transcription | en_US |
dc.type | Conference Paper | en |
dc.contributor.school | School of Mechanical and Aerospace Engineering | en_US |
dc.contributor.conference | 2024 International Conference on Research in Air Transportation (ICRAT) | en_US |
dc.contributor.research | Air Traffic Management Research Institute | en_US |
dc.description.version | Published version | en_US |
dc.identifier.url | https://www.icrat.org/upcoming-conference/papers/ | - |
dc.identifier.url | https://www.icrat.org/ | - |
dc.subject.keywords | Artificial intelligence | en_US |
dc.subject.keywords | Machine learning | en_US |
dc.subject.keywords | Automatic speech recognition | en_US |
dc.subject.keywords | Air traffic control | en_US |
dc.subject.keywords | ATCo-pilot communication | en_US |
dc.citation.conferencelocation | Singapore | en_US |
dc.description.acknowledgement | This research is supported by Saab Sweden under its Machine Learning For Airport Management And Tower Control project at NTU Singapore. | en_US |
item.grantfulltext | open | - |
item.fulltext | With Fulltext | - |
Appears in Collections: | MAE Conference Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ICRAT2024_paper_88.pdf | 1.17 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.