Please use this identifier to cite or link to this item:
https://hdl.handle.net/10356/184182
Title: | Kubernetes: deploying speech SGDecoding for scalability | Authors: | Yeo, Marcus | Keywords: | Computer and Information Science | Issue Date: | 2025 | Publisher: | Nanyang Technological University | Source: | Yeo, M. (2025). Kubernetes: deploying speech SGDecoding for scalability. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184182 | Project: | CCDS24-0009 | Abstract: | In recent years, cloud-based deployment of web applications has become standard practice due to its cost-effectiveness and ease of modification. As costs often scale with usage, achieving both efficient resource utilization and reliable scalability under varying loads has grown increasingly important. This project focuses on the deployment and scaling of the SGDecoding Web Application, an application offering live multilingual speech-to-text transcription, using Amazon Web Services (AWS). Prior to this project, SGDecoding was operational only in a local envi- ronment and lacked a structured online deployment strategy. Before cloud deployment, fundamental elements of the application, such as the existing technology stack, was thor- oughly analyzed. This information went into informing the selection of cloud services, scaling tools, and architectural patterns used in the project. Furthermore, an in-depth analysis of cloud services, scaling tools, and architectures was conducted to identify opti- mal approaches for hosting the application and ensuring its resilience under varying user loads. Following deployment, Kubernetes Horizontal Pod Autoscaler and Cluster Au- toscaler were used to implement cloud automated scaling mechanisms. Doing so enabled the cluster to dynamically adjust resources depending on prevailing user traffic to the application. In the testing phase, the application was shown to be able to handle up to 1,000 concurrent users under varying test conditions. This presented a 90% increase in concurrent users over default scaling approaches. These tests clearly demonstrate the system’s resilience under heavy and fluctuating workloads. | URI: | https://hdl.handle.net/10356/184182 | Schools: | College of Computing and Data Science | Fulltext Permission: | restricted | Fulltext Availability: | With Fulltext |
Appears in Collections: | CCDS Student Reports (FYP/IA/PA/PI) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Marcus Yeo FYP Report Final.pdf Restricted Access | 10.78 MB | Adobe PDF | View/Open |
Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.