Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184182
Title: Kubernetes: deploying speech SGDecoding for scalability
Authors: Yeo, Marcus
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Yeo, M. (2025). Kubernetes: deploying speech SGDecoding for scalability. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184182
Project: CCDS24-0009
Abstract: In recent years, cloud-based deployment of web applications has become standard practice due to its cost-effectiveness and ease of modification. As costs often scale with usage, achieving both efficient resource utilization and reliable scalability under varying loads has grown increasingly important. This project focuses on the deployment and scaling of the SGDecoding Web Application, an application offering live multilingual speech-to-text transcription, using Amazon Web Services (AWS). Prior to this project, SGDecoding was operational only in a local envi- ronment and lacked a structured online deployment strategy. Before cloud deployment, fundamental elements of the application, such as the existing technology stack, was thor- oughly analyzed. This information went into informing the selection of cloud services, scaling tools, and architectural patterns used in the project. Furthermore, an in-depth analysis of cloud services, scaling tools, and architectures was conducted to identify opti- mal approaches for hosting the application and ensuring its resilience under varying user loads. Following deployment, Kubernetes Horizontal Pod Autoscaler and Cluster Au- toscaler were used to implement cloud automated scaling mechanisms. Doing so enabled the cluster to dynamically adjust resources depending on prevailing user traffic to the application. In the testing phase, the application was shown to be able to handle up to 1,000 concurrent users under varying test conditions. This presented a 90% increase in concurrent users over default scaling approaches. These tests clearly demonstrate the system’s resilience under heavy and fluctuating workloads.
URI: https://hdl.handle.net/10356/184182
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Marcus Yeo FYP Report Final.pdf
  Restricted Access
10.78 MBAdobe PDFView/Open

Page view(s)

25
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.