Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/183913
Title: Benchmarking AI workflows: performance evaluation of DietCoke in visual question answering pipelines
Authors: Lim, Greg Song Wei
Keywords: Computer and Information Science
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Lim, G. S. W. (2025). Benchmarking AI workflows: performance evaluation of DietCoke in visual question answering pipelines. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/183913
Abstract: Recent advances in Large Language Models (LLMs) have led to the emergence of complex, agentic AI workflows that go beyond single-model inference. These workflows, such as those used in visual question answering (VQA), involve multiple interdependent steps including image understanding, external knowledge integration, and iterative reasoning. However, current benchmarking tools largely focus on isolated model performance and fail to capture the nuanced performance characteristics of such multi-stage AI systems. This gap makes it difficult to evaluate and optimize real-world AI deployments holistically. This project extends the Agentic Workflow Benchmark framework to support the end-to-end evaluation of complex LLM workflows. Specifically, the DietCoke VQA pipeline was re-implemented and modularized into a benchmarkable workflow, deployed with vLLM as inference backends. LAVIS’ Img2LLM was also integrated as a separate microservice to simulate a full VQA pipeline. Key findings show that vLLM achieves significantly higher throughput—up to 25× over the Hugging Face baseline—while maintaining comparable accuracy. However, Img2LLM introduces a bottleneck due to its sequential design and memory-bound QA generation, underscoring the need for further optimization in upstream components. These insights highlight the importance of benchmarking AI workflows as full systems rather than isolated models.
URI: https://hdl.handle.net/10356/183913
Schools: College of Computing and Data Science 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:CCDS Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
Lim Song Wei Greg - Final FYP report_u.pdf
  Restricted Access
975.74 kBAdobe PDFView/Open

Page view(s)

13
Updated on May 5, 2025

Download(s)

1
Updated on May 5, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.