Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/184621
Title: Random walk-based erasure for medical imaging: augmenting data while preserving diagnostic features
Authors: Lyu, Lin
Keywords: Computer and Information Science
Medicine, Health and Life Sciences
Issue Date: 2025
Publisher: Nanyang Technological University
Source: Lyu, L. (2025). Random walk-based erasure for medical imaging: augmenting data while preserving diagnostic features. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/184621
Project: ISM-DISS-03486
Abstract: This study leverages a revised single‐label CXR14 dataset comprising 91,324 chest radiographs and addresses the dual challenges of limited data availability and model generalizability in medical imaging by means of novel data‐augmentation techniques. In the realm of image processing, we focus primarily on a random‐walk‐based erasure (RWE) algorithm, which we have specifically optimized and simplified for the CXR14 dataset. This approach achieves a controlled occlusion–random feature exploration trade‐off on chest X-ray images, thereby preserving diagnostically critical structures while bolstering model robustness. The experimental framework employs DenseNet121 with focal loss optimization, incorporating three key innovations: 1) Dual-phase training combining augmented and original data 2) AUC-aware early stopping function for class-imbalanced learning 3) Multi-scale image quality assessment (IQA) using both reference-based (PSNR, SSIM, MS-SSIM) and no-reference metrics (BRISQUE, NIQE). The results demonstrate the superiority of RWE. Compared to the baseline original dataset, the RWE-enhanced dataset achieved a relative improvement of 3.24% in AUC on the full dataset (91,324 images), while the noise-injected dataset showed a relative enhancement of 1.35%. On the smaller dataset (10,000 images), the improvements were 12.5% and 11.7%, respectively. IQA analysis reveals RWE's unique advantage: while trailing in pixel-level fidelity, it better preserves structural semantics critical for diagnosis. Our investigation uncovers two critical phenomena: 1) Augmentation efficacy inversely correlates with dataset size, suggesting strongest benefits for data-scarce scenarios 2) Randomization strategies significantly impact model stability. The study further proposes a parameter optimization pipeline for medical augmentation, identifying optimal RWE configurations through four-stage hyperparameter tuning. These findings position RWE as a promising solution for medical AI's data challenges, though future work must address multi-label generalization and anatomical-prior integration. The proposed methodology provides a template for developing domain-specific augmentation strategies that balance physical realism with diagnostic utility, ultimately supporting more reliable and interpretable medical image analysis systems.
URI: https://hdl.handle.net/10356/184621
Schools: School of Electrical and Electronic Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:EEE Theses

Files in This Item:
File Description SizeFormat 
Dissertation_LYU_LIN_0428_v3_0_Minor_Revision.pdf
  Restricted Access
3.34 MBAdobe PDFView/Open

Page view(s)

10
Updated on May 7, 2025

Download(s)

3
Updated on May 7, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.