Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/178803
Title: A method for mining condition-specific co-expressed genes in Camellia sinensis based on k-means clustering
Authors: Zheng, Xinghai
Lim, Peng Ken
Mutwil, Marek
Wang, Yuefei
Keywords: Medicine, Health and Life Sciences
Issue Date: 2024
Source: Zheng, X., Lim, P. K., Mutwil, M. & Wang, Y. (2024). A method for mining condition-specific co-expressed genes in Camellia sinensis based on k-means clustering. BMC Plant Biology, 24(1), 373-. https://dx.doi.org/10.1186/s12870-024-05086-5
Journal: BMC Plant Biology 
Abstract: Background: As one of the world’s most important beverage crops, tea plants (Camellia sinensis) are renowned for their unique flavors and numerous beneficial secondary metabolites, attracting researchers to investigate the formation of tea quality. With the increasing availability of transcriptome data on tea plants in public databases, conducting large-scale co-expression analyses has become feasible to meet the demand for functional characterization of tea plant genes. However, as the multidimensional noise increases, larger-scale co-expression analyses are not always effective. Analyzing a subset of samples generated by effectively downsampling and reorganizing the global sample set often leads to more accurate results in co-expression analysis. Meanwhile, global-based co-expression analyses are more likely to overlook condition-specific gene interactions, which may be more important and worthy of exploration and research. Results: Here, we employed the k-means clustering method to organize and classify the global samples of tea plants, resulting in clustered samples. Metadata annotations were then performed on these clustered samples to determine the “conditions” represented by each cluster. Subsequently, we conducted gene co-expression network analysis (WGCNA) separately on the global samples and the clustered samples, resulting in global modules and cluster-specific modules. Comparative analyses of global modules and cluster-specific modules have demonstrated that cluster-specific modules exhibit higher accuracy in co-expression analysis. To measure the degree of condition specificity of genes within condition-specific clusters, we introduced the correlation difference value (CDV). By incorporating the CDV into co-expression analyses, we can assess the condition specificity of genes. This approach proved instrumental in identifying a series of high CDV transcription factor encoding genes upregulated during sustained cold treatment in Camellia sinensis leaves and buds, and pinpointing a pair of genes that participate in the antioxidant defense system of tea plants under sustained cold stress. Conclusions: To summarize, downsampling and reorganizing the sample set improved the accuracy of co-expression analysis. Cluster-specific modules were more accurate in capturing condition-specific gene interactions. The introduction of CDV allowed for the assessment of condition specificity in gene co-expression analyses. Using this approach, we identified a series of high CDV transcription factor encoding genes related to sustained cold stress in Camellia sinensis. This study highlights the importance of considering condition specificity in co-expression analysis and provides insights into the regulation of the cold stress in Camellia sinensis.
URI: https://hdl.handle.net/10356/178803
ISSN: 1471-2229
DOI: 10.1186/s12870-024-05086-5
Schools: School of Biological Sciences 
Rights: © The Author(s) 2024. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Fulltext Permission: open
Fulltext Availability: With Fulltext
Appears in Collections:SBS Journal Articles

Files in This Item:
File Description SizeFormat 
s12870-024-05086-5.pdf7.79 MBAdobe PDFThumbnail
View/Open

SCOPUSTM   
Citations 50

2
Updated on Mar 16, 2025

Page view(s)

86
Updated on Mar 20, 2025

Download(s)

20
Updated on Mar 20, 2025

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.