Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/103840
Title: Beware of moving tagerts: reference proteome content fluctuates substantially over the years
Authors: Sirota, Fernanda L.
Batagov, Arsen
Schneider, Georg
Eisenhaber, Birgit
Eisenhaber, Frank
Maurer-Stroh, Sebastian
Keywords: DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences
Issue Date: 2012
Source: Sirota, F. L., Batagov, A., Schneider, G., Eisenhaber, B., Eisenhaber, F., & Maurer-Stroh, S. (2012). Beware of moving tagerts: reference proteome content fluctuates substantially over the years. Journal of bioinformatics and computational biology, 10(06), 1250020-.
Series/Report no.: Journal of bioinformatics and computational biology
Abstract: Reference proteomes are generated by increasingly sophisticated annotation pipelines as part of regular genome build releases; yet, the corresponding changes in reference proteomes' content are dramatic. In the history of the NCBI-curated human proteome, the total number of entries has remained roughly constant but approximately half of the proteins from the 2003 build 33 are no longer represented by entries in current releases, while about the same number of new proteins have been added (for sequence identity thresholds 50–90%). Although mostly hypothetical proteins are affected, there are also spectacular cases of entry removal/addition of well studied proteins. The changes between the 2003 and recent human proteomes are in a similar order of magnitude as the differences between recent human and chimpanzee proteome releases. As an application example, we show that the proteome fluctuations affect the interpretation (about 74% of hits) of organelle-specific mass-spectrometry data. Although proteome quality tends to improve with more recent releases as, for example, the fraction of proteins with functional annotation has increased over time, existing evidence implies that, apparently, the proteome content still remains incomplete, not just pertaining to isoforms/sequence variants but also to proteins and their families that are clearly distinct.
URI: https://hdl.handle.net/10356/103840
http://hdl.handle.net/10220/17113
DOI: http://dx.doi.org/10.1142/S0219720012500205
metadata.item.grantfulltext: none
metadata.item.fulltext: No Fulltext
Appears in Collections:SBS Journal Articles

Google ScholarTM

Check

Altmetric

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.