Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/90338
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSinha, Swatien
dc.contributor.authorEisenhaber, Birgiten
dc.contributor.authorJensen, Lars Juhlen
dc.contributor.authorKalbuaji, Bharataen
dc.contributor.authorEisenhaber, Franken
dc.date.accessioned2019-09-11T02:32:02Zen
dc.date.accessioned2019-12-06T17:46:08Z-
dc.date.available2019-09-11T02:32:02Zen
dc.date.available2019-12-06T17:46:08Z-
dc.date.issued2018en
dc.identifier.citationSinha, S., Eisenhaber, B., Jensen, L. J., Kalbuaji, B., & Eisenhaber, F. (2018). Darkness in the human gene and protein function space : widely modest or absent illumination by the life science literature and the trend for fewer protein function discoveries since 2000. Proteomics, 18(21-22), 1800093-. doi:10.1002/pmic.201800093en
dc.identifier.issn1615-9853en
dc.identifier.urihttps://hdl.handle.net/10356/90338-
dc.description.abstractThe mentioning of gene names in the body of the scientific literature 1901–2017 and their fractional counting is used as a proxy to assess the level of biological function discovery. A literature score of one has been defined as full publication equivalent (FPE), the amount of literature necessary to achieve one publication solely dedicated to a gene. It has been found that less than 5000 human genes have each at least 100 FPEs in the available literature corpus. This group of elite genes (4817 protein‐coding genes, 119 non‐coding RNAs) attracts the overwhelming majority of the scientific literature about genes. Yet, thousands of proteins have never been mentioned at all, ≈2000 further proteins have not even one FPE of literature and, for ≈4600 additional proteins, the FPE count is below 10. The protein function discovery rate measured as numbers of proteins first mentioned or crossing a threshold of accumulated FPEs in a given year has grown until 2000 but is in decline thereafter. This drop is partially offset by function discoveries for non‐coding RNAs. The full human genome sequencing does not boost the function discovery rate. Since 2000, the fastest growing group in the literature is that with at least 500 FPEs per gene.en
dc.description.sponsorshipASTAR (Agency for Sci., Tech. and Research, S’pore)en
dc.format.extent13 p.en
dc.language.isoenen
dc.relation.ispartofseriesProteomicsen
dc.rights© 2018 Bioinformatics Institute. Proteomics Published by WILEY‐VCH Verlag GmbH & Co. KGaA, Weinheim. This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.en
dc.subjectComplete Human Genomeen
dc.subjectGene Function Discoveryen
dc.subjectEngineering::Computer science and engineeringen
dc.titleDarkness in the human gene and protein function space : widely modest or absent illumination by the life science literature and the trend for fewer protein function discoveries since 2000en
dc.typeJournal Articleen
dc.contributor.schoolSchool of Computer Science and Engineeringen
dc.identifier.doi10.1002/pmic.201800093en
dc.description.versionPublished versionen
item.fulltextWith Fulltext-
item.grantfulltextopen-
Appears in Collections:SCSE Journal Articles
Files in This Item:
File Description SizeFormat 
pmic.201800093.pdf506.87 kBAdobe PDFThumbnail
View/Open

Google ScholarTM

Check

Altmetric


Plumx

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.