A comparison of curated gene sets versus transcriptomics-derived gene signatures for detecting pathway activation in immune cells

Download statistics for this document (evaluated according to COUNTER):

Liu, B.; Lindner, P.; Jirmo, A.C.; Maus, U.; Illig, T. et al.: A comparison of curated gene sets versus transcriptomics-derived gene signatures for detecting pathway activation in immune cells. In: BMC Bioinformatics 21 (2020), No. 1, 28. DOI: https://doi.org/10.1186/s12859-020-3366-4

Version in the repository

To cite the version in the repository, please use this DOI: https://doi.org/10.15488/10629


Total downloads: 159




Abstract:
Background: Despite the significant contribution of transcriptomics to biological and biomedical research, interpreting long lists of significantly differentially expressed genes remains a challenging step in the analysis process. Gene set enrichment analysis is a standard approach for summarizing differentially expressed genes into pathways or other gene groupings. Here, we explore an alternative to using gene sets from curated databases. We examine the method of deriving custom gene sets that may be relevant to a given experiment from reference data sets of previous transcriptomics studies. We call these data-derived gene sets "gene signatures" for the biological process tested in the previous study. We focus on the feasibility of this approach for analyzing immune-related processes, which are complex in nature but play an important role in medical research.

Results: We evaluate several statistical approaches to detecting the activity of a gene signature in a target data set. We compare the performance of the data-derived gene signature approach with that of comparable GO term gene sets across all of the statistical tests. A total of 61 differential expression comparisons generated from 26 transcriptome experiments were included in the analysis. These experiments covered eight immunological processes in eight types of leukocytes. The data-derived signatures detected the presence of immunological processes in the test data with modest accuracy (AUC = 0.67). GO- and literature-based gene sets performed worse (AUC = 0.59). Both approaches were plagued by poor specificity.

Conclusions: When investigators seek to test specific hypotheses, the data-derived signature approach can perform as well as, if not better than, standard gene-set-based approaches for immunological signatures. Furthermore, data-derived signatures can be generated in cases where well-defined gene sets are lacking from pathway databases, and they offer the opportunity to define signatures in a cell-type-specific manner. However, neither the data-derived signatures nor standard gene sets can be shown to reliably provide negative predictions for negative cases. We conclude that the data-derived signature approach is a useful and sometimes necessary tool, but analysts should be wary of false positives. © 2020 The Author(s).
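The abstract summarizes detection performance as an AUC value. As a minimal sketch of what that number measures, the snippet below computes AUC as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The function name `auc` and the toy scores and labels are hypothetical illustrations only; they are not data or code from the paper, and the paper's own scoring statistics are not reproduced here.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison.

    Returns the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy example: six comparisons, three of which truly
# involve the process of interest (label 1) and three of which do not.
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
labels = [1, 1, 1, 0, 0, 0]
toy_auc = auc(scores, labels)
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is why the reported values of 0.67 and 0.59 are described as modest.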
License: CC BY 4.0 Unported
Publication type: Article
Publication status: publishedVersion
First published: 2020
This publication appears in collection(s): Naturwissenschaftliche Fakultät


Origin of downloads by country:

Rank  Country                       Downloads  Share
1     Germany                       94         59.12%
2     United States                 22         13.84%
3     China                         9          5.66%
4     Russian Federation            4          2.52%
5     Iran, Islamic Republic of     4          2.52%
6     No geo information available  3          1.89%
7     Peru                          3          1.89%
8     Netherlands                   3          1.89%
9     Italy                         2          1.26%
10    Czech Republic                2          1.26%
      Other                         13         8.18%
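The percentage column follows directly from the download counts and the total of 159. A short sketch of that arithmetic, using only the figures listed in the table (the variable names are illustrative):

```python
# Download counts per country, copied from the table above.
counts = {
    "Germany": 94,
    "United States": 22,
    "China": 9,
    "Russian Federation": 4,
    "Iran, Islamic Republic of": 4,
    "No geo information available": 3,
    "Peru": 3,
    "Netherlands": 3,
    "Italy": 2,
    "Czech Republic": 2,
    "Other": 13,
}

total = sum(counts.values())

# Share of each country in percent, rounded to two decimals as in the table.
shares = {country: round(100 * n / total, 2) for country, n in counts.items()}

for country, n in counts.items():
    print(f"{country:<30}{n:>4}{shares[country]:>8.2f}%")
```

The counts sum to the reported total, so the shares in the table are consistent with it.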



Note

Download statistics are collected in accordance with the "COUNTER Code of Practice for e-Resources", which applies internationally recognized rules and standards. COUNTER is an international non-profit organization in which library associations, database providers, and publishers work together on standards for collecting, storing, and processing usage data for electronic resources, with the aim of ensuring objectivity and comparability. Only accesses to the corresponding full texts are counted, not visits to the website itself.