We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
USNAP: fast unique dense region detection and its application to lung cancer.
- Authors
Wong, Serene W H; Pastrello, Chiara; Kotlyar, Max; Faloutsos, Christos; Jurisica, Igor
- Abstract
Motivation Many real-world problems can be modeled as annotated graphs. Scalable graph algorithms that extract actionable information from such data are in demand since these graphs are large, varying in topology, and have diverse node/edge annotations. When these graphs change over time they create dynamic graphs, and open the possibility to find patterns across different time points. In this article, we introduce a scalable algorithm that finds unique dense regions across time points in dynamic graphs. Such algorithms have applications in many different areas, including the biological, financial, and social domains. Results There are three important contributions to this manuscript. First, we designed a scalable algorithm, USNAP , to effectively identify dense subgraphs that are unique to a time stamp given a dynamic graph. Importantly, USNAP provides a lower bound of the density measure in each step of the greedy algorithm. Second, insights and understanding obtained from validating USNAP on real data show its effectiveness. While USNAP is domain independent, we applied it to four non-small cell lung cancer gene expression datasets. Stages in non-small cell lung cancer were modeled as dynamic graphs, and input to USNAP. Pathway enrichment analyses and comprehensive interpretations from literature show that USNAP identified biologically relevant mechanisms for different stages of cancer progression. Third, USNAP is scalable, and has a time complexity of O (m + m c log n c + n c log n c) , where m is the number of edges, and n is the number of vertices in the dynamic graph; m c is the number of edges, and n c is the number of vertices in the collapsed graph. Availability and implementation The code of USNAP is available at https://www.cs.utoronto.ca/~juris/data/USNAP22.
- Subjects
NON-small-cell lung carcinoma; LUNG cancer; GREEDY algorithms; TIME complexity; GRAPH algorithms; APPROXIMATION algorithms
- Publication
Bioinformatics, 2023, Vol 39, Issue 8, p1
- ISSN
1367-4803
- Publication type
Article
- DOI
10.1093/bioinformatics/btad477