Mining Dense Subgraphs with Similar Edges
Rozenshtein, Polina; Preti, Giulia; Gionis, Aristides; Velegrakis, Yannis
(2021) Hutter, Frank, Kersting, Kristian, Lijffijt, Jefrey, Valera, Isabel (eds.), Machine Learning and Knowledge Discovery in Databases, pp. 20 - 36
Lecture Notes in Computer Science, pp. 20 - 36
European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2020, pp. 20 - 36
(Part of book)
Abstract
When searching for interesting structures in graphs, it is often important to take into account not only the graph connectivity, but also the metadata available, such as node and edge labels, or temporal information. In this paper we are interested in settings where such metadata is used to define a
... read more
similarity between edges. We consider the problem of finding subgraphs that are dense and whose edges are similar to each other with respect to a given similarity function. Depending on the application, this function can be, for example, the Jaccard similarity between the edge label sets, or the temporal correlation of the edge occurrences in a temporal graph. We formulate a Lagrangian relaxation-based optimization problem to search for dense subgraphs with high pairwise edge similarity. We design a novel algorithm to solve the problem through parametric min-cut [15, 17], and provide an efficient search scheme to iterate through the values of the Lagrangian multipliers. Our study is complemented by an evaluation on real-world datasets, which demonstrates the usefulness and efficiency of the proposed approach.
show less
Download/Full Text
Keywords: Taverne, Theoretical Computer Science, General Computer Science
ISSN: 0302-9743
ISBN: 978-3-030-67663-6
978-3-030-67664-3
Publisher: Springer, Springer
Note: Funding Information: This research was partially supported by the National Research Foundation, Singapore under its AI Singapore Programme (AISG Award No: AISG-GC-2019-001). Aristides Gionis is supported by three Academy of Finland projects (286211, 313927, 317085), the ERC Advanced Grant REBOUND (834862), the EC H2020 RIA project ?SoBigData++? (871042), and the Wallenberg AI, Autonomous Systems and Software Program (WASP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Publisher Copyright: © 2021, Springer Nature Switzerland AG.
(Non peer reviewed)