
Associate Professor
Amazon Scholar
Computer Science & Engineering
Computational Medicine and Bioinformatics (affiliated)
University of Michigan, Ann Arbor
3753 Bob and Betty Beyster bldg
E-mail: [email protected]
Work: (+1) 734-764-4237
My research in large-scale data mining and machine learning (ML) focuses on principled, interpretable, and scalable methods for discovering and summarizing
the unknown unknowns
in the world's data by leveraging the inherent connections within them.
These connections are naturally modeled in networks or graphs, which in turn span every facet of our lives: email communication networks, knowledge graphs for web search, social networks, coauthorship graphs, brain networks, artificial neural networks, and more.
My work harnesses the massive scale, heterogeneity, and complexity of these data by providing concise and interpretable network summaries as a way to: (a) speed up follow-up analysis and methods that only need to apply on smaller, representative data; (b) gain understanding into the underlying processes, and inform our decisions by removing the burden of manually sifting through mountains of data; and (c) provide insights into scientific data, generate new hypotheses, and lead to novel scientific discoveries.
Research Interests: data science, large-scale graph mining, data mining, graph neural networks, network representation learning, network neuroscience, summarization, network similarity, network alignment, mining time-evolving and streaming data, graph anomaly and event detection, applied machine learning
For recent projects, visit the GEMS Lab webpage and our github repository!
Advising: The GEMS Lab is recruiting motivated and hard-working students interested in graph mining, and large-scale data analytics. If you are interested in joining the group as a PhD student and you are not already at the University of Michigan, please apply to CSE (deadline in December). If you are an undergrad or grad student at UM, and you are interested in any of the papers or projects listed on this page, send an email with your interests and CV to [email protected]. Responses may be slow (please do not take it personally!), depending on availability of positions.
Preprints
- Marlena Duda, Danai Koutra, Chandra Sripada. Validating Dynamicity in Resting State fMRI with Activation-Informed Temporal Segmentation. biorxiv
Books
- Danai Koutra, Christos Faloutsos. Individual and Collective Graph Mining: Principles, Algorithms, and Applications. Synthesis Lectures on Data Mining and Knowledge Discovery, October 2017, 206 pages. Morgan & Claypool publishers.
[code + slides]
Conferences and Journals
[DBLP]
[Google Scholar]
[ Code by the GEMS Lab]
- Jiong Zhu, Ryan A. Rossi, Anup Rao, Tung Mai, Nedim Lipka, Nesreen K. Ahmed, Danai Koutra. Graph Neural Networks with Heterophily. AAAI Conference on Artificial Intelligence (AAAI'21), February 2021.
- Jiong Zhu, Yujun Yan, Lingxiao Zhao, Mark Heimann, Leman Akoglu, Danai Koutra. Generalizing Graph Neural Networks Beyond Homophily. International Conference on Neural
Information Processing Systems (NeurIPS'20), December 2020.
[code]
- Yujun Yan, Kevin Swersky, Danai Koutra, Parthasarathy Ranganathan, Milad Hashemi. Neural Execution Engines: Learning to Execute Subroutines. International Conference on Neural
Information Processing Systems (NeurIPS'20), December 2020. [deepai.org]
- Tara Safavi, Danai Koutra, Edgar Meij. Improving the Utility of Knowledge Graph Embeddings with Calibration. Conference on Empirical Methods in Natural Language Processing (EMNLP'20), November 2020. (long paper)
- Tara Safavi, Danai Koutra. CoDEx: A Comprehensive Knowledge Graph Completion Benchmark. Conference on Empirical Methods in Natural Language Processing (EMNLP'20), November 2020. (long paper)
[data & code]
- Caleb Belth, Alican Büyükcakir, Danai Koutra. A Hidden Challenge of Link Prediction: Which Pairs to Check? IEEE International Conference on Data Mining (ICDM'20), November 2020. (long paper, acceptance rate 9.8%)
Best paper candidate
[code]
- Josh Gardner, Jawad Mroueh, Natalia Jenuwine, Noah Weaverdyck, Samuel Krassenstein,
Arya Farahi, Danai Koutra. Modeling and Predicting Multidimensional Patterns in Fleet Maintenance Data Towards Better Municipal Vehicle Management. Data Science and Advanced Analytics (DSAA'20), October 2020.
[code]
- Kyle K. Qin, Flora D. Salim, Yongli Ren, Wei Shao, Mark Heimann, Danai Koutra. G-CREWE: Graph CompREssion With Embedding for Network Alignment. ACM International Conference on Information and Knowledge Management (CIKM'20), October 2020.
- Xiyuan Chen, Mark Heimann, Fatemeh Vahedian, Danai Koutra. Consistent Network Alignment with Node Embedding. ACM International Conference on Information and
Knowledge Management (CIKM'20), October 2020.
[code]
- Wenjie Feng, Shenghua Liu, Danai Koutra, Huawei Shen, Xueqi Cheng. SpecGreedy: Unified Dense Subgraph Detection. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD'20), September 2020 (acceptance rate 19%)
Best student data mining award
[code]
- Caleb Belth, Xinyi (Carol) Zheng, Danai Koutra. Mining Persistent Activity in Continually Evolving Networks. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), August 2020 (acceptance rate 17%)
[code]
- Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, Michael Sjoding, Jenna Wiens. Democratizing EHR Analyses with FIDDLE - A Flexible Preprocessing Pipeline for Structured Clinical Data. Journal of the American Medical Informatics Association (JAMIA), June 2020.
[code]
Served as the the basis for analyzing hundreds of health records for COVID-19 patients and will be incorporated into the Michigan Medicine system.
- Ryan A. Rossi, Di Jin, Sungchul Kim, Nesreen K. Ahmed, Danai Koutra, John Boaz Lee. On Proximity and Structural Role-based Embeddings in Networks: Misconceptions, Techniques, and Applications. Transactions on Knowledge Discovery from Data (TKDD), April 2020.
- Caleb Belth, Xinyi (Carol) Zheng, Jilles Vreeken, Danai Koutra. What is normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization. The Web Copenminernference (WWW), April 2020 (oral presentation, acceptance rate 19%)
[code]
- Tara Safavi, Adam Fourney, Robert Sim, Marcin Juraszek, Shane Williams, Ned Friend, Danai Koutra, Paul Bennett. Toward Activity Discovery in the Personal Web. ACM International Conference on Web Search and Data Mining (WSDM), 2020. (oral presentation)
- Saba A. Al-Sayouri, Danai Koutra, Evangelos E. Papalexakis, Sarah S. Lam. SURREAL: Subgraph Robust Representation Learning. Applied Network Science 4(1): 88:1-88:20, December 2019.
- Tara Safavi, Caleb Belth, Lukas Faber, Davide Mottin, Emmanuel Müller, Danai Koutra. Personalized Knowledge Graph Summarization: From the Cloud to Your Pocket. IEEE International Conference on Data Mining (ICDM), 10 pages, November 2019. (long paper, acceptance rate: 9%)
[code]
- Mark Heimann, Tara Safavi, Danai Koutra. Distribution of Node Embeddings as Multiresolution Features for Graphs. IEEE International Conference on Data Mining (ICDM), 10 pages, November 2019. (long paper, acceptance rate: 9%)
Best student paper award
[code]
- Caleb Belth, Fahad Kamran, Donna Tjandra, Danai Koutra. When to Remember Where You Came from: Node Representation Learning in Higher-order Networks. IEEE/ACM International Conference on Social Networks Analysis and Mining (ASONAM), 4 pages, August 2019. (acceptance rate: 15%)
Also accepted for presentation at the 15th SIGKDD International Workshop on Mining and Learning with Graphs.
- Di Jin, Mark Heimann, Ryan Rossi, Danai Koutra. node2bits: Compact Time- and Attribute-aware Node Representations. ECML/PKDD European Conference on Principles and Practice of Knowledge Discovery in Databases, 16 pages, September 2019. (acceptance rate 18%)
[code]
- Michael Sjoding, Shengpu Tang, Parmida Davarmanesh, Yanmeng Song, Danai Koutra, and Jenna Wiens. Democratizing EHR Analyses - A Comprehensive, Generalizable Pipeline for Learning from Clinical Data. Machine Learning for Healthcare (MLHC), 1 page (clinical abstract), August 2019.
- Yujun Yan, Jiong Zhu, Marlena Duda, Eric Solarz, Chandra Sripada, Danai Koutra. GroupINN: Grouping-based Interpretable Neural Network-based Classification of Limited, Noisy Brain Data. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 9 pages + 1 page reproducibility appendix, August 2019. (oral presentation, acceptance rate 9%)
[code] Also accepted for presentation at the 15th SIGKDD International Workshop on Mining and Learning with Graphs.
- Di Jin, Ryan A. Rossi, Eunyee Koh, Sungchul Kim, Anup Rao, Danai Koutra. Latent Network Summarization: Bridging Network Embedding and Summarization. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 9 pages + 2 pages reproducibility appendix, August 2019. (acceptance rate 14%)
[code] Also accepted for presentation at the 15th SIGKDD International Workshop on Mining and Learning with Graphs.
- Di Jin*, Mark Heimann*, Tara Safavi, Mengdi Wang, Wei Lee, Lindsay Snider, Danai Koutra. Smart Roles: Inferring Professional Roles in Email Networks. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 9 pages + 1 page reproducibility appendix, August 2019. (acceptance rate 20.7%)
[code]
- Sang Won Lee, Aaron Willette, Danai Koutra, Walter Lasecki. The Effect of Social Interaction on Facilitating Audience Participation in a Live Music Performance. ACM Creativity and Cognition (C& C), 12 pages, June 2019. (acceptance rate 29.7%)
- Yike Liu, Linhong Zhu, Pedro Szekely, Aram Galstyan, Danai Koutra. Coupled Clustering of Time-Series and Networks. SIAM International Conference on Data Mining
(SDM), 9 pages (+4 pages supplementary material), May 2019. (acceptance rate 22.7%)
[code]
- Asso Hamzehei, Raymond K. Wong, Danai Koutra, Fang Chen. Collaborative topic regression for predicting topic-based social influence. Machine Learning Journal, Springer, January 2019.
- Mark Heimann, Haoming Shen, Tara Safavi, Danai Koutra. REGAL: Representation Learning-based Graph Alignment. ACM International Conference on Information and Knowledge Management (CIKM), October 2018 (acceptance rate 17%).
[code]
- Tara Safavi, Chandra Sripada, Danai Koutra. Fast Network Discovery on Sequence Data via Time-Aware Hashing. Knowledge and Information Systems (KAIS), December 2018.
[code] [slides]
- Oshini Goonetilleke, Kewen Liao, Danai Koutra, and Timos Sellis. On effective and efficient graph edge labeling. Distributed and Parallel Databases, 1-34 (to appear), 2018.
- Pin-Yu Chen, Chun-Chen Tu, Pai-Shun Ting, Ya-Yun Lo, Danai Koutra, Alfred O. Hero III. Identifying Influential Links for Event Propagation on Twitter: A Network of Networks Approach. IEEE Transactions on Signal and Information Processing over Networks (T-SIPN), July 2018.
- Saba Al-Sayouri, Ekta Gujral, Danai Koutra, Evangelos Papalexakis, Sarah Liam. t-PNE: Tensor-based Predictable Node Embeddings. ACM/IEEE ASONAM, August 2018 (acceptance rate 16%).
- Tara Safavi, Maryam Davoodi, Danai Koutra. Career Transitions and Trajectories: A Case Study in Computing. ACM SIGKDD, August 2018 (acceptance rate 22.5%).
[data] Also accepted for oral presentation at the 5th SIGKDD Workshop on Big Scholarly Data.
- Geoffrey D. Hannigan, Melissa B. Duhaime, Danai Koutra, Patrick D. Schloss. Biogeography & environmental conditions shape bacteriophage-bacteria networks across the human microbiome. PLOS Computational Biology, April 2018.
- Yike Liu, Tara Safavi, Abhilash Dighe, Danai Koutra. Graph Summarization Methods and Applications: A Survey. ACM Computing Surveys, July 2018.
- Yike Liu, Tara Safavi, Neil Shah, Danai Koutra. Reducing Large Graphs to Small Supergraphs: A Unified Approach. Social Network Analysis and Mining Journal, Springer, Februay 2018.
[code] [demo]
- Mark Heimann, Wei Lee, Shengjie Pan, Kuan-Yu Chen, Danai Koutra. HashAlign: Hash-based Alignment of Multiple Graphs. PAKDD, 2018 (acceptance rate 18%).
[code]
- Yujun Yan, Mark Heimann, Di Jin, Danai Koutra. Fast Flow-based Random Walk with Restart in a Multi-query Setting. SIAM SDM, 2018.
- Jie Song, Danai Koutra, Murali Mani, H.V. Jagadish. GeoAlign: Interpolating Aggregates over Unaligned Partitions. EDBT/ICDT, 2018 (regular paper).
Best paper runner-up award
Covered in the Michigan Engineer magazine: Built by humans, ruled by computers.
- Tara Safavi, Chandra Sripada and Danai Koutra. Scalable Hashing-Based Network Discovery. IEEE ICDM, 2017 (long paper, acceptance rate 9%).
[code]
[slides] Selected as one of the best papers of ICDM'17. Invited for potential publication at the KAIS Journal, Springer. ** Integrated into production systems to guide Google's network planning by identifying correlated anomalies.
- Di Jin and Danai Koutra. Exploratory Analysis of Graph Data by Leveraging Domain Knowledge. IEEE ICDM, 2017 (long paper, acceptance rate 9%).
[code]
- Neil Shah, Danai Koutra, Lisa Jin, Tianmin Zou, Brian Gallagher, Christos Faloutsos. On Summarizing Large-Scale Dynamic Graphs. Data Engineering Bulletin, September 2017, 40 (3).
- Josh Gardner, Danai Koutra, Jawad Mroueh, Victor Pang, Arya Farahi, Sam Krassenstein, and Jared Webb. Driving with Data: Modeling and Forecasting Vehicle Fleet Maintenance in Detroit. Data for Exchange Conference (D4XG'17), September 2017.
[code]
- Allie Cell, Bhavika Reddy Jalli, Adam Rauh, Xinyu Tan, Jared Webb, Joshua Bochu, Arya Farahi, Danai Koutra, Jonathan Stroud, Colin Tan. Understanding Blight Ticket Compliance in Detroit. Data Science for Social Good Conference (DSSG’17), September 2017.
- Di Jin, Aristotelis Leventidis, Haoming Shen, Ruowang Zhang, Junyue Wu and Danai Koutra. PERSEUS-HUB: Interactive and Collective Exploration of Large-Scale Graphs. Informatics 2017, 4 (22).
- Danai Koutra, Abhilash Dighe, Smriti Bhagat, Udi Weinsberg, Stratis Ioannidis, Christos Faloutsos and Jean Bolot. PNP: Fast Path Ensemble Method for Movie Design. ACM SIGKDD, August 2017. (oral presentation, acceptance rate 9%)
]
[slides]
- Amanda Minnich, Nikan Chavoshi, Danai Koutra and Abdullah Mueen. BotWalk: Efficient Adaptive Exploration of Twitter Bot Networks IEEE/ACM ASONAM, July 2017 (full paper, acceptance rate 19%).
[]
- Pravallika Devineni, Evangelos Papalexakis, Danai Koutra, Michalis Faloutsos. One Size Does Not Fit All: Profiling Personalized Time-Evolving User Behaviors. IEEE/ACM ASONAM, July 2017 (full paper, acceptance rate 19%).
[]
- Oshini Goonetilleke, Kewen Liao, Danai Koutra, and Timos Sellis. Edge Labeling Schemes for Graph Data. Statistical and Scientific Database Management (SSDBM), June 2017 (full paper, acceptance rate 23%).
[]
- Pravallika Devineni, Danai Koutra, Michalis Faloutsos, Christos Faloutsos. Facebook Wall Posts: A Model for User Behaviors. Social Network Analysis and Mining (SNAM), Springer, January 2017.
[]
- Asso Hamzehei, Jiang Qiang, Raymond Wong, Danai Koutra and Fang Chen. TSIM: Topic-based Social Influence Measurement for Social Networks. AusDM, December 2016.
Selected as one of the best papers of AusDM'16. Invited to Australasian Journal of Information Systems.
[]
- Venkata Krishna Pillutla, Zhanpeng Fang, Pravallika Devineni, Danai Koutra, Christos Faloutsos, Jie Tang. On Skewed Multi-dimensional Distributions: the FusionRP Model, Algorithms, and Discoveries. SIAM SDM 2016, May 2016.
[]
- Danai Koutra, Neil Shah, Joshua T. Vogelstein, Brian Gallagher, Christos Faloutsos. DeltaCon: A Principled Massive-Graph Similarity Function with Attribution. Transactions on Knowledge Discovery from Data (TKDD), February 2016.
[]
[slides]
[code in Matlab]
[code in R]
Taught in graduate courses: Rutgers University (CS 16:198:672).
- Neil Shah, Danai Koutra, Tianmin Zou, Brian Gallagher, Christos Faloutsos. TimeCrunch: Interpretable Dynamic Graph Summarization. Conference on Knowledge Discovery and Data Mining (KDD), August 2015.
]
[code]
- Pravallika Devineni, Danai Koutra, Michalis Faloutsos, Christos Faloutsos. If walls could talk: Patterns and anomalies in Facebook wallposts. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), August 2015.
[]
- Danai Koutra, Paul N. Bennett, Eric Horvitz. Events and Controversies: Influences of a Shocking News Event on Information Seeking. WWW 2015, Florence, Italy. May 2015. [acceptance 14.1%]
A short version of this paper appeared at the TAIA workshop of SIGIR'14. Arxiv version.
[] []
News coverage (MIT Review, Technology.org).
- Stephen Ranshous, Shitian Shen, Danai Koutra, Steven Harenberg, Christos Faloutsos, and Nagiza F. Samatova. Anomaly Detection in Dynamic Networks: A Survey. WIREs Computational Statistics, Wiley, January 2015.
[]
- Miguel Araujo, Stephan Guennemann, Spiros Papadimitriou, Christos Faloutsos, Prithwish Basu, Ananthram Swami, Evangelos Papalexakis, Danai Koutra. Discovery of `comet' communities in temporal and labeled graphs (Com2). Knowledge and Information Systems (KAIS, Springer), 2015.
[]
- Danai Koutra, U Kang, Jilles Vreeken, Christos Faloutsos. Summarizing and Understanding Large Graphs. Special Issue of Statistical Analysis and Data Mining, "Best of SDM 2014". October 2014.
[]
- Wolfgang Gatterbauer, Stephan Guennemann, Danai Koutra, Christos Faloutsos. Linearized and Single-Pass Belief Propagation. Proceedings of the VLDB Endowment, Volume 8(4) (VLDB'15), August 2015.
[]
[code] [Python Library]
- Walter S. Lasecki, Mitchell Gordon, Danai Koutra, Malte Jung, Steven P. Dow and Jeff P. Bigham. Glance: Rapidly Coding Behavioral Video with the Crowd.
ACM Symposium on User Interface Science and Technology (UIST'14), October 2014.
[]
- U Kang, Jay-Yoon Lee, Danai Koutra, Christos Faloutsos. Net-Ray: Visualizing and Mining Web-Scale Graphs. PAKDD 2014, Tainan, Taiwan, May 2014.
[]
Recipient of a travel award.
- Yibin Lin, Agha Ali Raza, Jay-Yoon Lee, Danai Koutra, Roni Rosenfeld, Christos Faloutsos. Influence Propagation: Patterns, Model and Case Study. PAKDD 2014, Tainan, Taiwan, May 2014.
[]
[slides]
- Miguel Araujo, Spiros Papadimitriou, Stephan Guennemann, Christos Faloutsos, Prithwish Basu, Ananthram Swami, Evangelos E. Papalexakis, Danai Koutra. Com2: Fast Automatic Discovery of Temporal (Comet) Communities. PAKDD 2014, Tainan, Taiwan, May 2014.
[]
Best student paper award (runner up).
- Leman Akoglu, Hanghang Tong, Danai Koutra. Graph-based Anomaly Detection and Description: A Survey. Data Mining and Knowledge Discovery (DAMI), April 2014.
[]
- Danai Koutra, U Kang, Jilles Vreeken, Christos Faloutsos. VoG:Summarizing and Understanding Large Graphs. SDM 2014, Philadelphia, PA, April 2014.
[]
[slides] [code] Updated!
Selected as one of the best papers of SDM'14.
Taught in graduate courses: Saarland University at the Dept. of Databases and Information Systems (TADA).
Recipient of a travel award.
- Danai Koutra, Hanghang Tong, David Lubensky. BIG-ALIGN: Fast Bipartite Graph Alignment. IEEE ICDM 2013, Dallas, TX, December 2013.
[]
[slides]
Recipient of a travel award.
- Michele Berlingerio, Danai Koutra, Tina Elliasi-Rad, Christos Faloutsos.Network Similarity via Multiple Social Theories. Proceedings of the 5th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013), Niagara Falls, Canada, August 2013.
[code]
[]
- Ted Senator, Danai Koutra et al. Detecting Insider Threats in a Real Corporate Database of Computer Usage Activities.
KDD 2013, Chicago, IL, August 2013.
[]
- Danai Koutra, Yu Gong, Sephira Ryman, Rex Jung, Joshua Vogelstein, Christos Faloutsos. Are all brains wired equally? OHBM 2013, Seattle, WA, June 2013.
[]
- Jay-Yoon Lee, U Kang, Danai Koutra, Christos Faloutsos. Fast anomaly detection despite the duplicates. WWW 2013, Rio de Janeiro, Brazil, May 2013. (poster)
[]
- Danai Koutra, Joshua Vogelstein, Christos Faloutsos. DeltaCon: A Principled Massive-Graph Similarity Function.SDM 2013, Austin, Texas, May 2013.
[]
[slides] [code]
Taught in graduate courses: Rutgers University (CS 16:198:672).
Recipient of a travel award.
- Danai Koutra, Vasileios Koutras, B. Aditya Prakash, Christos Faloutsos. Patterns amongst Competing Task Frequencies: Super-Linearities, and the Almond-DG model. PAKDD 2013, Gold Coast, Queensland, Australia, April 2013.
[]
[slides]
Taught in graduate courses: Virginia Tech (CS 6604).
- Danai Koutra, Evangelos Papalexakis, Christos Faloutsos. TENSORSPLAT: Spotting Latent Anomalies in Time.
PCI (16th Panhellenic Conference on Informatics w/ international participation), Piraeus, Greece, Oct. 2012.
[]
[slides]
- Keith Henderson, Brian Gallagher, Tina Eliassi-Rad, Hanghang Tong, Sugato Basu, Leman Akoglu, Danai Koutra, Lei Li, Christos Faloutsos. RolX: Structural Role Extraction & Mining in Large Graphs.
ACM SIGKDD, Beijing, China, Aug. 2012.
[]
[code - implementation 1] [code - implementation 2] [code - implementation 3]
Implemented in US gov software systems (e.g., IBM System G Graph Analytics) and in the Stanford Network Analysis Platform (SNAP).
- Danai Koutra, Tai-You Ke, U Kang, Duen Horng (Polo) Chau, Hsing-Kuo Kenneth Pao, and Christos Faloutsos. Unifying Guilt-by-Association Approaches: Theorems and Fast Algorithms. ECML PKDD, Athens, Greece, Sep. 2011.
[] [slides] [code] [poster]
Taught in graduate courses: CMU at Tepper School of Business (47-953), Rutgers University (CS 16:198:672).
Other Publications
- Tara Safavi, Danai Koutra. Generating Negative Commonsense Knowledge. Knowledge Representation & Reasoning Meets Machine Learning Workshop (NeurIPS KR22ML'20), poster, December 2020.
- Junchen Jin, Mark Heimann, Di Jin, Danai Koutra. Understanding and Evaluating Structural
Node Embeddings. ACM SIGKDD Workshop on Mining and Learning with Graphs (SIGKDD
MLG'20), August 2020. [video]
- Puja Trivedi, Alican Büyükcakir, Yin Lin, Yinlong Qian, Di Jin , Danai Koutra. On Structural
vs. Proximity-based Temporal Node Embeddings. ACM SIGKDD Workshop on Mining and
Learning with Graphs (SIGKDD MLG'20). August 2020. [video]
- Marlena Duda, Chandra Sripada, Danai Koutra. Data-Driven Approaches for Investigating
Functional Connectivity Dynamics in Resting State fMRI. Advanced Computational Neuroscience Network (ACNN'19) Big Data Neuroscience Workshop, poster, September 2019. Best poster award.
- Lukas Faber, Tara Safavi, Davide Mottin, Emmanuel Muller, Danai Koutra. Adaptive Personalized Knowledge Graph Summarization. KDD Workshop on Mining and Learning with Graphs (MLG), August 2018.
- Saba Al-Sayouri, Ekta Gujral, Danai Koutra, Evangelos Papalexakis, Sarah Lam. t-PINE: Tensor-based Predictable and Interpretable Node Embeddings. KDD Workshop on Mining and Learning with Graphs (MLG), August 2018.
- Jie Song, Danai Koutra, Murali Mani, H.V. Jagadish. GeoFlux: Hands-Off Data Integration Leveraging Join Key Knowledge. ACM SIGMOD, 2018 (demo).
- Mark Heimann, Danai Koutra. On Generalizing Neural Node Embedding Methods to Multi-Network Problems. ACM SIGKDD Workshop on Mining and Learning with Graphs (MLG), 2017.
- Saba A Syouri, Pravallika Devineni, Sarah Lam, Vagelis Papalexakis and Danai Koutra. GECS: Graph Embedding Using Connection Subgraphs. ACM SIGKDD Workshop on Mining and Learning with Graphs (MLG), 2017.
- Lisa Jin, Danai Koutra. ECOviz: Comparative Visualization of Time-Evolving Network Summaries. ACM SIGKDD IDEA workshop 2017.
[code] [poster]
- Yike Liu, Tara Safavi, Neil Shah, Danai Koutra. Reducing Million-Node Graphs to a Few Structural Patterns: A Unified Approach. KDD Workshop on Mining and Learning with Graphs (MLG), August 2016.
- Di Jin, Christos Faloutsos, Danai Koutra, Ticha Sethapakdi. PERSEUS3: Visualizing and Interactively Mining Large-Scale Graphs. KDD Workshop on Mining and Learning with Graphs (MLG), August 2016.
- Sai Gouravajhala, Danai Koutra, Walter S. Lasecki. Towards Crowd-Assisted Data Mining. CHI Workshop on Human Centred Machine Learning (HCML), May 2016.
- Sai Gouravajhala, Danai Koutra, Walter S. Lasecki. Towards Crowd-Assisted Data Mining. CHI Workshop on Human Centred Machine Learning (HCML), May 2016.
- Yike Liu, Neil Shah, Danai Koutra. An Empirical Comparison of the Summarization Power of Graph Clustering Methods. NIPS Networks in the Social and Information Sciences Workshop, December 2015.
- Danai Koutra. Exploring and Making Sense of Large Graphs. Dissertation, CMU, August 2015.
Winner of the 2016 ACM SIGKDD Dissertation Award.
Honorable Mention for the SCS Doctoral Dissertation Award.
Nominated to ACM for the Doctoral Dissertation Award.
- Danai Koutra, Di Jin, Yuanchi Ning, Christos Faloutsos. Perseus: An Interactive Large-Scale Graph Mining and Visualization Tool.Proceedings of the VLDB Endowment (VLDB'15 Demo), August 2015
- Danai Koutra, Paul N. Bennett, Eric Horvitz. Influences of a Shocking News Event on Web Browsing. SIGIR 2014 Workshop on Temporal, Social and Spatially-aware Information Access (TAIA'14), July 2014.
[]
[slides]
- Danai Koutra. Large Graph Mining and Sense-making. Thesis proposal, CMU, March 2014.
[]
- Michele Berlingerio, Danai Koutra, Tina Eliassi-Rad, Christos Faloutsos. NetSimile: A Scalable Approach to Size-Independent Network Similarity. WIN 2012, Workshop on Information in Networks, New York, NY, Sept. 2012. (presentation and panel discussion)
[code]
[]
- Leman Akoglu*, Duen Horng Chau*, U Kang*, Danai Koutra*, and Christos Faloutsos. Large Graph Mining System for Patterns, Anomalies & Visualization. 16th Pacific-Asia Conference, PAKDD 2012, Kuala Lumpur, Malaysia, May 2012. (demo, *: authors in alphabetical order)
[]
- Leman Akoglu*, Duen Horng Chau*, U Kang*, Danai Koutra*, and Christos Faloutsos. OPAvion: Mining and visualization in large graphs. ACM SIGMOD Conference 2012, Scottsdale, Arizona, USA, May 2012. (demo paper, *: authors in alphabetical order)
[]
- Michele Berlingerio, Danai Koutra, Tina Eliassi-Rad, Christos Faloutsos. A Scalable Approach to Size-Independent Network Similarity.NIPS 2012, Workshop on Social Network and Social Media Analysis, Methods, Models, and Applications, Lake Tahoe, NV, Dec. 2012.
[]
[poster]
- Danai Koutra. Approximate sequence matching with MapReduce. Diploma Thesis, NTUA, Jul. 2010.
[] [slides]
EECS 476 - Data Mining (~60 students), Winter 2020, University of Michigan, Ann Arbor.
EECS 576: Advanced Data Mining, Fall 2019, University of Michigan, Ann Arbor.
EECS 598-008: Advanced Data Mining, Winter 2019, University of Michigan, Ann Arbor.
EECS 498-001 - Data Mining (~60 students), Fall 2018, University of Michigan, Ann Arbor.
EECS 598-008: Mining Large-Scale Graph Data (65 students), Winter 2018, University of Michigan, Ann Arbor.
EECS 498-001 - Data Mining (new course!) (63 students), Fall 2017, University of Michigan, Ann Arbor.
Summarizing Large-Scale Graph Data: Algorithms, Applications and Open Challenges.
SIAM SDM 2017, Houston, TX, April 2017.
EECS 484 - Database Management Systems (283 students), Winter 2017, University of Michigan, Ann Arbor.
EECS 598-004: Mining Large-Scale Graph Data (34 students), Fall 2016, University of Michigan, Ann Arbor.
EECS 484 - Database Management Systems (120 students), Winter 2016, University of Michigan, Ann Arbor.
EECS 598 - Graph Mining and Exploration at Scale: Methods and Applications (23 students), Fall 2015, University of Michigan, Ann Arbor.
Node and graph similarity: Theory and Applications.
With Tina Eliassi-Rad and Christos Faloutsos. IEEE ICDM 2014, Shenzen, China, December 2014. (acceptance ratio: 22%)
Node similarity, graph similarity and matching: Theory and Applications.
With Tina Eliassi-Rad and Christos Faloutsos. SDM 2014, Philadelphia, PA, April 2014. (over 100 researchers attended!)
15-415 Database Applications: TA, Spring 2013 -- Instructor: Christos Faloutsos.
15-381 Artificial Intelligence: Representation and Problem Solving: TA, Fall 2012 -- Instructors: Ariel Procaccia and Emma Brunskill.
Our code and other resources can be found on our github repository. If what you're looking for is not there, feel free to email us directly.
Input: csv file with (x,y,value) triplets
Output: heatmap for scatter data in log-log scale
Code: heatmap.rar
Input: tab-separated file with one observation
per line (each column corresponds to a feature)
Output: the 1D distribution for each feature
all the pairwise 2D distributions
Code: distributionPlots.zip
Code: setEnv.sh
Danai Koutra is an Associate Professor in Computer Science and Engineering at the University of Michigan, where she leads the Graph Exploration and Mining at Scale (GEMS) Lab. She is also an Amazon Scholar. Her research focuses on principled, practical, and scalable methods for large-scale real networks, and her interests include graph learning, graph neural networks, graph summarization, knowledge graph mining, graph learning, similarity and alignment, and anomaly detection. She has won a Presidential Early Career Award for Scientists and Engineers (PECASE), an NSF CAREER award, an ARO Young Investigator award, the 2024 IBM Early Career Data Mining Research Award, the 2023 Tao Li Award, the 2020 SIGKDD Rising Star Award, research faculty awards from Google, Amazon, Facebook and Adobe, a Precision Health Investigator award, the 2016 ACM SIGKDD Dissertation award, and an honorable mention for the SCS Doctoral Dissertation Award (CMU). She holds a patent on bipartite graph alignment, and has multiple papers in top data mining conferences, including 9 award-winning papers and the 2022 IEEE ICDM Test-of-Time Award. She is Program co-Chair for ACM KDD 2024 and an Associate Editor of ACM TKDD. She was a Program co-Chair for ECML/PKDD 2023, a track co-chair for The Web Conference 2022, a co-chair of the Deep Learning Day at KDD 2022, the Secretary of the new SIAG on Data Science in 2021, and has also routinely served in the organizing committees of all the major data mining conferences. She has worked at IBM, Microsoft Research, and Technicolor Research. She earned her Ph.D. and M.S. in Computer Science from CMU, and her diploma in Electrical and Computer Engineering at the National Technical University of Athens.
|