Email: yuchenli AT smu DOT edu.sg
Address: #4041, 80 Stamford Rd, Singapore 178902
Phone: +(65) 68289614
Yuchen is an Associate Professor at the School of Computing and Information System (SCIS) in the Singapore Management University (SMU). Before joining SMU, he was a research fellow at the National University of Singapore (NUS). He received his Ph.D degree in computer science from NUS in 2017. His major research interests are social analytics and high performance graph mining. Besides, he is also acitvely working on fintech with large language models (LLMs).
HOT We are actively looking for talented and highly motivated Ph.D. students to work on large scale graph learning and fintech with LLMs. If you are interested, please find information about the SCIS PhD Program and send an email to me (full scholarship and monthly stipends available at here).
Competitive analysis is a strategic process designed to evaluate competition across various entities, such as companies, workers, and products. Central to this analysis is the identification of "Outstanding Facts" (OFs), which are significant insights highlighting areas where entities distinctly outperform their competitors. Traditionally, the discovery of OFs has relied on predefined data schemas and rule-based methods. However, these approaches are hampered by significant limitations, including (i) a restricted scope of competition that can be identified and (ii) persistent explainability issues regarding why such OFs are relevant to end users. Consequently, extensive human expertise and a costly manual evaluation process are required before OFs can be effectively utilized in downstream analysis. To overcome these challenges, this proposal aims to leverage Large Language Models (LLMs) to develop a next-generation system for the automated discovery and evaluation of OFs. By doing so, we intend to transform competitive analysis through more dynamic, accurate, and comprehensive assessments.
Graphs are powerful structures to represent complex dependencies in the data. Large scale graph processing remains a fundamental challenge, due to its compute-intensive as well as data-intensive nature. There are emerging interests in adopting massively parallel GPUs to speedup graph processing. Nevertheless, existing GPU solutions focus on static, homogeneous graphs and simple graph algorithms, e.g., BFS, shortest path, connected components and etc. In this project, we explore using GPUs for more complex processing tasks (e.g., motif discovery and deep learning on graphs), with emphasis on heterogeneous and dynamic graphs.
Massive amount of information has been continuously generated on social media streams. Misinformation posted in social media messages leads to undesirable consequences and can be very difficult to correct since people make casual inferences based on available information about a given event or outcome. A computational fact-checking engine is much needed to identify false statements. Existing approaches either uses knowledge graph to semantically fact-check statements but suffers from performance issue, or completely ignores the semantic meaning and solely focus on viral patterns to determine problematic statements. In this project, we are planning to develop an efficient engine on top of the knowledge graph to fact-check the massive social media streams.
Hanhua Xiao, Yuchen Li, Yanhao Wang, Panagiotis Karras, Kyriakos Mouratidis, Natalia-Rozalia Avlona.
How to Avoid Jumping to Conclusions: Measuring the Robustness of Outstanding Facts in Knowledge Graphs. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024. |
Siyuan Yao, Yuchen Li, Shixuan Sun, Jiaxin Jiang, Bingsheng He.
uBlade: Efficient Batch Processing for Uncertainty Graph Queries. ACM SIGMOD Conference on Management of Data (SIGMOD), 2024. |
Chang Ye, Yuchen Li, Shixuan Sun, Wentian Guo.
gSWORD: GPU-accelerated Sampling for Subgraph Counting. ACM SIGMOD Conference on Management of Data (SIGMOD), 2024. |
Yudong Niu, Yuchen Li, Panagiotis Karras, Yanhao Wang, Zhao Li.
Discovering Personalized Characteristic Communities in Attributed Graphs. IEEE International Conference on Data Engineering (ICDE), 2024. |
Sha Wang, Yuchen Li, Hanhua Xiao, Zhifeng Bao, Lambert Deng, Yanfei Dong.
Enabling Roll-up and Drill-down Operations in News Exploration with Knowledge Graphs for Due Diligence and Risk Management (Industry). IEEE International Conference on Data Engineering (ICDE), 2024. |
Yanhao Wang, Yuchen Li, Francesco Bonchi, Ying Wang.
Balancing Utility and Fairness in Submodular Maximization. International Conference on Extending Database Technology (EDBT), 2024. |
Wentiao Guo, Yuchen Li, Kian-Lee Tan.
Exploiting Reuse for GPU Subgraph Enumeration (Extended Abstract). IEEE International Conference on Data Engineering (ICDE), 2023. |
Wenwen Xia, Yuchen Li, Wentian Guo, Shenghong Li.
Efficient Navigation for Constrained Shortest Path with Adaptive Expansion Control (Oral). IEEE International Conference on Data Mining (ICDM), 2022. |
Yudong Niu, Yuchen Li, Ju Fan, Zhifeng Bao.
Local Clustering over Labeled Graphs: An Index-Free Approach. IEEE International Conference on Data Engineering (ICDE), 2022. |
Shixuan Sun, Yuhang Chen, Shengliang Lu, Bingsheng He, Yuchen Li.
ThunderRW: An In-Memory Graph Random Walk Engine. International Conference on Very Large Data Bases (VLDB), 2021. |
Yueji Yang, Yuchen Li, Panagiotis Karras, Anthony Tung.
Context-aware Outstanding Fact Mining from Knowledge Graphs. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021. |
Wenwen Xia, Yuchen Li, Jianwei Tian, Shenghong Li.
Forecasting Interaction Order on Temporal Graphs. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021. |
Chang Ye, Yuchen Li, Bingsheng He, Zhao Li, Jianling Sun.
GPU-Accelerated Graph Label Propagation for Real-Time Fraud Detection. ACM SIGMOD Conference on Management of Data (SIGMOD), 2021. |
Yipeng Zhang, Yuchen Li, Zhifeng Bao, Baihua Zheng, H. V. Jagadish.
Minimizing the Regret of an Influence Provider. ACM SIGMOD Conference on Management of Data (SIGMOD), 2021. |
Shengliang Lu, Shixuan Sun, Johns Paul, Yuchen Li, Bingsheng He.
Cache-Efficient Fork-Processing Patterns on Large Graphs. ACM SIGMOD Conference on Management of Data (SIGMOD), 2021. |
Mo Sha, Yuchen Li, Kian-Lee Tan.
Self-adaptive Graph Traversal on GPUs. ACM SIGMOD Conference on Management of Data (SIGMOD), 2021. |
Yanhao Wang, Michael Mathioudakis, Yuchen Li, Kian-Lee Tan.
Minimum Coresets for Maxima Representation of Multidimensional Data. ACM Symposium on Principles of Database Systems (PODS), 2021. |
Qiuyi Lyu, Yuchen Li, Bingsheng He, Bin Gong.
DBL: Efficient Reachability Queries on Dynamic Graphs. International Conference on Database Systems for Advanced Applications (DASFAA), 2021. |
Wenwen Xia, Yuchen Li, Jun Wu, Shenghong Li.
DeepIS: Susceptibility Estimation on Social Networks. ACM International Conference on Web Search and Data Mining (WSDM), 2021. |
Shixun Huang, Yuchen Li, Zhifeng Bao, Zhao Li.
Towards Efficient Motif-based Graph Partitioning: An Adaptive Sampling Approach. IEEE International Conference on Data Engineering (ICDE), 2021. |
Yuchen Li, Qiwei Zhu, Zheng Lyu, Zhongdong Huang, Jianling Sun.
DyCuckoo: Dynamic Hash Tables on GPUs. IEEE International Conference on Data Engineering (ICDE), 2021. |
Yueji Yang, Yuchen Li, Anthony Tung.
NewsLink: Empowering Intuitive News Search with Knowledge Graphs. IEEE International Conference on Data Engineering (ICDE), 2021. |
Yanhao Wang, Yuchen Li, Raymond Chi-Wing Wong, Kian-Lee Tan.
A Fully Dynamic Algorithm for k-Regret Minimizing Sets. IEEE International Conference on Data Engineering (ICDE), 2021. |
Shengliang Lu, Bingsheng He, Yuchen Li, Hao Fu.
Accelerating Exact Constrained Shortest Paths on GPUs. International Conference on Very Large Data Bases (VLDB), 2020. |
Jingjing Wang, Yanhao Wang, Wenjun Jiang, Yuchen Li, Kian-Lee Tan.
Efficient Sampling Algorithms for Approximate Temporal Motif Counting. ACM International Conference on Information and Knowledge Management (CIKM), 2020. |
Wentao Huang, Yuchen Li, Yuan Fang, Ju Fan, Hongxia Yang.
BiANE: Bipartite Attributed Network Embedding. ACM SIGIR Conference on Information Retrieval (SIGIR), 2020. |
Wentian Guo, Yuchen Li, Mo Sha, Bingsheng He, Xiaokui Xiao, Kian-Lee Tan.
GPU-Accelerated Subgraph Enumeration on Partitioned Graphs. ACM SIGMOD Conference on Management of Data (SIGMOD), 2020. |
Alvis Logins, Yuchen Li, Panagiotis Karras.
On the Robustness of Cascade Diffusion under Node Attacks (Short). The Web Conference (WWW), 2020. |
Hongyang Wang, Qingfei Meng, Ju Fan, Yuchen Li, Laizhong Cui, Xiaoman Zhao, Chong Peng, Gong Chen, Xiaoyong Du.
Social Influence Does Matter: User Action Prediction for In-Feed Advertising. AAAI Conference on Artificial Intelligence (AAAI), 2020. |
Yipeng Zhang, Yuchen Li, Zhifeng Bao, Songsong Mo, Ping Zhang.
Optimizing Impression Counts for Outdoor Advertising (Oral). Best Research Paper Award (Runner Up) ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019. |
Yanhao Wang, Yuchen Li, Kian-Lee Tan.
Coresets for Minimum Enclosing Balls over Sliding Windows (Oral). ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019. |
Mo Sha, Yuchen Li, Kian-Lee Tan.
GPU-based Graph Traversal on Compressed Graphs. SIGMOD reproducibility ACM SIGMOD Conference on Management of Data (SIGMOD), 2019. |
Yuchen Li, Ju Fan, George V. Ovchinnikov, Panagiotis Karras.
Maximizing Multifaceted Network Influence. IEEE International Conference on Data Engineering (ICDE), 2019. |
Yanhao Wang, Yuchen Li, Kian-Lee Tan.
Semantic and Influence-aware K-representative Queries over Social Streams. International Conference on Extending Database Technology (EDBT), 2019. |
Vincent W. Zheng, Mo Sha, Yuchen Li, Hongxia Yang, Zhenjie Zhang, Kian-Lee Tan.
Heterogeneous Embedding Propagation for Large-scale E-Commerce User Alignment (Short). IEEE International Conference on Data Mining (ICDM), 2018 |
Ping Zhang, Zhifeng Bao, Yuchen Li, Guoliang Li, Yipeng Zhang, Zhiyong Peng.
Trajectory-driven Influential Billboard Placement (Oral). Best Paper Candidate ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2018. |
Yanhao Wang, Yuchen Li, Kian-Lee Tan.
A Sliding-Window Framework for Representative Subset Selection (short). IEEE International Conference on Data Engineering (ICDE), 2018. |
Wentian Guo, Yuchen Li, Mo Sha, Kian-Lee Tan.
Parallel Personalized PageRank on Dynamic Graphs. International Conference on Very Large Data Bases (VLDB), 2017. |
Mo Sha, Yuchen Li, Bingsheng He, Kian-Lee Tan.
Accelerating Dynamic Graph Analytics on GPUs. International Conference on Very Large Data Bases (VLDB), 2017. |
Yuchen Li, Ju Fan, Dongxiang Zhang, Kian-Lee Tan.
Discovering Your Selling Points: Personalized Social Influential Tags Exploration. ACM SIGMOD Conference on Management of Data (SIGMOD), 2017. |
Yanhao Wang, Qi Fan, Yuchen Li, Kian-Lee Tan.
Real-time Influence Maximization on Dynamic Social Streams. International Conference on Very Large Data Bases (VLDB), 2017. |
Yuchen Li, Dongxiang Zhang, Ziquan Lan, Kian-Lee Tan.
Context-Aware Advertisement Recommendation for High-Speed Social News Feeding. IEEE International Conference on Data Engineering (ICDE), 2016. |
Yuchen Li, Dongxiang Zhang, Kian-Lee Tan.
Real Time Targeted Influence Maximization for Online Advertisements. International Conference on Very Large Data Bases (VLDB), 2015 |
Yuchen Li, Zhifeng Bao, Guoliang Li, Kian-Lee Tan.
Real Time Personalized Search on Social Networks. IEEE International Conference on Data Engineering (ICDE), 2015. |
Yuchen Li, Shixuan Sun, Hanhua Xiao, Chang Ye, Shengliang Lu, Bingsheng He.
A Survey on Concurrent Processing of Graph Analytical Queries: Systems and Algorithms. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024. |
Yipeng Zhang, Zhifeng Bao, Yuchen Li, Baihua Zheng, Xiaoli Wang.
From A Timeline Contact Graph to Close Contact Tracing and Infection Diffusion Intervention. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2024. |
Chang Ye, Yuchen Li, Bingsheng He, Zhao Li, Jianling Sun.
Large-Scale Graph Label Propagation on GPUs. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023. |
Wenwen Xia, Yuchen Li, Shengdong Li.
On the Substructure Countability of Graph Neural Networks. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022. |
Wenwen Xia, Yuchen Li, Shengdong Li.
Graph Neural Point Process for Temporal Interaction Prediction. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022. |
Alvis Logins, Yuchen Li, Panagiotis Karras.
On the Robustness of Diffusion in a Network under Node Attacks. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021. |
Yanhao Wang, Yuchen Li, Ju Fan, Chang Ye, Mingke Chai.
A Survey of Typical Attributed Graph Queries. World Wide Web (WWW), 2020. |
Wentiao Guo, Yuchen Li, Kian-Lee Tan.
Exploiting Reuse for GPU Subgraph Enumeration. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020. |
Ping Zhang, Zhifeng Bao, Yuchen Li, Guoliang Li, Yipeng Zhang, Zhiyong Peng.
Towards an Optimal Outdoor Advertising Placement : When a Budget Constraint Meets Moving Trajectories. ACM Transactions on Knowledge Discovery from Data (TKDD), 2020. |
Yanhao Wang, Yuchen Li, Kian-Lee Tan.
Efficient Representative Subset Selection over Sliding Windows. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018. |
Yanhao Wang, Yuchen Li, Ju Fan, Kian-Lee Tan.
Location-aware Influence Maximization over Dynamic Social Streams. ACM Transactions on Information Systems (TOIS), 2018. |
Yuchen Li, Ju Fan, Yanhao Wang, Kian-Lee Tan.
Influence Maximization on Social Graphs: a Survey. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018. |
Dongxiang Zhang, Yuchen Li, Jie Shao, Hen Tao Shen.
Augmented Keyword Search on Spatial Entity Databases. The International Journal on Very Large Data Bases (VLDBJ), 2018. |
Qi Fan, Yuchen Li, Dongxiang Zhang, Kian-Lee Tan.
Discovering Newsworthy Themes From Sequenced Data: a Step Towards Computational Journalism. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2017. |
Dongxiang Zhang, Yuchen Li, Ju Fan, Lianli Gao, Fumin Shen, Heng Tao Shen.
Processing Long Queries Against Short Text: Top-k Advertisement Matching in News Stream Applications. ACM Transactions on Information Systems (TOIS), 2017. |
Zhao Li, Pengcheng Zou, Xia Chen, Shichang Hu, Peng Zhang, Yumou Zhou, Bingsheng He, Yuchen Li, Xing Tang.
From Community Search to Community Understanding: A Multimodal Community Query Engine. Best Demo Paper Award (Runner Up) ACM International Conference on Information and Knowledge Management (CIKM), 2021. |
Zhao Li, Xia Chen, Xuming Pan, Pengcheng Zou, Yuchen Li, Guoxian Yu.
SHOAL: Large-scale Hierarchical Taxonomy via Graph-based Query Coalition in E-commerce. International Conference on Very Large Data Bases (VLDB), 2019 |
Yipeng Zhang, Zhifeng Bao, Songsong Mo, Yuchen Li, Yanghao Zhou.
ITAA: an Intelligent Trajectory-driven Outdoor Advertising Deployment Assistant. International Conference on Very Large Data Bases (VLDB), 2019 |
Mo Sha, Yuchen Li, Yanhao Wang, Wentian Guo, Kian-Lee Tan.
River: a Real-time Influence Monitoring System on Social Media Stream. IEEE International Conference on Data Mining (ICDM), 2018 |
Ju Fan, Jiarong Qiu, Yuchen Li, Qingfei Meng, Dongxiang Zhang, Guoliang Li, Kian-Lee Tan, Xiaoyong Du.
OCTOPUS: an Online Topic-Aware Influence Analysis System for Social Networks. IEEE International Conference on Data Engineering (ICDE), 2018. |