


default search action
Proceedings of the VLDB Endowment, Volume 18
Volume 18, Number 1, September 2024
- Themis Palpanas, Nesime Tatbul:

Front Matter. - Samuel Arch, Yuchen Liu, Todd C. Mowry, Jignesh M. Patel, Andrew Pavlo:

The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining. 1-13 - Milad Rezaei Hajidehi, Sraavan Sridhar, Margo I. Seltzer:

CUTTANA: Scalable Graph Partitioning for Faster Distributed Graph Databases and Analytics. 14-27 - Guido Moerkotte:

Cardinality Estimation for Having-Clauses. 28-41 - Wenqi Jiang, Marco Zeller, Roger Waleffe, Torsten Hoefler, Gustavo Alonso:

Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models. 42-52 - Zhaodonghui Li, Haitao Yuan, Huiming Wang, Gao Cong, Lidong Bing:

LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency. 53-65 - Hanfei Yu, Jacob Carter, Hao Wang, Devesh Tiwari, Jian Li, Seung-Jong Park:

Nitro: Boosting Distributed Reinforcement Learning with Serverless Computing. 66-79
Volume 18, Number 2, October 2024
- Themis Palpanas, Nesime Tatbul:

Front Matter. - Silin Zhou, Shuo Shang, Lisi Chen, Christian S. Jensen, Panos Kalnis:

RED: Effective Trajectory Representation Learning with Comprehensive Information. 80-92 - Anna Arpaci-Dusseau, Zixiang Zhou, Xuhao Chen:

Accurate and Fast Approximate Graph Pattern Mining at Scale. 93-107 - Xiu Tang, Wenhao Liu, Sai Wu, Chang Yao, Gongsheng Yuan, Shanshan Ying, Gang Chen:

QueryArtisan: Generating Data Manipulation Codes for Ad-hoc Analysis in Data Lakes. 108-116 - Yiqian Huang, Shiqi Zhang, Laks V. S. Lakshmanan, Wenqing Lin, Xiaokui Xiao, Bo Tang:

Efficient and Effective Algorithms for A Family of Influence Maximization Problems with A Matroid Constraint. 117-129 - Kyle B. Deeds, Diandre Sabale, Moe Kayali, Dan Suciu:

COLOR: A Framework for Applying Graph Coloring to Subgraph Cardinality Estimation. 130-143 - Xinle Wu, Xingjian Wu, Dalin Zhang, Miao Zhang, Chenjuan Guo, Bin Yang, Christian S. Jensen:

Fully Automated Correlated Time Series Forecasting in Minutes. 144-157 - Markos Markakis, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael J. Cafarella:

From Logs to Causal Inference: Diagnosing Large Systems. 158-172 - Xin Ai, Hao Yuan, Zeyu Ling, Qiange Wang, Yanfeng Zhang, Zhenbo Fu, Chaoyi Chen, Yu Gu, Ge Yu:

NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism. 173-186 - Yuhan Liu, Sheng Wang, Yixuan Liu, Feifei Li, Hong Chen:

Accuracy-enhanced Sparse Vector Technique with Exponential Noise and Optimal Threshold Correction. 187-199 - Lijun Chang:

Maximum Defective Clique Computation: Improved Time Complexities and Practical Performance. 200-212 - Yanni Tang, Zhuoxing Zhang, Kaiqi Zhao, Lanting Fang, Zhenhua Li, Wu Chen:

Substructure-aware Log Anomaly Detection. 213-225 - Hao Miao, Ziqiao Liu, Yan Zhao, Chenjuan Guo, Bin Yang, Kai Zheng, Christian S. Jensen:

Less is More: Efficient Time Series Dataset Condensation via Two-fold Modal Matching. 226-238 - Yunyao Cheng, Chenjuan Guo, Bin Yang, Haomin Yu, Kai Zhao, Christian S. Jensen:

A Memory Guided Transformer for Time Series Forecasting. 239-252 - Chuxuan Hu, Austin Peters, Daniel Kang:

LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data. 253-264 - Duc Kieu, Tung Kieu, Peng Han, Bin Yang, Christian S. Jensen, Bac Le:

TEAM: Topological Evolution-aware Framework for Traffic Forecasting. 265-278 - Geonho Lee, Jeongho Park, Min-Soo Kim:

Chimera: A system design of dual storage and traversal-join unified query processing for SQL/PGQ. 279-292 - Nikolai Merkel, Pierre Toussing, Ruben Mayer, Hans-Arno Jacobsen:

Can Graph Reordering Speed Up Graph Neural Network Training? An Experimental Study. 293-307 - Xiaoke Zhu, Min Xie, Ting Deng, Qi Zhang:

HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs. 308-321 - Yangfan Jiang, Xinjian Luo, Yin Yang, Xiaokui Xiao:

Calibrating Noise for Group Privacy in Subsampled Mechanisms. 322-334 - Yi Liu, Minghao Xie, Shouqian Shi, Yuanchao Xu, Heiner Litz, Chen Qian:

Outback: Fast and Communication-efficient Index for Key-Value Store on Disaggregated Memory. 335-348 - Yunhao Mao, Gengrui Zhang, Zongxin Liu, Pezhman Nasirifard, Sofia Tijanic, Hans-Arno Jacobsen:

Making CRDTs Not So Eventual. 349-362 - Shuohao Gao, Kaiqiang Yu, Shengxin Liu, Cheng Long:

Maximum k-Plex Search: An Alternated Reduction-and-Bound Method. 363-376 - Patrick Schäfer, Ulf Leser:

Discovering Leitmotifs in Multidimensional Time Series. 377-389 - Chuang Yang, Renhe Jiang, Xiaohang Xu, Chuan Xiao, Kaoru Sezaki:

SIMformer: Single-Layer Vanilla Transformer Can Learn Free-Space Trajectory Similarity. 390-398 - Junchang Wang, Manos Athanassoulis:

CUBIT: Concurrent Updatable Bitmap Indexing. 399-412 - Yongrui Zhong, Yunqing Ge, Jianbin Qin, Shuyuan Zheng, Bo Tang, Yu-Xuan Qiu, Rui Mao, Ye Yuan, Makoto Onizuka, Chuan Xiao:

Privacy-Enhanced Database Synthesis for Benchmark Publishing. 413-425 - Kijae Hong, Kyoungmin Kim, Young-Koo Lee, Yang-Sae Moon, Sourav S. Bhowmick, Wook-Shin Han:

Themis: A GPU-accelerated Relational Query Execution Engine. 426-438 - Shunit Agmon, Amir Gilad, Brit Youngmann, Shahar Zoarets, Benny Kimelfeld:

Finding Convincing Views to Endorse a Claim. 439-452 - Yumeng Song, Yu Gu, Tianyi Li, Yushuai Li, Christian S. Jensen, Ge Yu:

Quantifying Point Contributions: A Lightweight Framework for Efficient and Effective Query-Driven Trajectory Simplification. 453-465 - Liwei Deng, Tianfu Wang, Yan Zhao, Kai Zheng:

MILLION: A General Multi-Objective Framework with Controllable Risk for Portfolio Management. 466-474 - Yuxi Liu, Fangzhu Shen, Kushagra Ghosh, Amir Gilad, Benny Kimelfeld, Sudeepa Roy:

The Cost of Representation by Subset Repairs. 475-487 - Mark Zhao, Emanuel Adamiak, Christos Kozyrakis:

cedar: Optimized and Unified Machine Learning Input Data Pipelines. 488-502 - Monil Mukesh Sanghavi, Ming-May Hu, Zhenxiao Luo, Xiao Li, Kapil Bajaj:

Goku: A Schemaless Time Series Database for Large Scale Monitoring at Pinterest. 503-515
Volume 18, Number 3, November 2024
- Themis Palpanas, Nesime Tatbul:

Front Matter. - Zhangcheng Qiang, Weiqing Wang, Kerry Taylor:

Agent-OM: Leveraging LLM Agents for Ontology Matching. 516-529 - Xue Li, Weibin Zeng, Zhibin Wang, Diwen Zhu, Jingbo Xu, Wenyuan Yu, Jingren Zhou:

GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes. 530-543 - Hai Lan, Shixun Huang, Zhifeng Bao, Renata Borovica-Gajic:

Cardinality Estimation for Similarity Search on High-Dimensional Data Objects: The Impact of Reference Objects. 544-556 - Seonho Lee, Yeunjun Lee, Kunsoo Park:

Efficient Top-k Frequent Subgraph Mining Using Tight Upper and Lower Bounds. 557-570 - Hao Liu, Qianwen Yang, Taoyong Cui, Wei Wang:

MSGNN: Masked Schema based Graph Neural Networks. 571-584 - Jawad Tahir, Ruben Mayer, Christoph Doblander, Hans-Arno Jacobsen:

How Reliable Are Streams? End-to-End Processing-Guarantee Validation and Performance Benchmarking of Stream Processing Systems. 585-598 - Yinnian Lin, Lei Zou, Xunbin Su:

Towards Sufficient GPU-accelerated Dynamic Graph Management: Survey and Experiment. 599-612 - Zhuocheng Shang, Samriddhi Singla, Ahmed Eldawy, Elia Scudiero:

RDPro: Distributed Processing of Big Raster Data. 613-622 - Qi Zhang, Yalong Zhang, Ronghua Li, Guoren Wang:

Approximate Anchored Densest Subgraph Search on Large Static and Dynamic Graphs. 623-636 - Tianjing Zeng, Junwei Lan, Jiahong Ma, Wenqing Wei, Rong Zhu, Pengfei Li, Bolin Ding, Defu Lian, Zhewei Wei, Jingren Zhou:

PRICE: A Pretrained Model for Cross-Database Cardinality Estimation. 637-650 - Thomas Gilray, Arash Sahebolamri, Yihao Sun, Sowmith Kunapaneni, Sidharth Kumar, Kristopher K. Micinski:

Datalog with First-Class Facts. 651-665 - Junhao Zhu, Tao Wang, Danlei Hu, Ziquan Fang, Lu Chen, Yunjun Gao, Tianyi Li, Christian S. Jensen:

T-Assess: An Efficient Data Quality Assessment System Tailored for Trajectory Data. 666-674 - Junhao Ye, Jiahui Li, Lu Chen, Yuren Mao, Yunjun Gao, Tianyi Li:

LEAP: A Low-cost Spark SQL Query Optimizer using Pairwise Comparison. 675-687 - Xinle Cao, Weiqi Feng, Jian Liu, Jinjin Zhou, Wenjing Fang, Lei Wang, Quanqing Xu, Chuanhui Yang, Kui Ren:

Towards Practical Oblivious Map. 688-701 - Chaoyi Ruan, Yingqiang Zhang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Hao Chen, Jie Zhou, Feifei Li, Xinjun Yang:

PolyBase: Adapting to Data Affinity Changes in Geo-Replicated Database via Row-Level Paxos-Group Affiliation Re-Assignment. 702-714 - Wenfei Fan, Lihang Fan, Dandan Lin, Min Xie:

Explaining GNN-based Recommendations in Logic. 715-728 - Haozhe Yin, Kai Wang, Wenjie Zhang, Ying Zhang, Ruijia Wu, Xuemin Lin:

Efficient Computation of Hyper-triangles on Hypergraphs. 729-742 - Yuwei Huang, Guoliang Li:

Laser: Buffer-Aware Learned Query Scheduling in Master-Standby Databases. 743-755 - Yang Liu, Wenfei Fan, Shuhao Liu, Xiaoke Zhu, Jianxin Li:

A Single Machine System for Querying Big Graphs with PRAM. 756-769 - Jiajia Li, Yongzhi Chen, Mengxuan Zhang, Lei Li:

A CPU-GPU Hybrid Labelling Algorithm for Massive Shortest Distance Queries on Road Networks. 770-783 - Fei Ye, Zikang Liu, Xi Zhang, Yinan Jing, Zhenying He, Yuxin Che, Haoran Xiong, Kai Zhang, X. Sean Wang:

SDEcho: Efficient Explanation of Aggregated Sequence Difference. 784-797 - Qideng Tang, Chaofan Dai, Yahui Wu, Haohao Zhou:

MLP-Mixer based Masked Autoencoders Are Effective, Explainable and Robust for Time Series Anomaly Detection. 798-811 - Liwei Deng, Penghao Chen, Ximu Zeng, Tianfu Wang, Yan Zhao, Kai Zheng:

Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search. 812-821 - Shijie Zhang, Ru Cheng, Xinpeng Liu, Jiang Xiao, Hai Jin, Bo Li:

Seer: Accelerating Blockchain Transaction Execution by Fine-Grained Branch Prediction. 822-835 - Shangdi Yu, Jessica Shi, Jamison Meindl, David Eisenstat, Xiaoen Ju, Sasan Tavakkol, Laxman Dhulipala, Jakub Lacki, Vahab Mirrokni, Julian Shun:

The ParClusterers Benchmark Suite (PCBS): A Fine-Grained Analysis of Scalable Graph Clustering. 836-849 - Shuang Liu, Chenglin Tian, Jun Sun, Ruifeng Wang, Wei Lu, Yongxin Zhao, Yinxing Xue, Junjie Wang, Xiaoyong Du:

Semantic Conformance Testing of Relational DBMS. 850-862 - Songsong Mo, Yue Zhao, Zhifeng Bao, Quanqing Xu, Chuanhui Yang, Gao Cong:

RankPQO: Learning-to-Rank for Parametric Query Optimization. 863-875 - Aviv Hadar, Tova Milo, Kathy Razmadze:

Datamap-Driven Tabular Coreset Selection for Classifier Training. 876-888 - Quinten De Man, Laxman Dhulipala, Adam Karczmarz, Jakub Lacki, Julian Shun, Zhongqi Wang:

Towards Scalable and Practical Batch-Dynamic Connectivity. 889-901 - Shengquan Ni, Yicong Huang, Zuozhi Wang, Chen Li:

IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems. 902-914 - Ge Lee, Shixun Huang, Zhifeng Bao, Yanchang Zhao:

Representative Time Series Discovery for Data Exploration. 915-928
Volume 18, Number 4, December 2024
- Manos Athanassoulis, Ioana Manolescu, Beng Chin Ooi, Themis Palpanas, Nesime Tatbul:

Front Matter. - Yifan Song, Xiaolong Chen, Wenqing Lin, Jia Li, Chen Zhang, Yan Zhou, Lei Chen, Jing Tang:

Efficient Graph Embedding Generation and Update for Large-Scale Temporal Graph. 929-942 - Haneen Mohammed, Eugene Wu, Alexander Yao, Charlie Summers, Lampros Flokas, Gromit Yeuk-Yin Chan, Subrata Mitra, Hongbin Zhong:

FaDE: More Than a Million What-ifs Per Second. 943-955 - Yuxin Yang, Hongkuan Zhou, Rajgopal Kannan, Viktor K. Prasanna:

Towards Ideal Temporal Graph Neural Networks: Evaluations and Conclusions after 10,000 GPU Hours. 956-969 - Zhaoheng Li, Supawit Chockchowwat, Areet Sheth, Yongjoo Park, Ribhav Sahu:

Kishu: Time-Traveling for Computational Notebooks. 970-985 - Ruiyao Ma, Yifan Zhu, Baihua Zheng, Lu Chen, Congcong Ge, Yunjun Gao:

GTI: Graph-based Tree Index with Logarithm Updates for Nearest Neighbor Search in High-Dimensional Spaces. 986-999 - Youri Kaminsky, Eduardo H. M. Pena, Felix Naumann:

Incremental Detection of Denial Constraint Violations. 1000-1012 - Zhihao Chang, Linzhu Yu, Huan Li, Sai Wu, Gang Chen, Dongxiang Zhang:

Revisiting CNNs for Trajectory Similarity Learning. 1013-1021 - Deming Chu, Zhizhi Gao, Fan Zhang, Wenjie Zhang, Xuemin Lin, Zhihong Tian:

Most Similar Biclique Search at Scale. 1022-1034 - Chenghong Wang, Lina Qiu, Johes Bater, Yukui Luo:

SPECIAL: Synopsis Assisted Secure Collaborative Analytics. 1035-1048 - Qingyin Lin, Jiangsu Du, Rui Li, Zhiguang Chen, Wenguang Chen, Nong Xiao:

IncrCP: Decomposing and Orchestrating Incremental Checkpoints for Effective Recommendation Model Training. 1049-1062 - Naiqing Guan, Nick Koudas:

WeShap: Weak Supervision Source Evaluation with Shapley Values. 1063-1076 - Weiping Yu, Fan Wang, Xuwei Zhang, Siqiang Luo:

Are Joins over LSM-trees Ready: Take RocksDB as an Example. 1077-1090 - Zheng Wu, Xuliang Zhu, Yixiang Fang, Jianliang Xu, Xin Huang:

Interactive Graph Search for Multiple Targets on DAGs. 1091-1103 - Xianghong Xu, Tieying Zhang, Xiao He, Haoyang Li, Rong Kang, Wang Shuai, Linhui Xu, Zhimin Liang, Shangyu Luo, Lei Zhang, Jianjun Chen:

AdaNDV: Adaptive Number of Distinct Value Estimation via Learning to Select and Fuse Estimators. 1104-1117 - Anqi Liang, Pengcheng Zhang, Bin Yao, Zhongpu Chen, Yitong Song, Guangxu Cheng:

UNIFY: Unified Index for Range Filtered Approximate Nearest Neighbors Search. 1118-1130 - Yingli Zhou, Qingshuo Guo, Yi Yang, Yixiang Fang, Chenhao Ma, Laks V. S. Lakshmanan:

In-depth Analysis of Densest Subgraph Discovery in a Unified Framework. 1131-1144 - Fuheng Zhao, Shaleen Deep, Fotis Psallidas, Avrilia Floratou, Divy Agrawal, Amr El Abbadi:

Sphinteract: Resolving Ambiguities in NL2SQL Through User Interaction. 1145-1158 - Zhihao Zhuang, Yingying Zhang, Kai Zhao, Chenjuan Guo, Bin Yang, Qingsong Wen, Lunting Fan:

Noise Matters: Cross Contrastive Learning for Flink Anomaly Detection. 1159-1168 - Biao Ouyang, Yingying Zhang, Hanyin Cheng, Yang Shu, Chenjuan Guo, Bin Yang, Qingsong Wen, Lunting Fan, Christian S. Jensen:

RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems. 1169-1182 - Hassan Abdallah, Béatrice Markhoff, Arnaud Soulet:

Ranking Indicator Discovery from Very Large Knowledge Graphs. 1183-1195 - Saurabh Bajaj, Hui Guan, Marco Serafini, Juelin Liu, Hojae Son:

Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch. 1196-1209 - Qingdong Su, Zhikang Wang, Zijing Tan, Shuai Ma:

Discovering Approximate Inclusion Dependencies. 1210-1222 - Zhutao Zhuang, Zhiguang Chen, Xinqi Zeng:

DumpKV: Learning based lifetime aware garbage collection for key value separation in LSM-tree. 1223-1236 - Boyu Zhang, He Huang, Yu-E Sun, Guoju Gao:

RGS-Sketch: An Accurate, Invertible, and Mergeable Sketch for Online Super Spreader Detection in High-speed Data Streams. 1237-1249 - Yichao Yuan, Advait Iyer, Lin Ma, Nishil Talati:

Vortex: Overcoming Memory Capacity Limitations in GPU-Accelerated Large-Scale Data Analytics. 1250-1263
Volume 18, Number 5, January 2025
- Themis Palpanas, Nesime Tatbul:

Front Matter. - Antonios Katsarakis, Vasilis Gavrielatos, Chris Jensen, Nikos Ntarmos:

Dandelion: Smaller Clusters, Bigger Speeds - Distributed Transactions Redefined. 1264-1277 - Xiaoying Wang, Wentao Wu, Vivek R. Narasayya, Surajit Chaudhuri:

Esc: An Early-Stopping Checker for Budget-aware Index Tuning. 1278-1290 - Yilei Wang, Xiangdong Zeng, Sheng Wang, Feifei Li:

Jodes: Efficient Oblivious Join in the Distributed Setting. 1291-1304 - Xunkai Li, Yinlin Zhu, Boyang Pang, Guochen Yan, Yeyu Yan, Zening Li, Zhengyu Wu, Wentao Zhang, Ronghua Li, Guoren Wang:

OpenFGL: A Comprehensive Benchmark for Federated Graph Learning. 1305-1320 - Samuele Langhi, Angela Bonifati, Riccardo Tommasini:

Evaluating Continuous Queries with Inconsistency Annotations. 1321-1334 - Zhi Wang, Ming Zhong, Yuanyuan Zhu, Tieyun Qian, Mengchi Liu, Jeffrey Xu Yu:

On More Efficiently and Versatilely Querying Historical k-Cores. 1335-1347 - Yi Li, Gao Cong:

GeoBloom: Revisiting Lightweight Models for Geographic Information Retrieval. 1348-1361 - Shabnam Ghasemirad, Si Liu, Christoph Sprenger, Luca Multazzu, David A. Basin:

VerIso: Verifiable Isolation Guarantees for Database Transactions. 1362-1375 - Yinhao Hong, Hongyao Zhao, Wei Lu, Xiaoyong Du, Yuxing Chen, Anqun Pan, Lixiong Zheng:

A Hybrid Approach to Integrating Deterministic and Non-deterministic Concurrency Control in Database Systems. 1376-1389 - Sandra Geisler, Cinzia Cappiello, Irene Celino, David Chaves-Fraga, Anastasia Dimou, Ana Iglesias-Molina, Maurizio Lenzerini, Anisa Rula, Dylan Van Assche, Sascha Welten, Maria-Esther Vidal:

From Genesis to Maturity: Managing Knowledge Graph Ecosystems Through Life Cycles. 1390-1397 - Matthias Lanzinger, Reinhard Pichler, Alexander Selzer:

Avoiding Materialisation for Guarded Aggregate Queries. 1398-1411 - Chengyang Luo, Qing Liu, Yunjun Gao, Jianliang Xu:

Synergetic Community Search over Large Multilayer Graphs. 1412-1424 - Shu Wang, Yixiang Fang, Wensheng Luo:

Searching and Detecting Structurally Similar Communities in Large Heterogeneous Information Networks. 1425-1438 - Gengrui Zhang, Shiquan Zhang, Michail Bachras, Yuqiu Zhang, Hans-Arno Jacobsen:

Cabinet: Dynamically Weighted Consensus Made Fast. 1439-1452 - Mohsen Dehghankar, Abolfazl Asudeh:

Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups. 1453-1465 - Guanhao Hou, Jinchao Huang, Fangyuan Zhang, Sibo Wang:

Efficient Concurrent Updates to Persistent Randomized Binary Search Trees. 1481-1494 - Sariel Ofek, Amit Somech:

Explaining Black-Box Clustering Pipelines With Cluster-Explorer. 1495-1508 - Jianfeng Huang, Cao Yihao, Ren Shubing, Baohua Wu, Dongjing Miao:

BACH: Bridging Adjacency List and CSR Format using LSM-Trees for HGTAP Workloads. 1509-1521 - Hua Fan, Hao Tan, Wenchao Zhou, Feifei Li:

FLEET: High-Performance Durable Replicated State Machines using Scattered and Coordinated Log Entries. 1522-1535 - Guoxin Kang, Zhongxin Ge, Jingpei Hu, Xueya Zhang, Lei Wang, Jianfeng Zhan:

BigVectorBench: Heterogeneous Data Embedding and Compound Queries are Essential in Evaluating Vector Databases. 1536-1550
Volume 18, Number 6, February 2025
- Themis Palpanas, Nesime Tatbul:

Front Matter. - Jiawei Guan, Feng Zhang, Jiesong Liu, Xiaoyong Du, Xipeng Shen:

A Systematic Study on Early Stopping Metrics in HPO and the Implications of Uncertainty. 1551-1564 - Haoying Zhang, Mariem Brahem, Nicolas Anciaux, Benjamin Nguyen, José María de Fuentes:

TELESAFE - Detecting Private/Work Boundary Crossings in Energy Consumption Trails in Telework. 1565-1578 - Yuan Chen, Ao Li, Wenhai Li, Lingfeng Deng:

FB+-tree: A Memory-Optimized B+-tree with Latch-Free Update. 1579-1592 - Shenghao Gong, Haobo Sun, Ziquan Fang, Liu Liu, Lu Chen, Yunjun Gao:

VStream: A Distributed Streaming Vector Search System. 1593-1606 - Qiuyang Mang, Jingbang Chen, Hangrui Zhou, Yu Gao, Yingli Zhou, Qingyu Shi, Richard Peng, Yixiang Fang, Chenhao Ma:

Efficient Historical Butterfly Counting in Large Temporal Bipartite Networks via Graph Structure-aware Index. 1607-1620 - Abiram Mohanaraj, Matteo Lissandrini, Katja Hose:

PlanRGCN: Predicting SPARQL Query Performance. 1621-1634 - Susan B. Davidson, Tova Milo, Kathy Razmadze, Gal Zeevi:

Holistic query Approximation via RL Modeling. 1635-1648 - Lars Gottesbüren, Laxman Dhulipala, Rajesh Jayaram, Jakub Lacki:

Unleashing Graph Partitioning for Large-Scale Nearest Neighbor Search. 1649-1662 - Jiaming Ma, Binwu Wang, Pengkun Wang, Zhengyang Zhou, Xu Wang, Yang Wang:

BiST: A Lightweight and Efficient Bi-directional Model for Spatiotemporal Prediction. 1663-1676 - Zhengxin You, Qiaomu Shen, Man Lung Yiu, Bo Tang:

QOVIS: Understanding and Diagnosing Query Optimizer via a Visualization-assisted Approach (Revision). 1677-1690 - Vincent Jacob, Yanlei Diao:

Unsupervised Anomaly Detection in Multivariate Time Series across Heterogeneous Domains. 1691-1704 - Zhenbo Fu, Xin Ai, Qiange Wang, Yanfeng Zhang, Shizhan Lu, Chaoyi Chen, Chunyu Cao, Hao Yuan, Zhewei Wei, Yu Gu, Yingyou Wen, Ge Yu:

NeutronTask: Scalable and Efficient Multi-GPU GNN Training with Task Parallelism. 1705-1719 - Rihan Hai, Shih-Han Hung, Tim Coopmans, Tim Littau, Floris Geerts:

Quantum Data Management in the NISQ Era. 1720-1729 - Yunjia Zheng, Charlotte Sacré, Mohanna Shahrad, Owen Lipchitz, Yuting Gu, Bettina Kemme:

G-View: View Management for Graph Databases. 1730-1742 - Rong Du, Qingqing Ye, Yue Fu, Haibo Hu:

Privacy for Free: Leveraging Local Differential Privacy Perturbed Data from Multiple Services. 1743-1755 - Haoze Song, Yongqi Wang, Xusheng Chen, Hao Feng, Yazhi Feng, Xieyun Fang, Heming Cui, Linghe Kong:

K2: On Optimizing Distributed Transactions in a Multi-region Data Store with True-time Clocks. 1756-1769 - Tingyang Chen, Cong Fu, Kun Wang, Xiangyu Ke, Yunjun Gao, Wenchao Zhou, Yabo Ni, Anxiang Zeng:

Maximum Inner Product is Query-Scaled Nearest Neighbor. 1770-1783 - Rongzhao Chen, Xiangpeng Hu, Xiangdong Huang, Chen Wang, Shaoxu Song, Jianmin Wang:

Migration-Free Elastic Storage of Time Series in Apache IoTDB. 1784-1797 - Amélie Gheerbrant, Leonid Libkin, Liat Peterfreund, Alexandra Rogova:

GQL and SQL/PGQ: Theoretical Models and Expressive Power. 1798-1810 - Peizhi Wu, Haoshu Xu, Ryan Marcus, Zack Ives:

A Practical Theory of Generalization in Selectivity Learning. 1811-1824 - Shuo Yang, Jiadong Xie, Yingfan Liu, Jeffrey Xu Yu, Xiyue Gao, Qianru Wang, Yanguo Peng, Jiangtao Cui:

Revisiting the Index Construction of Proximity Graph-Based Approximate Nearest Neighbor Search. 1825-1838 - Yijun Bei, Teng Ma, Dongxiang Zhang, Sai Wu, Kian-Lee Tan, Gang Chen:

Mining Platoon Patterns from Traffic Videos. 1839-1851 - Botong Huang, Lianggui Weng, Wei Chen, Zuozhi Wang, Kai Zeng, Chen Li, Yihui Feng, Bolin Ding, Jingren Zhou:

Agamotto: Scheduling of Deadline-Oriented Incremental Query Execution under Uncertain Resource Price. 1852-1864 - Baoqing Cai, Yu Liu, Lin Ma, Pingqi Huang, Bingcheng Lian, Ke Zhou, Jia Yuan, Jie Yang, Xiaofan Cai, Peijun Wu:

SCompression: Enhancing Database Knob Tuning Efficiency Through Slice-Based OLTP Workload Compression. 1865-1878 - Xiyue Gao, Zhuang Liu, Yiran Shen, Hui Li, Yingfan Liu, Hongjun Xiao, Yanguo Peng, Jiangtao Cui:

Fucci: Database Transaction Fuzzing via Random Conflict Construction and Multilevel Constraint Solving. 1879-1891 - Wenjing Wang, Ziyang Yue, Bolong Zheng:

Streaming Time Series Subsequence Anomaly Detection: A Glance and Focus Approach. 1892-1904 - Leilei Du, Peng Cheng, Lei Chen, Heng Tao Shen, Xuemin Lin, Wei Xi:

Infinite Stream Estimation under Personalized w-Event Privacy. 1905-1918 - Meng Wang, Gus Waldspurger, Naufal Ananda, Yuyang Huang, Kemas Wiharja, John Bent, Swaminathan Sundararaman, Vijay Chidambaram, Haryadi S. Gunawi:

GPEmu: A GPU Emulator for Faster and Cheaper Prototyping and Evaluation of Deep Learning System Research. 1919-1932 - Anna Zeng, Michael J. Cafarella, Batya Kenig, Markos Markakis, Brit Youngmann, Babak Salimi:

Causal DAG Summarization. 1933-1947 - Zhengmao Ye, Dengchun Li, Zetao Hu, Tingfeng Lan, Jian Sha, Shicong Zhang, Lei Duan, Jie Zuo, Hui Lu, Yuanchun Zhou, Mingjie Tang:

mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs. 1948-1961 - Abigale Kim, Marco Slot, David G. Andersen, Andrew Pavlo:

Anarchy in the Database: A Survey and Evaluation of Database Management System Extensibility. 1962-1976 - Longxu Sun, Xin Huang, Jiannan Wang, Jianliang Xu:

A Flexible Framework for Query-oriented Interactive Community Search. 1977-1990 - Ziyi Yan, Mohamed Farouk Drira, Tianxun Hu, Tianzheng Wang:

Tabular: Efficiently Building Efficient Indexes. 1991-2004
Volume 18, Number 7, March 2025
- Matthias Boehm, Reynold Cheng, Xin Luna Dong, Themis Palpanas, Nesime Tatbul:

Front Matter. - Yuanyuan Zeng, Yixiang Fang, Kun Chen, Yangfan Li, Chenhao Ma:

Efficient Maintenance of 2-Hop Labeling Index on Dynamic Small-World Graphs. 2005-2017 - Kexin Zhu, Michael Whittaker, Srdjan Petrovic, Robert Grandl, Sanjay Ghemawat:

Vive la Différence: Practical Diff Testing of Stateful Applications. 2018-2030 - Siyue Wu, Dingming Wu, Sinhong Cheuk, Tsz Nam Chan, Kezhong Lu:

GREAT: Generalized Reservoir Sampling based Triangle Counting Estimation over Streaming Graphs. 2031-2043 - Mengran Li, Zijing Tan, Honghui Yang, Shuai Ma:

Efficient Discovery of Relaxed Functional Dependencies. 2044-2056 - Danlei Hu, Yilin Li, Lu Chen, Ziquan Fang, Yushuai Li, Yunjun Gao, Tianyi Li:

SimRN: Trajectory Similarity Learning in Road Networks based on Distributed Deep Reinforcement Learning. 2057-2069 - Yuxiang Guo, Zhonghao Hu, Yuren Mao, Baihua Zheng, Yunjun Gao, Mingwei Zhou:

BIRDIE: Natural Language-Driven Table Discovery Using Differentiable Search Index. 2070-2083 - Shu Liu, Xiangxi Mo, Moshik Hershcovitch, Henric Zhang, Audrey Cheng, Guy Girmonsky, Gil Vernik, Michael Factor, Tiemo Bang, Soujanya Ponnapalli, Natacha Crooks, Joseph Gonzalez, Danny Harnik, Ion Stoica:

SkyStore: Cost-Optimized Object Storage Across Regions and Clouds. 2084-2096 - Tonghui Ren, Chen Ke, Yuankai Fan, Yinan Jing, Zhenying He, Kai Zhang, X. Sean Wang:

The Power of Constraints in Natural Language to SQL Translation. 2097-2111 - Yufan Sheng, Xin Cao, Kaiqi Zhao, Yixiang Fang, Jianzhong Qi, Wenjie Zhang, Christian S. Jensen:

ACE: A Cardinality Estimator for Set-Valued Queries. 2112-2125 - Xiaoying Wang, Jiannan Wang, Tianzheng Wang, Yong Zhang:

Accio: Bolt-on Query Federation. 2126-2135 - Xiaohai Dai, Chaozheng Ding, Wei Li, Jiang Xiao, Bolin Zhang, Chen Yu, Albert Y. Zomaya, Hai Jin:

Falcon: Advancing Asynchronous BFT Consensus for Lower Latency and Enhanced Throughput. 2136-2148 - Jindong Han, Hao Wang, Hui Xiong, Hao Liu:

Scalable Pre-Training of Compact Urban Spatio-Temporal Predictive Models on Large-Scale Multi-Domain Data. 2149-2158 - Xiaoyuan Liu, Ni Trieu, Trinabh Gupta, Ishtiyaque Ahmad, Dawn Song:

HADES: Range-Filtered Private Aggregation on Public Data. 2159-2171 - Zhaoxuan Ji, Xinlu Wang, Zhaojing Luo, Zhongle Xie, Meihui Zhang:

Optimized Batch Prompting for Cost-effective LLMs. 2172-2184 - Hongchao Qin, Guang Zeng, Ronghua Li, Longlong Lin, Ye Yuan, Guoren Wang:

Truss Decomposition in Hypergraphs. 2185-2197 - Jianting Zhang, Zhongtang Luo, Raghavendra Ramesh, Aniket Kate:

Optimal Sharding for Scalable Blockchains with Deconstructed SMR. 2198-2211 - Eugenie Lai, Yeye He, Surajit Chaudhuri:

Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence. 2212-2225 - Valerio Guerrini, Thibaut Germain, Charles Truong, Laurent Oudre, Paul Boniol:

Time Series Motif Discovery: A Comprehensive Evaluation. 2226-2239 - Daniel Bourgeois, Zhimin Ding, Dimitrije Jankov, Jiehui Li, Sleem Mahmoud Abdelghafar, Yuxin Tang, Jiawen Yao, Xinyu Yao, Chris Jermaine:

EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution. 2240-2253 - Ruizhong Wu, Mengxuan Zhang, Shuxin Wang, Frodo Kin-Sun Chan, Yan Nei Law, Lei Li:

Continuous Lifelong Conflict-Aware AGV Routing with Kinematic Constraints. 2254-2267 - Dawei Liu, Bolong Zheng, Ziyang Yue, Fuhao Ruan, Xiaofang Zhou, Christian S. Jensen:

Wolverine: Highly Efficient Monotonic Search Path Repair for Graph-based ANN Index Updates. 2268-2280 - Jiansen Song, Wensheng Dou, Yingying Zheng, Yu Gao, Ziyu Cui, Wei Wang, Jun Wei:

Detecting Schema-Related Logic Bugs in Relational DBMSs via Equivalent Database Construction. 2281-2294 - Weiyang Kong, Kaiqi Wu, Sen Zhang, Yubao Liu:

GraphSparseNet: a Novel Method for Large Scale Trafffic Flow Prediction. 2295-2307
Volume 18, Number 8, April 2025
- H. V. Jagadish, M. Tamer Özsu, Themis Palpanas, Nesime Tatbul:

Front Matter. - Danling Lai, Jiajie Xu, Jianfeng Qu, Pingfu Chao, Junhua Fang, Chengfei Liu:

TMLKD: Few-shot Trajectory Metric Learning via Knowledge Distillation. 2308-2320 - Jun Nemoto, Taksahi Kambayashi, Takashi Hoshino, Hideyuki Kawashima:

Oze: Decentralized Graph-based Concurrency Control for Long-running Update Transactions. 2321-2333 - Elena Milkai, Xiangyao Yu, Jignesh M. Patel:

Hermes: Off-the-Shelf Real-Time Transactional Analytics. 2334-2347 - Zeying Zhu, Jonathan Chamberlain, Kenny Wu, David Starobinski, Zaoxing Liu:

Approximation-First Timeseries Monitoring Query At Scale. 2348-2361 - Ziheng Wang, Junyu Wei, Alex Aiken, Guangyan Zhang, Jacob O. Tørring, Rain Jiang, Chenyu Jiang, Wei Xu:

LogCloud: Fast Search of Compressed Logs on Object Storage. 2362-2370 - Changlun Li, Chenyu Yang, Yuyu Luo, Ju Fan, Nan Tang:

Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation. 2371-2384 - Zhe Xie, Zeyan Li, Xiao He, Longlong Xu, Xidao Wen, Tieying Zhang, Jianjun Chen, Rui Shi, Dan Pei:

ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning. 2385-2398 - Graham Cormode, Daniel Ting:

Federated Data Shift Distance Estimation. 2399-2412 - Liese Bekkers, Frank Neven, Stijn Vansummeren, Yisu Remy Wang:

Instance-Optimal Acyclic Join Processing Without Regret: Engineering the Yannakakis Algorithm in Column Stores. 2413-2426 - Myles Thiessen, Guy Khazma, Sam Toueg, Eyal de Lara:

Asymmetric Linearizable Local Reads. 2427-2439 - Jinyang Liu, Pu Jiao, Kai Zhao, Xin Liang, Sheng Di, Franck Cappello:

QPET: A Versatile and Portable Quantity-of-Interest-preservation Framework for Error-Bounded Lossy Compression. 2440-2453 - Xinyi Zhu, Yongqi Zhang, Lei Chen:

OpenMEL: Unsupervised Multimodal Entity Linking Using Noise-Free Expanded Queries and Global Coherence. 2454-2467 - Ariane Ziehn, Jan Szlang, Steffen Zeuch, Volker Markl:

Unraveling the Impact of Window Semantics: Optimizing Join Order for Efficient Stream Processing. 2468-2481 - Luca Zecchini, Vasilis Efthymiou, Felix Naumann, Giovanni Simonini:

Deduplicated Sampling On-Demand. 2482-2495 - Theis E. Jendal, Matteo Lissandrini, Peter Dolog, Katja Hose:

The Limits of Graph Samplers for Training Inductive Recommender Systems. 2496-2504 - Zhiying Liang, Vahab Jabrayilov, Abutalib Aghayev, Aleksey Charapko:

HoliPaxos: Towards More Predictable Performance in State Machine Replication. 2505-2518 - Peizhi Wu, Rong Kang, Tieying Zhang, Jianjun Chen, Ryan Marcus, Zack Ives:

Data-Agnostic Cardinality Learning from Imperfect Workloads. 2519-2532 - Song Wang, Chen Wang, Jianchun Wang, Shengguo Li, Rui Li, Zhiyong Peng:

BLAEQ: A Multigrid Index for Spatial Query on Geometry Data. 2533-2546 - Ziyu Cui, Wensheng Dou, Yu Gao, Rui Yang, Yingying Zheng, Jiansen Song, Yuan Feng, Jun Wei:

Simple Testing Can Expose Most Critical Transaction Bugs: Understanding and Detecting Write-Specific Serializability Violations in Database Systems. 2547-2560 - Ilie Sarpe, Aristides Gionis:

Efficient and Adaptive Estimation of Local Triadic Coefficients. 2561-2574 - Wentao Zhang, Jingyuan Wang, Yifan Yang, Leong Hou U:

VecCity: A Taxonomy-guided Library for Map Entity Representation Learning [Experiment, Analysis \u0026 Benchmark]. 2575-2588 - Jerin George Mathew, Donatella Firmani, Divesh Srivastava:

Evaluating Methods for Efficient Entity Count Estimation. 2589-2601 - Audrey Cheng, Xiao Shi, Aaron N. Kabcenell, Jolene Huey, Peter Bailis, Natacha Crooks, Ion Stoica:

Fair Transaction Processing For Multi-Tenant Databases. 2602-2615 - Sheng Lin, Fangcheng Fu, Haoyang Li, Hao Ge, Xuanyu Wang, Jiawen Niu, Yaofeng Tu, Bin Cui:

LobRA: Multi-tenant Fine-tuning over Heterogeneous Data. 2616-2625 - Amin Kamali, Verena Kantere, Calisto Zuzarte, Vincent Corvinelli:

Robust Plan Evaluation based on Approximate Probabilistic Machine Learning. 2626-2638 - Saeed Fathollahzadeh, Essam Mansour, Matthias Boehm:

CatDB: Data-catalog-guided, LLM-based Generation of Data-centric ML Pipelines. 2639-2652 - Hanwen Liu, Shashank Giridhara, Ibrahim Sabek:

Conformal Prediction for Verifiable Learned Query Optimization. 2653-2666 - Neha Makhija, Wolfgang Gatterbauer:

Is Integer Linear Programming All You Need for Deletion Propagation? A Unified and Practical Approach for Generalized Deletion Propagation. 2667-2680 - Yurong Liu, Eduardo Peña, Aécio S. R. Santos, Eden Wu, Juliana Freire:

Magneto: Combining Small and Large Language Models for Schema Matching. 2681-2694 - Qiuyu Guo, Jianye Yang, Wenjie Zhang, Hanchen Wang, Ying Zhang, Xuemin Lin:

Efficient and Accurate Subgraph Counting: A Bottom-up Flow-learning based Approach. 2695-2708 - Lixiang Chen, Yuxing Han, Yu Chen, Xing Chen, Chengcheng Yang, Weining Qian:

AQETuner: Reliable Query-level Configuration Tuning for Analytical Query Engines. 2709-2721 - Jun Liu, Bingqian Du, Ziyue Luo, Sitian Lu, Qiankun Zhang, Hai Jin:

PipeTGL: (Near) Zero Bubble Memory-based Temporal Graph Neural Network Training via Pipeline Optimization. 2722-2734 - Yeounoh Chung, Gaurav Tarlok Kakkar, Yu Gan, Brenton Milne, Fatma Ozcan:

Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL. 2735-2747 - Laurens Kuiper, Paul Gross, Peter Boncz, Hannes Mühleisen:

Saving Private Hash Join. 2748-2760
Volume 18, Number 9, May 2025
- Wolfgang Lehner, Jun Yang, Themis Palpanas, Nesime Tatbul:

Front Matter. - Weixing Zhou, Yanfeng Zhang, Xinji Zhou, Zhiyou Wang, Zeshun Peng, Yang Ren, Sihao Li, Huanchen Zhang, Guoliang Li, Ge Yu:

Concurrency Control as a Service. 2761-2774 - Xiangfei Qiu, Zhe Li, Wanghui Qiu, Shiyan Hu, Lekui Zhou, Xingjian Wu, Zhengyu Li, Chenjuan Guo, Aoying Zhou, Zhenli Sheng, Jilin Hu, Christian S. Jensen, Bin Yang:

TAB: Unified Benchmarking of Time Series Anomaly Detection Methods. 2775-2789 - Yuchen Zhong, Junwei Su, Chuan Wu, Minjie Wang:

Heta: Distributed Training of Heterogeneous Graph Neural Networks. 2790-2803 - Yannis Foufoulas, Theoni Palaiologou, Alkis Simitsis:

The UDFBench Benchmark for General-purpose UDF Queries. 2804-2817 - Ye Sun, Lei Shi, Yongxin Tong:

eXpath: Explaining Knowledge Graph Link Prediction with Ontological Closed Path Rules. 2818-2830 - Emmanouil Giortamis, Antonios Katsarakis, Vasilis Gavrielatos, Pramod Bhatotia, Aleksandar Dragojevic, Boris Grot, Vijay Nagarajan, Panagiota Fatourou:

The LAW theorem: Local Reads and Linearizable Asynchronous Replication. 2831-2845 - Brecht Vandevoort, Alan D. Fekete, Bas Ketsman, Frank Neven, Stijn Vansummeren:

Using Read Promotion and Mixed Isolation Levels for Performant Yet Serializable Execution of Transaction Programs. 2846-2858 - Zeynep Korkmaz, M. Tamer Özsu, Khuzaima Daudjee:

Locality-Aware Cache Replacement Policy for Graph Traversals. 2859-2871 - Rúben Adão, Zhongjie Wu, Changjun Zhou, Oana Balmau, João Paulo, Ricardo Macedo:

Keigo: Co-designing Log-Structured Merge Key-Value Stores with a Non-Volatile, Concurrency-aware Storage Hierarchy. 2872-2885 - Qiyu Liu, Siyuan Han, Yanlin Qi, Jingshu Peng, Jin Li, Longlong Lin, Lei Chen:

Why Are Learned Indexes So Effective but Sometimes Ineffective? 2886-2898 - Falaah Arif Khan, Denys Herasymuk, Nazar Protsiv, Julia Stoyanovich:

Still More Shades of Null: An Evaluation Suite for Responsible Missing Value Imputation [Experiment, Analysis and Benchmark]. 2899-2913 - Tianji Cong, Fatemeh Nargesian, Junjie Xing, H. V. Jagadish:

OpenForge: Probabilistic Metadata Integration. 2914-2927 - Akhlaque Ahmad, Da Yan, Xiao Chen, Lyuheng Yuan, Qin Zhang, Saugat Adhikari:

Maximum k-Plex Finding: Choices of Pruning Techniques Matter! 2928-2940 - Xiaoxuan Gou, Weiguo Zheng, Yuxiang Wang, Xiaoliang Xu, Zhiyuan Yu:

A Comprehensive Survey and Experimental Study of Learning-based Community Search. 2941-2954 - Weizheng Lu, Chao Hui, Yunhai Wang, Feng Zhang, Yueguo Chen, Bao Liu, Chengjie Li, Zhaoxin Wu, Xuye Qin:

Decentralized Actor Scheduling and Reference-based Storage in Xorbits: a Native Scalable Data Science Engine. 2955-2963 - Tao Kong, Hui Li, Yuxuan Zhao, Liping Li, Xiyue Gao, Qilong Wu, Jiangtao Cui:

STsCache: An Efficient Semantic Caching Scheme for Time-series Data Workloads Based on Hybrid Storage. 2964-2977 - Ruihong Wang, Jianguo Wang, Walid G. Aref:

Cache Coherence Over Disaggregated Memory. 2978-2991 - Ruchi Bhoot, Tuhin Khare, Manoj Agarwal, Siddharth D. Jaiswal, Yogesh Simmhan:

Triparts: Scalable Streaming Graph Partitioning to Enhance Community Structure. 2992-3006 - Shipeng Qi, Bing Tong, Jiatao Hu, Heng Lin, Yue Pang, Wei Yuan, Songlin Lyu, Zhihui Guo, Ke Huang, Xujin Ba, Qiang Yin, Youren Shen, Yan Zhou, Tao Lv, Jia Li, Lei Zou, Yongwei Wu, Gábor Szárnyas, Xiaowei Zhu, Wenguang Chen, Chuntao Hong:

The LDBC Financial Benchmark: Transaction Workload. 3007-3020 - Donghyun Sohn, Kelly Jiang, Nicolas Hammer, Jennie Rogers:

Alchemy: A Query Optimization Framework for Oblivious SQL. 3021-3034 - Shreya Shankar, Tristan Chambers, Tarak Shah, Aditya G. Parameswaran, Eugene Wu:

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing. 3035-3048 - Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi:

HAKES: Scalable Vector Database for Embedding Search Service. 3049-3062 - Zhengdong Wang, Qiang Yin, Longbin Lai:

Path-centric Cardinality Estimation for Subgraph Matching. 3063-3076 - Hong Lin, Shixin Wan, Zhongle Xie, Ke Chen, Meihui Zhang, Lidan Shou, Gang Chen:

A Comprehensive Study of Shapley Value in Data Analytics. 3077-3092 - Longjiao Zhang, Rui Wang, Tongya Zheng, Ziqi Huang, Wenjie Huang, Xinyu Wang, Can Wang, Mingli Song, Sai Wu, Shuibing He:

Effective and Efficient Distributed Temporal Graph Learning through Hotspot Memory Sharing. 3093-3105 - Riddho R. Haque, Anh L. Mai, Matteo Brucato, Azza Abouzied, Peter J. Haas, Alexandra Meliou:

Stochastic SketchRefine: Scaling In-Database Decision-Making under Uncertainty to Millions of Tuples. 3106-3118 - Marcel Weisgut, Daniel Ritter, Pinar Tözün, Lawrence Benson, Tilmann Rabl:

CXL Memory Performance for In-Memory Data Processing. 3119-3133 - Fei Teng, Haoyang Li, Lei Chen:

LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation. 3134-3148 - Vinh Quang Ngo, Marina Papatriantafilou:

Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing. 3149-3161 - Qian Zhang, Yiwen Xiang, Jianhao Wei, Yang Yang, Yifan Li, Xueqing Gong, Wanggen Liu:

Rebirth-Retire: A Concurrency Control Protocol Adaptable to Different Levels of Contention. 3162-3174 - Ruikun Li, Dai Shi, Ye Xiao, Junbin Gao:

UFGTime: Mining Intertwined Dependencies in Multivariate Time Series via an Efficient Pure Graph Approach (Flavor: Foundations and Algorithms Papers). 3175-3188 - Ruochen Jiang, Spyros Blanas:

ArrayMorph: Optimizing Hyperslab Queries on the Cloud for Machine Learning Pipelines. 3189-3202 - Yangxin Fan, Haolai Che, Yinghui Wu:

Inference-friendly Graph Compression for Graph Neural Networks. 3203-3215 - Sacheendra Talluri, Guido Walter Di Donato, Luca Danelutti, Koen Vlaswinkel, Marco Arnaboldi, Arnaud Delamare, Marco Domenico Santambrogio, Daniele Bonetta:

GpJSON: High-performance JSON Data Processing on GPUs. 3216-3229 - Antonio Ferrara, David García-Soriano, Francesco Bonchi:

Beyond Shortest Paths: Node Fairness in Route Recommendation. 3230-3242
Volume 18, Number 10, June 2025
- Themis Palpanas, Divesh Srivastava, Nesime Tatbul:

Front Matter. - Yin Li, Sharad Mehrotra, Shantanu Sharma, Komal Kumari:

Access Control for Information-Theoretically Secure Data. 3243-3255 - Zhencan Peng, Miao Qiao, Wenchao Zhou, Feifei Li, Dong Deng:

Dynamic Range-Filtering Approximate Nearest Neighbor Search. 3256-3268 - Yukun Cao, Zengyi Gao, Zhiyang Li, Xike Xie, S. Kevin Zhou, Jianliang Xu:

LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration. 3269-3283 - Jinwoo Hwang, Daeun Kim, Sangyeop Lee, Yoonsung Kim, Guseul Heo, Hojoon Kim, Yunseok Jeong, Tadiwos Meaza, Eunhyeok Park, Jeongseob Ahn, Jongse Park:

Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse. 3284-3298 - Mihail Stoian, Andreas Zimmerer, Skander Krid, Amadou Ngom, Jialin Ding, Tim Kraska, Andreas Kipf:

Parachute: Single-Pass Bi-Directional Information Passing. 3299-3311 - Jiani Yang, Sai Wu, Yong Wang, Dongxiang Zhang, Yifei Liu, Xiu Tang, Gang Chen:

Twisted Twin: A Collaborative and Competitive Memory Management Approach in HTAP Systems. 3312-3325 - Muhammad Farhan, Henning Koehler, Qing Wang, Jiawen Wang, Moritz Laupichler, Peter Sanders:

Customization Meets 2-Hop Labeling: Efficient Routing in Road Networks. 3326-3338 - Dayi Fan, Rubao Lee, Xiaodong Zhang:

X-Blossom: Massive Parallelization of Graph Maximum Matching. 3339-3353 - Chenyu Yang, Yuyu Luo, Chuanxuan Cui, Ju Fan, Chengliang Chai, Nan Tang:

Data Imputation with Limited Data Redundancy Using Data Lakes. 3354-3367 - Huang Chunyue, Shuang Liu, Xinyi Zhang, Wenhao Li, Wei Lu, Xiaoyong Du:

Chimera: Mitigating Ownership Transfers in Multi-Primary Shared-Storage Cloud-Native Databases. 3368-3381 - Minze Xu, Zhentai Xie, Zhibin Wang, Guangzhan Wang, Longbin Lai, Yuan Zhang, Chen Tian, Sheng Zhong:

Sectric: Towards Accurate, Privacy-preserving and Efficient Triangle Counting. 3382-3395 - Haoyang Li, Yuming Xu, Yiming Li, Hanmo Liu, Darian Li, Chen Jason Zhang, Lei Chen, Qing Li:

When Speed meets Accuracy: an Efficient and Effective Graph Model for Temporal Link Prediction. 3396-3405 - Yuxin Tang, Feng Zhang, Jiawei Guan, Yuan Tian, Xiangdong Huang, Chen Wang, Jianmin Wang, Xiaoyong Du:

Improving Time Series Data Compression in Apache IoTDB. 3406-3420 - Jianmin Wang, Kai Wang, Ying Zhang, Wenjie Zhang, Xiwei Xu, Xuemin Lin:

On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing. 3421-3434 - Vishal Chakraborty, Youri Kaminsky, Sharad Mehrotra, Felix Naumann, Faisal Nawab, Primal Pappachan, Mohammad Sadoghi, Nalini Venkatasubramanian:

Meaningful Data Erasure in the Presence of Dependencies. 3435-3448 - Chuzhe Tang, Zhaoguo Wang, Jinyang Li, Haibo Chen:

Sonata: Multi-Database Transactions Made Fast and Serializable. 3449-3462 - Matteo Ceccarello, Francesco Pio Monaco, Francesco Silvestri:

MOMENTI: Scalable Motif Mining in Multidimensional Time Series. 3463-3476 - Albert Martin, Eduardo C. de Almeida, Oscar Romero, Anna Queralt:

How and Why False Denial Constraints are Discovered. 3477-3489 - Yingli Zhou, Qingshuo Guo, Yixiang Fang:

Efficient 𝑘-Clique Densest Subgraph Discovery: Towards Bridging Practice and Theory. 3490-3503 - Meihao Fan, Ju Fan, Nan Tang, Lei Cao, Guoliang Li, Xiaoyong Du:

AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework. 3504-3517 - Zengyang Gong, Yuxiang Zeng, Lei Chen:

Accelerating Approximate Nearest Neighbor Search in Hierarchical Graphs: Efficient Level Navigation with Shortcuts. 3518-3530 - Yan Zhang, Shuwei Liang, Xiaoye Miao, Yangyang Wu, Jianwei Yin:

Federated Incomplete Tabular Data Prediction with Missing Complementarity. 3531-3544 - Yihao Hu, Jin Wang, Sajjadur Rahman:

LakeVisage: Towards Scalable, Flexible and Interactive Visualization Recommendation for Data Discovery over Data Lakes. 3545-3558 - Xiaokai Zhou, Xiao Yan, Fangcheng Fu, Ziwen Fu, Tieyun Qian, Yuanyuan Zhu, Qinbo Zhang, Bin Cui, Jiawei Jiang:

PS-MI: Accurate, Efficient, and Private Data Valuation in Vertical Federated Learning. 3559-3572 - Jiasheng Zhang, Deqiang Ouyang, Shuang Liang, Jie Shao:

Towards Pattern-aware Data Augmentation for Temporal Knowledge Graph Completion. 3573-3586 - Chiyu Hao, Jixian Su, Shixuan Sun, Hao Zhang, Sen Gao, Jianwen Zhao, Chenyi Zhang, Jieru Zhao, Chen Chen, Minyi Guo:

RapidStore: An Efficient Dynamic Graph Storage System for Concurrent Queries. 3587-3600 - Xiaoyu Fan, Kun Chen, Jiping Yu, Xiaowei Zhu, Yunyi Chen, Huanchen Zhang, Wei Xue:

GORAM: Graph-oriented ORAM for Efficient Ego-centric Queries on Federated Graphs. 3601-3614 - Weijie Sun, Zihuan Xu, Wangze Ni, Lei Chen, Peng Cheng, Chen Jason Zhang:

Authenticated Aggregate Queries with Boolean Range Predicates on Blockchains. 3615-3627 - Zemin Chao, Qiaoyi Zheng, Zhixin Qi, Hongzhi Wang:

FSMDTW: A Fast Index-free Subsequence Matching Algorithm for Dynamic Time Warping. 3628-3640 - Jianheng Tang, Xi Zhao, Lemin Kong, Xiaofang Zhou, Jia Li:

Fused Gromov-Wasserstein Alignment for Graph Edit Distance Computation and Beyond. 3641-3654 - Tianshu Zhang, Kun Qian, Siddhartha Sahai, Yuan Tian, Shaddy Garg, Huan Sun, Yunyao Li:

EVOSCHEMA: TOWARDS TEXT-TO-SQL ROBUSTNESS AGAINST SCHEMA EVOLUTION. 3655-3668 - Shuai Han, Yushi Tao, Jingwen Tan, Huanran Wang, Wu Yang, Yanmei Wang:

Effective and Efficient Community Search for Complex Network Semantics Capture: From Coarse-Grain to Fine-Grain. 3669-3681 - Rongrong Zhang, Zhiwei Ye, Jun-Peng Zhu, Peng Cai, Xuan Zhou, Dunbo Cai, Ling Qian:

HAWK: A Workload-driven Hierarchical Deadlock Detection Approach in Distributed Database System. 3682-3694
Volume 18, Number 11, July 2025
- Themis Palpanas, Peter R. Pietzuch, Nesime Tatbul, Peter Triantafillou:

Front Matter. - Chengliang Chai, Jiajun Li, Yuhao Deng, Yuanhao Zhong, Ye Yuan, Guoren Wang, Lei Cao:

Doctopus: Budget-aware Structural Table Extraction from Unstructured Documents. 3695-3707 - Qi Wen, Yutong Ye, Xiang Lian, Mingsong Chen:

S^3AND: Efficient Subgraph Similarity Search Under Aggregated Neighbor Difference Semantics. 3708-3720 - Yanping Zheng, Zhewei Wei, Frank De Hoo, Xu Chen, Hongteng Xu, Yuhang Ye, Jiadeng Huang:

Lighter-X: An Efficient and Plug-and-play Strategy for Graph-based Recommendation through Decoupled Propagation. 3721-3729 - Qiyu Liu, Yanlin Qi, Siyuan Han, Jingshu Peng, Jin Li, Lei Chen:

Not Small Enough? SegPQ: A Learned Approach to Compress Product Quantization Codebooks. 3730-3743 - Nazanin Rashedi, Guido Moerkotte:

The Accuracy of Cardinality Estimators: Unraveling the Evaluation Result Conundrum. 3744-3756 - Benzhao Tang, Shiyu Yang, Zhitao Shen, Wenjie Zhang, Xuemin Lin, Zhihong Tian:

LogLite: Lightweight Plug-and-Play Streaming Log Compression. 3757-3770 - Saimon Amanuel Tsegai, Xinyu Yang, Haoyuan Liu, Peng Gao:

Enabling Efficient Attack Investigation via Human-in-the-Loop Security Analysis. 3771-3783 - Wenzhi Fu, Yang Cao:

Shifting Transaction Isolation on Graphs: From Systems to Data. 3784-3796 - Wenqi Jiang, Hang Hu, Torsten Hoefler, Gustavo Alonso:

Fast Graph Vector Search via Hardware Acceleration and Delayed-Synchronization Traversal. 3797-3811 - Hengyu Ye, Jiadong Chen, Xiao He, Fuxin Jiang, Tieying Zhang, Jianjun Chen, Xiaofeng Gao:

Fremer: Lightweight and Effective Frequency Transformer for Workload Forecasting in Cloud Services. 3812-3825 - Arash Dargahi Nobari, Davood Rafiei:

TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations. 3826-3839 - Fan Cui, Eric Lo, Srijan Srivastava, Ziliang Lai:

Bonspiel: Low Tail Latency Transactions in Geo-Distributed Databases. 3840-3853 - Qiange Wang, Yongze Yan, Hongshi Tan, Cheng Chen, Cheng Zhao, Jiaming Tian, Jiaxin Jiang, Xiaoliang Cong, Yanfeng Zhang, Ge Yu, Weng-Fai Wong, Bingsheng He:

Efficient Graph Data Access for Out-of-Memory GPU Streaming Graph Processing. 3854-3867 - Daniel Ulrich Schmitt, Thomas Hütter, Nikolaus Augsten:

Extensible and Robust Evaluation of Similarity Queries. 3868-3882 - Yan Zhou, Chunwei Liu, Bhuvan Urgaonkar, Zhengle Wang, Magnus Mueller, Chao Zhang, Songyue Zhang, Pascal Pfeil, Dominik Horn, Zhengchun Liu, Davide Pagano, Tim Kraska, Samuel Madden, Ju Fan:

PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking. 3883-3895 - Yujie Lu, Zhijie Zhang, Weiguo Zheng, Lei Zou:

Accelerating Subgraph Matching through Fine-grained and Powerful Equivalences. 3896-3909 - Luca Gretscher, Jens Dittrich:

How to Optimize SQL Queries? A Comparison Between Split, Holistic, and Hybrid Approaches. 3910-3922 - Navid Eslami, Ioana O. Bercea, Niv Dayan:

Diva: Dynamic Range Filter for Var-Length Keys and Queries. 3923-3936 - Luca Becchetti, Andrea Clementi, Luciano Gualà, Luca Pepè Sciarria, Alessandro Straziota, Matteo Stromieri:

Approximate 2-hop neighborhoods on incremental graphs: An efficient lazy approach. 3937-3950 - Vasilis Mageirakos, Bowen Wu, Gustavo Alonso:

Cracking Vector Search Indexes. 3951-3964 - Alireza Heidari, Amirhossein Ahmadi, Wei Zhang:

DobLIX: A Dual-Objective Learned Index for Log-Structured Merge Trees. 3965-3978 - Xuhang Zhu, Xiu Tang, Sai Wu, Jichen Li, Haobo Wang, Chang Yao, Quanqing Xu, Gang Chen:

CoLA: Model Collaboration for Log-based Anomaly Detection. 3979-3987 - Michael Jungmair, Jana Giceva:

Towards Designing Future-Proof Data Processing Systems. 3988-3995 - Omer Abramovich, Daniel Deutch, Nave Frost, Ahmet Kara, Dan Olteanu:

Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms. 3996-4008 - Amedeo Pachera, Mattia Palmiotto, Angela Bonifati, Andrea Mauri:

What If: Causal Analysis with Graph Databases. 4009-4016 - Mateusz Gienieczko, Maximilian Kuschewski, Thomas Neumann, Viktor Leis, Jana Giceva:

AnyBlox: A Framework for Self-Decoding Datasets. 4017-4031 - Yushuai Ji, Shengkun Zhu, Shixun Huang, Zepeng Liu, Sheng Wang, Zhiyong Peng:

Federated and Balanced Clustering for High-dimensional Data. 4032-4044 - Mohamed Sabri Hafidi, Ozan Kahramanogullari, Anton Dignös, Johann Gamper:

Relational Data Models for Genetic VCF data. 4045-4053 - Apostolos Giannoulidis, Anastasios Gounaris, John Paparrizos:

BURST: Rendering Clustering Techniques Suitable for Evolving Streams. 4054-4063 - Michail Bachras, Hans-Arno Jacobsen:

Environmental Footprints of Query Processing: A Vision for Sustainable Database Architectures. 4064-4072 - Alexander W. Lee, Justin Chan, Michael Fu, Nicolas Kim, Akshay Mehta, Deepti Raghavan, Ugur Çetintemel:

Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems. 4073-4080 - Bingqiao Luo, Jiaxin Jiang, Yuhang Chen, Junyi Hou, Cheng Jun Tey, Ziyang Qiu, Bingsheng He, Spencer Xiao, Dominic Ong, Wee Howe Ang:

RICH: Real-time Identification of negative Cycles for High-efficiency Arbitrage. 4081-4089 - Martin Lange, Patricia Guerra-Balboa, Javier Parra-Arnau, Thorsten Strufe:

Balancing Privacy and Utility in Correlated Data: A Study of Bayesian Differential Privacy. 4090-4103 - Riki Otaki, Jun Hyuk Chang, Aaron J. Elmore, Goetz Graefe:

Enhancing Transaction Processing through Indirection Skipping. 4104-4116 - Xiaoou Ding, Zekai Qian, Hongzhi Wang, Siying Chen, Yafeng Tang, Hongbin Su, Huan Hu, Chen Wang:

UniClean: A Scalable Data Cleaning Solution for Mixed Errors based on Unified Cleaners and Optimized Cleaning Workflow. 4117-4130 - Venetia Pliatsika, João Fonseca, Kateryna Akhynko, Ivan Shevchenko, Julia Stoyanovich:

ShaRP: Explaining Rankings and Preferences with Shapley Values. 4131-4143 - Tobias Schmidt, Viktor Leis, Peter Boncz, Thomas Neumann:

SQLStorm: Taking Database Benchmarking into the LLM Era. 4144-4157 - Jiaxiang Liu, Siyuan Xia, Daniel Alabi, Eugene Wu:

Suna: Scalable Causal Confounder Discovery over Relational Data. 4158-4170 - Liana Patel, Siddharth Jha, Melissa Z. Pan, Harshit Gupta, Parth Asawa, Carlos Guestrin, Matei Zaharia:

Semantic Operators and Their Optimization: Towards AI-Based Data Analytics with Accuracy Guarantees. 4171-4184 - Ziniu Wu, Markos Markakis, Chunwei Liu, Peter Baile Chen, Balakrishnan Narayanaswamy, Tim Kraska, Samuel Madden:

Improving DBMS Scheduling Decisions with Accurate Performance Prediction on Concurrent Queries. 4185-4198 - Pranay Mundra, Charalampos Papamanthou, Julian Shun, Quanquan C. Liu:

Practical and Accurate Local Edge Differentially Private Graph Algorithms. 4199-4213 - Wen Xu, Pengpeng Qiao, Shang Liu, Zhirun Zheng, Yang Cao, Zhetao Li:

Continuous Publication of Weighted Graphs with Local Differential Privacy. 4214-4226 - Qiyu Zhuang, Wei Lu, Shuang Liu, Yuxing Chen, Xinyue Shi, Zhanhao Zhao, Yipeng Sun, Anqun Pan, Xiaoyong Du:

TxnSails: Achieving Serializable Transaction Scheduling with Self-Adaptive Isolation Level Selection. 4227-4240 - Hyoungjoo Kim, Yiwei Zhao, Andrew Pavlo, Phillip B. Gibbons:

No Cap, This Memory Slaps: Breaking Through the Memory Wall of Transactional Database Systems with Processing-in-Memory. 4241-4254 - Xinbiao Gan, Tiejun Li, Chunye Gong, Dongsheng Li, Dezun Dong, Jie Liu, Kai Lu:

GraphCSR: A Degree-Equalized CSR Format for Large-scale Graph Processing. 4255-4268 - Yiran Li, Gongyao Guo, Chen Feng, Jieming Shi:

Effective and Efficient Attributed Hypergraph Embedding on Nodes and Hyperedges. 4269-4281 - Qiyan Li, Jeffrey Yu, Zongyan He:

Subgraph Matching: A New Decomposition Based Approach. 4282-4294 - Gabriel Haas, Bohyun Lee, Philippe Bonnet, Viktor Leis:

SSD-iq: Uncovering the Hidden Side of SSD Performance. 4295-4308 - Qiqi Zhou, Yanyan Shen, Lei Chen:

Faster Convergence in Mini-batch Graph Neural Networks Training with Pseudo Full Neighborhood Compensation. 4309-4322 - Keonwoo Oh, Pooja Nilangekar, Amol Deshpande:

TreeCat: Standalone Catalog Engine for Large Data Systems. 4323-4336 - Ziyang Yue, Bolong Zheng, Ling Xu, Kanru Xu, Shuhao Zhang, Yajuan Du, Yunjun Gao, Xiaofang Zhou, Christian S. Jensen:

Select Edges Wisely: Monotonic Path Aware Graph Layout Optimization for Disk-based ANN Search. 4337-4349 - Marko Kabic, Bowen Wu, Jonas Dann, Gustavo Alonso:

Powerful GPUs or Fast Interconnects: Analyzing Relational Workloads on Modern GPUs. 4350-4363 - Qinghua Liu, Seunghak Lee, John Paparrizos:

TSB-AutoAD: Towards Automated Solutions for Time-Series Anomaly Detection [E, A \u0026 B]. 4364-4379 - John Paparrizos, Bogireddy Sai Prasanna Teja:

Time-Series Clustering: A Comprehensive Study of Data Mining, Machine Learning, and Deep Learning Methods. 4380-4395 - Kaisei Hishida, Chunwei Liu, John Paparrizos, Aaron J. Elmore:

Beyond Compression: A Comprehensive Evaluation of Lossless Floating-Point Compression. 4396-4409 - Keke Huang, Yimin Shi, Dujian Ding, Yifei Li, Yang Fei, Laks V. S. Lakshmanan, Xiaokui Xiao:

ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries. 4410-4423 - Sajad Faghfoor Maghrebi, Niv Dayan:

Sphinx: A Succinct Perfect Hash Index for x86. 4424-4437 - Gaurav Sehgal, Semih Salihoglu:

NaviX: A Native Vector Index Design for Graph DBMSs With Robust Predicate-Agnostic Search Performance. 4438-4450 - Ryan Hildebrant, Rahul Atul Bhope, Sharad Mehrotra, Christopher Tull, Nalini Venkatasubramanian:

DIM-SUM: Dynamic IMputation for Smart Utility Management. 4451-4464 - Anurag Chakraborty, Semih Salihoglu:

Robust Recursive Query Parallelism in Graph Database Management Systems. 4465-4477 - Haseeb Ahmed, Nachiket Rao, Abdelkarim Kati, Florian Kerschbaum, Sujaya Maiyya:

OasisDB: An Oblivious and Scalable System for Relational Data. 4478-4491 - Tharushi Jayasekara, Immanuel Trummer:

CEDAR: A System for Cost-Efficient Data-Driven Claim Verification. 4492-4504 - Konstantinos Lampropoulos, Fatemeh Zardbani, Nikos Mamoulis, Panagiotis Karras:

Benchmarking Adaptive Multidimensional Indices. 4505-4517 - Yinan Li, Bailu Ding, Ziyun Wei, Lukas M. Maas, Momin Al-Ghosien, Spyros Blanas, Nicolas Bruno, Carlo Curino, Matteo Interlandi, Craig Peeper, Kaushik Rajan, Surajit Chaudhuri, Johannes Gehrke:

Scaling GPU-Accelerated Databases beyond GPU Memory Size. 4518-4531 - Haibo Xiu, Yang Li, Qianyu Yang, Pankaj Agarwal, Jun Yang:

PAR2QO: Parametric Penalty-Aware Robust Query Optimization. 4532-4545 - Qihan Zhang, Shaolin Xie, Ibrahim Sabek:

LIMAO: A Framework for Lifelong Modular Learned Query Optimization. 4546-4559 - Zhaoze Sun, Chengliang Chai, Qiyan Deng, Kaisen Jin, Xinyu Guo, Han Han, Ye Yuan, Guoren Wang, Lei Cao:

QUEST: Query Optimization in Unstructured Document Analysis. 4560-4573 - Guorui Xiao, Dong He, Jin Wang, Magdalena Balazinska:

CENTS: A Flexible and Cost-Effective Framework for LLM-Based Table Understanding. 4574-4587 - Christos Koutras, Jiani Zhang, Xiao Qin, Chuan Lei, Vassilis N. Ioannidis, Christos Faloutsos, George Karypis, Asterios Katsifodimos:

OmniMatch: Joinability Discovery in Data Products. 4588-4601 - Enyuan Zhou, Song Guo, Zicong Hong, Christian S. Jensen, Yang Xiao, Jinwen Liang, Dalin Zhang:

Pistis: A Decentralized Knowledge Graph Platform Enabling Ownership-Preserving SPARQL Querying. 4602-4615 - Yihao Liu, Shaoxuan Tang, Yulong Hui, Hangrui Zhou, Huanchen Zhang:

Selective Late Materialization in Modern Analytical Databases. 4616-4628 - Azim Afroozeh, Peter Boncz:

The FastLanes File Format. 4629-4643 - Yuchuan Huang, Ana Elena Uribe, Kareem Eldahshoury, Youssef Hussein, Grant Ogren, Mohamed F. Mokbel:

POLARIS: An Interactive and Scalable Data Infrastructure for Polar Science. 4644-4652 - Tobias Maltenberger, Ilin Tolovski, Tilmann Rabl:

Efficiently Joining Large Relations on Multi-GPU Systems. 4653-4667 - Jiongli Zhu, Geyang Xu, Felipe Lorenzi, Boris Glavic, Babak Salimi:

Stress-Testing ML Pipelines with Adversarial Data Corruption. 4668-4681 - Songlei Wang, Yifeng Zheng, Xiaohua Jia, Haibo Hu:

PrivAGM: Secure Construction of Differentially Private Directed Attributed Graph Models on Decentralized Social Graphs. 4682-4694 - Haoyang Li, Shang Wu, Xiaokang Zhang, Xinmei Huang, Jing Zhang, Fuxin Jiang, Shuai Wang, Tieying Zhang, Jianjun Chen, Rui Shi, Hong Chen, Cuiping Li:

OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale. 4695-4709 - Joobo Shim, Jaewon Oh, Hongchan Roh, Jaeyoung Do, Sang-Won Lee:

Turbocharging Vector Databases using Modern SSDs. 4710-4722 - Zhaoheng Li, Silu Huang, Wei Ding, Yongjoo Park, Jianjun Chen:

SIEVE: Effective Filtered Vector Search with Collection of Indexes. 4723-4736 - Andrea D'Ascenzo, Julian Meffert, Petra Mutzel, Fabrizio Rossi:

Enhancing Graph Edit Distance Computation: Stronger and Orientation-based ILP Formulations. 4737-4749 - Chunyu Chen, Zhengjie Miao, Yong Zhang, Jiannan Wang:

ParSEval: Plan-aware Test Database Generation for SQL Equivalence Evaluation. 4750-4762
Volume 18, Number 12, August 2025
- Sonia Bergamaschi, Sourav S. Bhowmick, Philippe Bonnet, Surajit Chaudhuri, Xiaoou Ding, Hakan Ferhatosmanoglu, Raul Castro Fernandez, Jana Giceva, Madelon Hulsebos, Alexandra Meliou, Nikos Ntarmos, Themis Palpanas, John Paparrizos, Norman Paton, Subhadeep Sarkar, Giovanni Simonini, Nesime Tatbul, Jiuqi Wei, Jingren Zhou:

Front Matter. - Fengxin Li, Yi Li, Yue Liu, Chao Zhou, Yuan Wang, Xiaoxiang Deng, Wei Xue, Dapeng Liu, Lei Xiao, Haijie Gu, Jie Jiang, Hongyan Liu, Biao Qin, Jun He:

LEADRE: Multi-Faceted Knowledge Enhanced LLM Empowered Display Advertisement Recommender System. 4763-4776 - Georgios Theodorakis, Hugo Firth, James Clarkson, Natacha Crooks, Jim Webber:

TuskFlow: An Efficient Graph Database for Long-Running Transactions. 4777-4790 - Panagiotis Antonopoulos, Mansi Chauhan, Shailender Dabas, Rajat Jain, Darshan Kattera, Wonseok Kim, Hanuma Kodavalla, Nikolas Ogg, Prashanth Purnananda, Rahul Ranjan, Alex Swanson, Divyesh Tikmani:

MD-MVCC: Multi-version Concurrency Control for Schema Changes in Azure SQL Database. 4791-4803 - Michael J. Carey, Wail Y. Alkowaileet, Nick Digeronimo, Peeyush Gupta, Sachin Smotra, Till Westmann:

Towards Principled, Practical Document Database Design. 4804-4816 - Manos Karpathiotakis, Vlassios Rizopoulos, Artem Gelun, Tiziano Carotti, Hazem Nada, Basri Kahveci, Yuri Dolgov:

Scribe: How Meta transports zettabytes per day in real time. 4817-4830 - Daniel Ritter, Mihnea Andrei, Sukhyeun Cho, Maik Goergens, Taehyung Lee, Norman May, Amit Pathak, Paul R. Willems:

The HANA Native Query Engine for Lakehouse Systems. 4831-4845 - Yuan Mei, Zhaoqian Lan, Lei Huang, Yanfei Lei, Han Yin, Rui Xia, Kaitian Hu, Paris Carbone, Vasiliki Kalavri, Feng Wang:

Disaggregated State Management in Apache Flink 2.0. 4846-4859 - Jie Jiang, Haining Xie, Siqi Shen, Yu Shen, Zihan Zhang, Meng Lei, Yifeng Zheng, Yang Li, Chunyou Li, Danqing Huang, Yinjun Wu, Wentao Zhang, Xiaofeng Yang, Bin Cui, Peng Chen:

SiriusBI: A Comprehensive LLM-powered Solution for Data Analytics in Business Intelligence. 4860-4873 - Edward Y. Chang, Longling Geng:

SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning. 4874-4886 - Hongtao Yang, Zhichen Xu, Sergey Yudin, Andrew Davidson:

Unlocking the Power of CI/CD for Data Pipelines in Distributed Data Warehouses. 4887-4895 - Jianjun Chen, Li Zhang, Yu Xie, Wei Ding, Lixun Cao, Ye Liu, Yonghua Ding, Fangshi Li, Ke Wu, Haibo Xiu, Kui Wei, Le Cai, Rui Chang, Yuxiang Chen, Yuanjin Lin, Shangyu Luo, Jianfeng Qian, Xu Wang, Zikang Wang, Jian Zhang, Mingyi Zhang, Shicai Zeng, Jason Sun, Lei Zhang, Rui Shi, Pengwei Zhao:

veDB-HTAP: a Highly Integrated, Efficient and Adaptive HTAP System. 4896-4909 - Konstantinos Kanellis, Badrish Chandramouli, Ted Hart, Shivaram Venkataraman:

From FASTER to F2: Evolving Concurrent Key-Value Store Designs for Large Skewed Workloads. 4910-4923 - Sunil Chakkappen, Shreya Kunjibettu, Daniel Mcgreer, Masoomeh Kishi, Hong Su, Mohamed Ziauddin, Mohamed Zaït, Zhan Li, Yuying Zhang:

Automatic Indexing in Oracle. 4924-4937 - Murtadha Al Hubail, Ali Alsuliman, Wail Y. Alkowaileet, Michael Blow, Michael J. Carey, Savyasach Enukonda, Peeyush Gupta, Santosh Hegde, Kamini Jagtiani, Abhishek Jindal, Nawazish Kahn, Mehnaz Tabassum Mahin, Ian Maxon, M. Muralikrishna, Keshav Murthy, Daniel Nagy, Preetham Poluparthi, Ankit Prabhu, Ritik Raj, Vijay Sarathy, Shahrzad Shirazi, Utsav Singh, Hussain Towaileb, Ayush Tripathi, Janhavi Tripurwar, Bo-Chun Wang, Till Westmann:

Cloudy With a Chance of JSON. 4938-4950 - Guoliang Li, Ji Sun, James Pan, Jiang Wang, Yongqing Xie, Ruicheng Liu, Wen Nie:

GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications. 4951-4963 - Jun Song, Jingyi Ding, Irshad Kandy, Yanghao Lin, Zhongjia Wei, Zilong Zhou, Zhiwei Peng, Jixi Shan, Hongyue Mao, Xiuqi Huang, Xun Song, Cheng Chen, Yanjia Li, Tianhao Yang, Wei Jia, Xiaohong Dong, Kang Lei, Rui Shi, Pengwei Zhao, Wei Chen:

Magnus: A Holistic Approach to Data Management for Large-Scale Machine Learning Workloads. 4964-4977 - Xin Gao, Sibasish Acharya, Sihui Han, Yongxiong Ren, Yanli Zhao, Liang Luo, Chucheng Wang, Pradeep Fernando, Saurabh Mishra, Siqi Yan, Yicong Du, Elzbieta Krepska, Intaik Park, Min Ni, Qunshu Zhang, Shen Li:

DECK: Experiences on Delta Checkpointing for Industrial Recommendation Systems. 4978-4990 - Zhe Jiang, Zhaoguo Wang, Haoning Lan, Chuzhe Tang, Haoran Ding, Lefeng Wang, Songyun Zou, Zhuoran Wei, Yongcun Liu, Xiang Yu, Yang Ren, Guoliang Li, Haibo Chen:

GRewriter: Practical Query Rewriting with Automatic Rule Set Expansion in GaussDB. 4991-5003 - Jun-Peng Zhu, Lingfeng Zhang, Peng Cai, Xuan Zhou, Peisen Zhao, Xue Wang, Linpeng Tang:

FDBKeeper: Enabling Scalable Coordination Services for Metadata Management using Distributed Key-Value Databases. 5004-5016 - Xiaoyao Zhong, Haotian Li, Jiabao Jin, Mingyu Yang, Deming Chu, Xiangyu Wang, Zhitao Shen, Wei Jia, George Gu, Yi Xie, Xuemin Lin, Heng Tao Shen, Jingkuan Song, Peng Cheng:

VSAG: An Optimized Search Framework for Graph-based Approximate Nearest Neighbor Search. 5017-5030 - Zhaoyan Sun, Xuanhe Zhou, Guoliang Li, Xiang Yu, Jianhua Feng, Yong Zhang:

R-Bot: An LLM-based Query Rewrite System. 5031-5044 - William Schultz, Murat Demirbas:

Design and Modular Verification of Distributed Transactions in MongoDB. 5045-5058 - Xinjun Yang, Feifei Li, Yingqiang Zhang, Hao Chen, Qingda Hu, Panfeng Zhou, Qiang Zhang, Shuai Li, Zongzhi Chen, Zheyu Miao, Rongbiao Xie, Chuan Sun, Zetao Wei, Jing Fang, Xingxuan Zhou, Xiaofei Wu:

From Scale-Up to Scale-Out: PolarDB's Journey to Achieving 2 Billion tpmC. 5059-5072 - Mingyu Liu, Junbin Kang, Kai Wang, Lu Zhang, Haibo Chen, Xiuchang Li, Tianhong Ding:

ScaleCache: Scalable and Production-grade Buffer Management for Disk-based Database Systems. 5073-5085 - Jun-Peng Zhu, Boyan Niu, Peng Cai, Zheming Ni, Jianwei Wan, Kai Xu, Jiajun Huang, Shengbo Ma, Bing Wang, Xuan Zhou, Guanglei Bao, Donghui Zhang, Liu Tang, Qi Liu:

Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models. 5086-5099 - Bing Tong, Yan Zhou, Chen Zhang, Jianheng Tang, Jia Li, Lei Chen:

GalaxyWeaver: Autonomous Table-to-Graph Conversion and Schema Optimization with Large Language Models. 5100-5112 - Tim Gubner, Rune Humborstad, Manyi Lu:

Freely Moving Between the OLTP and OLAP Worlds: Hermes - an High-Performance OLAP Accelerator for MySQL. 5113-5125 - Jan Vincent Szlang, Sebastian Breß, Sebastian Cattes, Jonathan Dees, Florian Funke, Max Heimel, Michel Oleynik, Ismail Oukid, Tobias Maltenberger:

Workload Insights From the Snowflake Data Cloud: What Do Production Analytic Queries Really Look Like? 5126-5138 - Fangyuan Zhang, Caihua Yin, Hua Fan, Fenghua Fang, Yineng Chen, Xuqi Wang, Mengqi Wu, Bing Chen, Tianbo Jin, Sibo Wang, Wenchao Zhou, Feifei Li:

AnalyticDB-PG: A Cloud-native High-performance Data Warehouse in Alibaba Cloud. 5139-5152 - Fangyuan Zhang, Mengqi Wu, Chunlei Xu, Yunong Bao, Jiyu Qiao, Yingli Zhou, Hua Fan, Caihua Yin, Wenchao Zhou, Feifei Li:

Streaming View: An Efficient Data Processing Engine for Modern Real-time Data Warehouse of Alibaba Cloud. 5153-5165 - Nitish Upreti, Harsha Vardhan Simhadri, Hari Sudan Sundar, Krishnan Sundaram, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Tamara Hass, Nitesh Dudhey, Subramanyam Pattipaka, Mark Hildebrand, Magdalen Dobson, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Hemeswari Varada, Sudhanshu Barthwal, Ritika Mor, James Codella, Shaun Cooper, Kevin Pilch, Simon Moreno, Aayush Kataria, Santosh Kulkarni, Neil Deshpande, Amar Sagare, Dinesh Billa, Zishan Fu, Vipul Vishal:

Cost-Effective, Low Latency Vector Search with Azure Cosmos DB. 5166-5183 - Sijie Guo, Matteo Merli, Hang Chen, Neng Lu, Penghui Li:

Ursa: A Lakehouse-Native Data Streaming Engine for Kafka. 5184-5196 - Krishna Puttaswamy, Abhijit Chakankar, Tao Tao, Zaheera Valani, Ramesh Chandra, William Chau, Mengxi Chen, Akram Chetibi, Tianyi Huang, Jonathan Keller, Celia Kung, Andy Liu, Charlene Lyu, Samarth Shetty, Xiaotong Sun, Steve Weis, Lin Zhou, Ryan Zhu, Reynold Xin, Matei Zaharia:

Delta Sharing: An Open Protocol for Cross-Platform Data Sharing. 5197-5209 - Sam Lightstone, Ping Wang:

SQL:Trek Automated Index Design at Airbnb. 5210-5222 - Marc Baeuerle, Thomas Bodner, Martin Boissier, Tilmann Rabl, Ricardo Salazar-Díaz, Florian Schmeller, Nils Strassenburg, Ilin Tolovski, Marcel Weisgut, Wang Yue:

TCO2: Analyzing the Carbon Footprint of Database Server Replacements. 5223-5226 - Pavel Koupil, Jáchym Bártík, Stefan Klessinger, André Conrad, Stefanie Scherzinger:

FDepHunter: Harnessing Negative Examples to Expose Fakes and Reveal Ghosts. 5227-5230 - Rong Kang, Shuai Wang, Tieying Zhang, Xianghong Xu, Linhui Xu, Zhimin Liang, Lei Zhang, Rui Shi, Jianjun Chen:

VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era. 5231-5234 - Roi Yona, Jonathan Breitman, Benny Kimelfeld:

DVote: Constraining Committee Voting with Database Dependencies. 5235-5238 - Yangxin Fan, Haolai Che, Mingjian Lu, Yinghui Wu:

Graph Compression for Interpretable Graph Neural Network Inference At Scale. 5239-5242 - Bingnan Chen, Binyang Dai, Qichen Wang, Ke Yi:

Query running too slow? Rewrite it with Quorion! 5243-5246 - Søren Kejser Jensen, Christian Schmidt Godiksen, Christian Thomsen, Torben Bach Pedersen:

Demonstration of ModelarDB: Model-Based Management of High-Frequency Time Series Across Edge, Cloud, and Client. 5247-5250 - Louisa Lambrecht, Tim Findling, Samuel Heid, Marcel Knüdeler, Torsten Grust:

Democratize MATCH_RECOGNIZE! 5251-5254 - Roman Heinrich, Oleksandr Havrylov, Manisha Luthra, Johannes Wehrstein, Carsten Binnig:

Opening The Black-Box: Explaining Learned Cost Models For Databases. 5255-5258 - K. Venkatesh Emani, Wenjing Wang, Zi Ye, Jia He, Neel Ball, Kumaraswamy Boora, Carlo Curino, Avrilia Floratou, Manan Goenka, Paridhi Gupta, Vivek Gupta, Katherine Lin, Nick Litombe, Jared Meade, Suryakant Mutnal, Raghu Ramakrishnan, Sudhir Raparla, Dhruv Relwani, Shyam Sai, Vaibhave Sekar, Roneet Shaw, Harmeet Singh, Prasanna Sridharan, Mark Taylor, Sunidhi Tiwari, Yiwen Zhu:

Horizon: Robust Checks for SQL Migration Using LLMs. 5259-5262 - Wenhao Liu, Xiu Tang, Sai Wu, Chang Yao, Gongsheng Yuan, Gang Chen:

A Demonstration of QueryArtisan: Real-Time Data Lake Analysis via Dynamically Generated Data Manipulation Code. 5263-5266 - Dvir Cohen, Liad Domb, Avigdor Gal, Lior Ganon, Eliezer Gavriel, Omri Lazover, Coral Scharf, Bar Shterenberg:

RecForUS: A Recommender System for Uncertain Scores. 5267-5270 - Frederik M. Trudslev, Matteo Lissandrini, Juan Manuel Rodriguez, Martin Bøgsted, Daniele Dell'Aglio:

PrivEval: a tool for interactive evaluation of privacy metrics in synthetic data generation. 5271-5274 - Kyriakos Psarakis, Oto Mraz, George Christodoulou, George Siachamis, Marios Fragkoulis, Asterios Katsifodimos:

Styx in Action: Transactional Cloud Applications Made Easy. 5275-5278 - Mathilde Marcy, Jean-Marc Petit, Marian Scuturici, Jocelyn Bonjour, Camille Fertel, Gérald Cavalier:

Can Surrogate Keys Negatively Impact Data Quality? 5279-5282 - Benjamin Hättasch, Leon Krüger, Carsten Binnig:

JUSTINE (JUST-INsert Engine): Demonstrating Self-organizing Data Schemas. 5283-5286 - Jiayi Wang, Yuan Li, Jianming Wu, Shihui Xu, Guoliang Li:

Unify: A System For Unstructured Data Analytics. 5287-5290 - Alexander Beischl, Thomas Neumann:

UmbraPerf - Profiling Results Tailored for DBMS Developers. 5291-5294 - Abiram Mohanaraj, Matteo Lissandrini, Katja Hose:

Smart SPARQL Advisor: Guiding Users in Query Formulation with Performance Prediction. 5295-5298 - Jiatang Zhou, Kaisong Huang, Zhuoyue Zhao, Dong Xie, Tianzheng Wang:

Analytics Are Heavy. The DBMS Is Busy. When Will My Mission-Critical Transaction Start Running? 5299-5302 - Enzo Veltri, Donatello Santoro, Jean-Flavien Bussotti, Paolo Papotti:

Accelerating Tabular Inference: Training Data Generation with TENET. 5303-5306 - Xukang Zhang, Huanchen Zhang, Xiaofeng Meng:

Accordion: Balancing Performance and Cost in Cloud-Native Data Analysis with Intra-Query Runtime Elasticity. 5307-5310 - Long Gu, Shaza Zeitouni, Carsten Binnig, Zsolt István:

Demonstration of Reflex: How SMPC Query Execution can be sped up through Efficient and Flexible Intermediate Result Size Trimming. 5311-5314 - Haralampos Gavriilidis, Joel Ziegler, Midhun Kaippillil Venugopalan, Benedikt Didrich, Matthias Boehm, Volker Markl:

Enter the Warp: Fast and Adaptive Data Transfer with XDBC. 5315-5318 - Luca Zecchini, Ziawasch Abedjan, Vasilis Efthymiou, Giovanni Simonini:

RadlER: Deduplicated Sampling On-Demand. 5319-5322 - Amey Shinde, Viraj Sabhaya, Kevin Farokhrouz, Fariba Afrin Irany, Ali Khan, Sanjukta Bhowmick, Abhishek Santra, Sharma Chakravarthy:

MLN-geeWhiz: A Dashboard for Supporting Complete Life-Cycle of Complex Data Analysis using Multilayer Networks. 5323-5326 - Haibo Xiu, Yang Li, Qianyu Yang, Weihang Guo, Yuxi Liu, Sudeepa Roy, Pankaj K. Agarwal, Jun Yang:

Hint-QPT: Hints for Robust Query Performance Tuning. 5327-5330 - Shunit Agmon, David Avigdor, Brit Youngmann, Amir Gilad, Benny Kimelfeld:

ClaimIt: Finding Convincing Views to Endorse a Claim. 5331-5334 - Filip Jezek, Pavel Koupil, Michal Kopecky, Jáchym Bártík, Irena Holubová:

DortDB: Bridging Query Languages for Multi-Model Data Ponds. 5335-5338 - Zekai Qian, Xiaoou Ding, Chen Wang, Hongzhi Wang:

DemandClean: A Multi-Objective Learning Framework for Balancing Model Tolerance to Data Authenticity and Diversity. 5339-5342 - Xiaoou Ding, Yanshuo Liu, Zhounan Chen, Hongzhi Wang, Chen Wang, Jianmin Wang:

TARImpute: Task-Aware Auto-Recommender System for Missing Value Imputation Algorithms with Clustering Case Studies. 5343-5346 - Yuchuan Huang, Ana Elena Uribe, Grant Ogren, Youssef Hussein, Kareem Eldahshoury, Mohamed F. Mokbel:

A Demonstration of POLARIS: An Interactive and Scalable Data Infrastructure for Polar Science. 5347-5350 - Zixuan Chen, Jinyang Li, H. V. Jagadish, Mirek Riedewald:

GooseDB: A Database Engine that Optimally Refines Top-k Queries to Satisfy Representation Constraints. 5351-5354 - Ourania Ntouni, Dimitrios Banelas, Nikos Giatrakos:

NeuroFlinkCEP: Neurosymbolic Complex Event Recognition Optimized across IoT Platforms. 5355-5358 - Stefan Grafberger, Paul Groth, Sebastian Schelter:

mlidea: Interactively Improving ML Data Preparation Code via \. 5359-5362 - Henning Koehler, Sebastian Link:

Mining Meaningful Keys and Foreign Keys with High Precision and Recall. 5363-5366 - Wissal Benjira, Nicolas Travers, Bénédicte Bucher, Malika Grim-Yefsah, Faten Atigui:

SDG-KG: A Framework to Compute SDG Indicator using Open Data. 5367-5370 - Zeheng Fan, Yuxiang Zeng, Zhuanglin Zheng, Yongxin Tong:

FedVSE: A Privacy-Preserving and Efficient Vector Search Engine for Federated Databases. 5371-5374 - Sebastian Eggers, Nina Zukowska, Ziawasch Abedjan:

APEX-DAG: Library and Language independent Pipeline EXtraction. 5375-5378 - Fatemeh Ahmadi, Julian Paulußen, Ziawasch Abedjan:

Demonstrating Matelda for Multi-Table Error Detection. 5379-5382 - Qingliu Wu, Qingfeng Xiang, Yingxia Shao, Qiyao Luo, Quanqing Xu:

DBPecker: A Graph-Based Compound Anomaly Diagnosis System for Distributed RDBMSs. 5383-5386 - Zequn Li, Yuanhao Zhong, Chengliang Chai, Zhaoze Sun, Yuhao Deng, Ye Yuan, Guoren Wang, Lei Cao:

DocDB: A Database for Unstructured Document Analysis. 5387-5390 - Jianxin Yan, Wangze Ni, Lei Chen, Xuemin Lin, Peng Cheng, Zhan Qin, Kui Ren:

ContextCache: Context-Aware Semantic Cache for Multi-Turn Queries in Large Language Models. 5391-5394 - Zenon G. Zacouris, Maribel Acosta:

Simulating a Transactional Server for Multi-Model Systems. 5395-5398 - Lingxi Cui, Guanyu Jiang, Huan Li, Ke Chen, Lidan Shou, Gang Chen:

TableCopilot: A Table Assistant Empowered by Natural Language Conditional Table Discovery. 5399-5402 - Shuting Cao, Zeping Niu, Guoliang Li:

LETIndex: A Secure Learned Index with TEE. 5403-5406 - Alessandro Ferri, Mauro Famà, Samuele Langhi, Riccardo Tommasini, Angela Bonifati:

Play2Win: A Windowing Playground for Continuous Queries. 5407-5410 - Luigi Bellomarini, Andrea Gentili, Davide Magnanimi, Emanuel Sallinger:

Vadacode: A Logician-friendly IDE for Datalog+/-. 5411-5414 - Anas Dorbani, Sunny Yasser, Jimmy Lin, Amine Mhedhbi:

Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB. 5415-5418 - Fan Yang, John Paparrizos:

SAIL: A Voyage to Symbolic Approximation Solutions for Time-Series Analysis. 5419-5422 - Annabelle Warner, Andrew McNutt, Paul Rosen, El Kindi Rezig:

Buckaroo: A Direct Manipulation Visual Data Wrangler. 5423-5426 - Akash Khatri, Mahathir Mohammad, El Kindi Rezig:

Sort it Like You Mean It: Discovering Semantically Interesting Attribute Augmentations to Sort Tables. 5427-5430 - Qinghua Liu, Seunghak Lee, John Paparrizos:

EasyAD: A Demonstration of Automated Solutions for Time-Series Anomaly Detection. 5431-5434 - Tarlan Bahadori, Ahmed Eldawy, Sai Sreekar Sarvepalli:

LASEK: LLM-Assisted Style Exploration Kit for Geospatial Data. 5435-5438 - Hanwen Liu, Federico M. Spedalieri, Ibrahim Sabek:

A Demonstration of Q2O: Quantum-augmented Query Optimizer. 5439-5443 - Philipp Skavantzos, Sebastian Link:

When Entity/Relationship Models Meet Graph Databases. 5444-5447 - Graham Cormode, Shripad Gade, Samuel Maddock, Enayat Ullah:

Synthetic Tabular Data: methods, attacks and defenses. 5448-5450 - Youssef Hussein, Mohamed Hemdan, Mohamed F. Mokbel:

Large Language Models for Spatial Analysis Queries. 5451-5454 - Ziawasch Abedjan, Mahdi Esmailoghli, Sainyam Galhorta:

Data Disovery in Data Lakes: Operations, Indexes, Systems. 5455-5459 - Da Yan, Lyuheng Yuan, Akhlaque Ahmad, Saugat Adhikari:

Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods. 5460-5465 - Yuyu Luo, Guoliang Li, Ju Fan, Chengliang Chai, Nan Tang:

Natural Language to SQL: State of the Art and Open Problems. 5466-5471 - Ramón Rico, Arno Siebes, Yannis Velegrakis:

New Trends in Data Forgetting for Sustainable Data Management. 5472-5476 - Haridimos Kondylakis, Stefania Dumbrava, Matteo Lissandrini, Nikolay Yakovets, Angela Bonifati, Vasilis Efthymiou, George Fletcher, Dimitris Plexousakis, Riccardo Tommasini, Georgia Troullinou, Elisjana Ymeralli:

Property Graph Standards: State of the Art \u0026 Open Challenges. 5477-5481 - Roman Heinrich, Xiao Li, Manisha Luthra, Zoi Kaoudi:

Learned Cost Models for Query Optimization: From Batch to Streaming Systems. 5482-5487 - Helena Caminal, Yannis Chronis, Yannis Papakonstantinou, Fatma Özcan, Anastasia Ailamaki:

Filtered Vector Search: State-of-the-art and Research Challenges. 5488-5492 - Mengying Wang, Moming Duan, Yicong Huang, Chen Li, Bingsheng He, Yinghui Wu:

ML-Asset Management: Curation, Discovery, and Utilization. 5493-5498 - Hanchen Wang, Ying Zhang, Wenjie Zhang:

Machine Learning for Graph Data Management and Query Processing. 5499-5503 - James Pan, Guoliang Li:

Database Perspective on LLM Inference Systems. 5504-5507 - Viktor Leis:

Beyond Incrementalism: How to Change the World Through Data Systems Research (VLDB 2025 Panel). 5508-5509 - Eugene Wu, Raul Castro Fernandez:

Where Does Academic Database Research Go From Here? 5510-5511 - Yannis E. Ioannidis:

Open Science: A New Paradigm for the Research Lifecycle. 5512 - Paolo Papotti, Carsten Binnig:

Panel on Neural Relational Data: Tabular Foundation Models, LLMs... or both? 5513-5515 - Angela Bonifati:

Versatile Property Graph Transformations. 5516-5526 - Xiangyao Yu:

Disaggregation: A New Architecture for Cloud Databases. 5527-5530 - Viktor Leis, Andrey Gubichev, Atanas Mirchev, Peter Boncz, Alfons Kemper, Thomas Neumann:

Still Asking: How Good Are Query Optimizers, Really? 5531-5536 - Stratos Idreos:

Alphabets, Grammars, Calculators, and the End of Hand-Crafted Systems. 5537 - Juliana Freire:

Bridging Disciplines in Data Management Research to Solve Complex Data Problems. 5538 - Matei Zaharia:

Bringing the Operational and Analytical Worlds Together with Lakebase. 5539

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














