Proceedings of the VLDB Endowment, Volume 15, 2021-2022
- Editors-in-Chief: Fatma Ozcan, Juliana Freire, and Xuemin Lin
- Publication Editors: Xin Cao and Lijun Chang
- Associate Editors: Arun Kumar, Azza Abouzied, Beng Chin Ooi, Boris Glavic, Dan Suciu, Divyakant Agrawal, Eugene Wu, Georgia Koutrika, Jeffrey Xu Yu, Julia Stoyanovich, Jun Yang, K. Selçuk Candan, Khuzaima Daudjee, Laks Lakshmanan, Laure Berti-Equille, Lei Chen, Mohamed Mokbel, Neoklis Polyzotis, Paolo Papotti, Peter Boncz, Sebastian Schelter, Sharad Mehrotra, Sourav S Bhowmick, Surajit Chaudhuri, Themis Palpanas, Vanessa Braganholo, Wang-Chiew Tan, Wenjie Zhang, Wook-Shin Han, Xiaofang Zhou
- Review Board: see full list
Volume 15, No. 1, September 2021
- Juliana Freire and Xuemin Lin:
Front Matter. i - vi. - Kang Zhao, Liuyihan Song, Yingya Zhang, Pan Pan, Yinghui Xu, Rong Jin:
ANN Softmax: Acceleration of Extreme Classification Training. 1 - 10. - Gyeong-In Yu, Saeed Amizadeh, Sehoon Kim, Artidoro Pagnoni, Ce Zhang, Byung-Gon Chun, Markus Weimer, Matteo Interlandi:
WindTunnel: Towards Differentiable ML Pipelines Beyond a Single Model. 11 - 20. - Athinagoras Skiadopoulos, Qian Li, Peter Kraft, Kostis Kaffes, Daniel Hong, Shana Mathew, David Bestor, Michael Cafarella, Vijay Gadepally, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Tim Kraska, Michael Stonebraker, Lalith Suresh, Matei Zaharia:
DBOS: A DBMS-oriented Operating System. 21 - 30. - Arjit Jain, Sunita Sarawagi, Prithviraj Sen:
Deep Indexed Active Learning for Matching Heterogeneous Entity Representations. 31 - 45. - Xuanhe Zhou, Guoliang Li, Chengliang Chai, Jianhua Feng:
A Learned Query Rewrite System using Monte Carlo Tree Search. 46 - 58. - Yin Lin, Brit Youngmann, Yuval Moskovitch, H. V. Jagadish, Tova Milo:
On Detecting Cherry-picked Generalizations. 59 - 71. - Jiayi Wang, Chengliang Chai, Jiabin Liu, Guoliang Li:
FACE: A Normalizing Flow based Cardinality Estimator. 72 - 84. - Ji Sun, Jintao Zhang, Zhaoyan Sun, Guoliang Li, Nan Tang:
Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation. 85 - 97. - Dong He, Maureen Daum, Walter Cai, Magdalena Balazinska:
DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation. 98 - 111. - Subarna Chatterjee, Meena Jagadeesan, Wilson Qin, Stratos Idreos:
Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine. 112 - 126. - Muhammad Adnan, Yassaman Ebrahimzadeh Maboud, Divya Mahajan, Prashant J. Nair:
Accelerating Recommendation System Training by Leveraging Popular Choices. 127 - 140.
Volume 15, No. 2, October 2021
- Juliana Freire and Xuemin Lin:
Front Matter. i - vii. - Jianye Yang, Yun Peng, Wenjie Zhang:
(p,q)-biclique Counting and Enumeration for Large Sparse Bipartite Graphs. 141 - 153. - Dan Graur, Ingo Müller, Mason Proffitt, Ghislain Fourny, Gordon T. Watts, Gustavo Alonso:
Evaluating Query Languages and Systems for High-Energy Physics Data. 154 - 168. - Kongzhang Hao, Long Yuan, Wenjie Zhang:
Distributed Hop-Constrained s-t Simple Path Enumeration at Billion Scale. 169 - 182. - Jingzhi Fang, Yanyan Shen, Yue Wang, Lei Chen:
ETO: Accelerating Optimization of DNN Operators by High-Performance Tensor Program Reuse. 183 - 195. - Philipp Marian Grulich, Steffen Zeuch, Volker Markl:
Babelfish: Efficient Execution of Polyglot Queries. 196 - 210. - Alexander Zhou, Yue Wang, Lei Chen:
Butterfly Counting on Uncertain Bipartite Networks. 211 - 223. - Yue Cui, Kai Zheng, Dingshan Cui, Jiandong Xie, Liwei Deng, Feiteng Huang, Xiaofang Zhou:
METRO: A Generic Graph Neural Network Framework for Multivariate Time Series Forecasting. 224 - 236. - Congcong Ge, Xiaoze Liu, Lu Chen, Baihua Zheng, Yunjun Gao:
LargeEA: Aligning Entities for Large-scale Knowledge Graphs. 237 - 245. - Kejing Lu, Mineichi Kudo, Chuan Xiao, Yoshiharu Ishikawa:
HVS: Hierarchical Graph Structure Based on Voronoi Diagrams for Solving Approximate Nearest Neighbor Search. 246 - 258. - Arif Arman, Dmitri Loguinov:
Origami: A High-Performance Mergesort Framework. 259 - 271. - Renzhi Wu, Bolin Ding, Xu Chu, Zhewei Wei, Xiening Dai, Tao Guan, Jingren Zhou:
Learning to be a Statistician: Learned Estimator for Number of Distinct Values. 272 - 284. - Shangdi Yu, Yiqiu Wang, Yan Gu, Laxman Dhulipala, Julian Shun:
ParChain: A Framework for Parallel Hierarchical Agglomerative Clustering using Nearest-Neighbor Chain. 285 - 298. - Komal Chauhan, Kartik Jain, Sayan Ranu, Srikanta Bedathur, Amitabha Bagchi:
Answering Regular Path Queries through Exemplars. 299 - 311. - Xupeng Miao, Hailin Zhang, Yining Shi, Xiaonan Nie, Zhi Yang, Yangyu Tao, Bin Cui:
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework. 312 - 320. - Pengfei Li, Yu Hua, Jingnan Jia, Pengfei Zuo:
FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems. 321 - 334. - Jiyang Bai, Peixiang Zhao:
TaGSim: Type-aware Graph Similarity Learning and Computation. 335 - 347. - Yuqing Zhu, Jing Tang, Xueyan Tang, Lei Chen:
Analysis of Influence Contribution in Social Advertising. 348 - 360. - Georgios R Theodorakis, Fotios Kounelis, Peter Pietzuch, Holger Pirk:
Scabbard: Single-Node Fault-Tolerant Stream Processing. 361 - 374. - George Konstantinidis, Jet Holt, Adriane Chapman:
Enabling Personal Consent in Databases. 375 - 387.
Volume 15, No. 3, November 2021
- Juliana Freire and Xuemin Lin:
Front Matter. i - vii. - Yejia Liu, Weiyuan Wu, Lampros Flokas, Jiannan Wang, Eugene Wu:
Enabling SQL-based Training Data Debugging for Federated Learning. 388 - 400. - Kapil Vaidya, Anshuman Dutt, Vivek Narasayya, Surajit Chaudhuri:
Leveraging Query Logs and Machine Learning for Parametric Query Optimization. 401 - 413. - Yao Lu, Srikanth Kandula, Arnd Christian König, Surajit Chaudhuri:
Pre-training Summarization Models of Structured Datasets for Cardinality Estimation. 414 - 426. - Susie Xi Rao, Shuai Zhang, Zhichao Han, Zitao Zhang, Wei Min, Zhiyao Chen, Yinan Shan, Yang Zhao, Ce Zhang:
xFraud: Explainable Fraud Transaction Detection. 427 - 436. - Ye Yuan, Delong Ma, Zhenyu Wen, Zhiwei Zhang, Guoren Wang:
Subgraph Matching over Graph Federation. 437 - 450. - Xing Niu, Boris Glavic, Ziyu Liu, Pengyuan Li, Dieter Gawlick, Vasudha Krishnaswamy, Zhen Hua Liu, Danica Porobic:
Provenance-based Data Skipping. 451 - 464. - Di Jin, Bunyamin Sisman, Hao Wei, Xin Luna Dong, Danai Koutra:
Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation. 465 - 477. - Lu Xing, Eric Lee, Tong An, Bo-cheng Chu, Ahmed Mahmood, Ahmed M. Aly, Jianguo Wang, Walid G. Aref:
An Experimental Evaluation and Investigation of Waves of Misery in R-trees. 478 - 490. - Yongyi Liu, Ahmed Mahmood, Amr Magdy, Sergio Rey:
PRUC: P-Regions with User-Defined Constraint. 491 - 503. - Yile Chen, Xiucheng Li, Gao Cong, Cheng Long, Zhifeng Bao, Shang Liu, Wanli Gu, Fuzheng Zhang:
Points-of-Interest Relationship Inference with Spatial-enriched Graph Neural Networks. 504 - 512. - Tsz Nam Chan, Pak Lon Ip, Leong Hou U, Byron Choi, Jianliang Xu:
SAFE: A Share-and-Aggregate Bandwidth Exploration Framework for Kernel Density Visualization. 513 - 526. - Jens Dittrich, Joris Nix, Christian Schön:
The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures. 527 - 540. - Koral Chapnik, Ilya Kolchinsky, Assaf Schuster:
DARLING: Data-Aware Load Shedding in Complex Event Processing Systems. 541 - 554. - Danyang Zhuo, Kaiyuan Zhang, Zhuohan Li, Siyuan Zhuang, Stephanie Wang, Ang Chen, Ion Stoica:
Rearchitecting In-Memory Object Stores for Low Latency. 555 - 568. - Pingchuan Ma, Shuai Wang:
MT-Teql: Evaluating and Augmenting Neural NLIDB on Real-world Linguistic and Schema Variations. 569 - 582. - Jessica Shi, Laxman Dhulipala, Julian Shun:
Theoretically and Practically Efficient Parallel Nucleus Decomposition. 583 - 596. - Baotong Lu, Jialin Ding, Eric Lo, Umar Farooq Minhas, Tianzheng Wang:
APEX: A High-Performance Learned Index on Persistent Memory. 597 - 610. - David Campos, Tung Kieu, Chenjuan Guo, Feiteng Huang, Kai Zheng, Bin Yang, Christian S Jensen:
Unsupervised Time Series Outlier Detection with Diversity-Driven Convolutional Ensembles. 611 - 623. - Xiaoye Miao, Yangyang Wu, Lu Chen, Yunjun Gao, Jun Wang, Jianwei Yin:
Efficient and Effective Data Imputation with Influence Functions. 624 - 632. - Adrian Kochsiek, Rainer Gemulla:
Parallel Training of Knowledge Graph Embedding Models: A Comparison of Techniques. 633 - 645. - Gerardo Vitagliano, Lan Jiang, Felix Naumann:
Detecting Layout Templates in Complex Multiregion Files. 646 - 658. - Kajetan Maliszewski, Jorge Arnulfo Quiane Ruiz, Jonas Traub, Volker Markl:
What Is the Price for Joining Securely? Benchmarking Equi-Joins in Trusted Execution Environments. 659 - 672. - Van Long Ho, Nguyen Ho, Torben Bach Pedersen:
Efficient Temporal Pattern Mining in Big Time Series Using Mutual Information. 673 - 685. - Junhua Zhang, Long Yuan, Wentao Li, Lu Qin, Ying Zhang:
Efficient Label-Constrained Shortest Path Queries on Road Networks: A Tree Decomposition Approach. 686 - 698. - Sahaana Suri, Ihab F Ilyas, Christopher Re, Theodoros Rekatsinas:
Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins. 699 - 712. - Tin Vu, Ahmed Eldawy, Vagelis Hristidis, Vassilis J. Tsotras:
Incremental Partitioning for Efficient Spatial Data Analytics. 713 - 726. - Doris Lee, Dixin Tang, Kunal Agarwal, Thyne Boonmark, Caitlyn Chen, Jake Kang, Ujjaini Mukhopadhyay, Jerry Song, Micah Yong, Marti A. Hearst, Aditya Parameswaran:
Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows. 727 - 738. - Devin Petersohn, Dixin Tang, Rehan S Durrani, Areg Melik-adamyan, Joseph Gonzalez, Anthony Joseph, Aditya Parameswaran:
Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System. 739 - 751.
Volume 15, No. 4, December 2021
- Juliana Freire and Xuemin Lin:
Front Matter. i - vii. - Yuxing Han, Ziniu Wu, Peizhi Wu, Rong Zhu, Jingyi Yang, Liang Wei Tan, Kai Zeng, Gao Cong, Yanzhao Qin, Andreas Pfadler, Zhengping Qian, Jingren Zhou, Jiangneng Li, Bin Cui:
Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation. 752 - 765. - Qizhen Zhang, Philip A Bernstein, Daniel S Berger, Badrish Chandramouli:
Redy: Remote Dynamic Memory Cache. 766 - 779. - Martin Boissier:
Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems. 780 - 793. - Shulong Tan, Weijie Zhao, Ping Li:
Fast Neural Ranking on Bipartite Graph Indices. 794 - 803. - Shaoduo Gan, Xiangru Lian, Rui Wang, Jianbin Chang, Chengjun Liu, Hongmei Shi, Shengzhuo Zhang, Xianghong Li, Tengxu Sun, Jiawei Jiang, Binhang Yuan, Sen Yang, Ji Liu, Ce Zhang:
BAGUA: Scaling up Distributed Learning with System Relaxations. 804 - 813. - Tsz Nam Chan, Pak Lon Ip, Leong Hou U, Byron Choi, Jianliang Xu:
SWS: A Complexity-Optimized Solution for Spatial-Temporal Kernel Density Visualization. 814 - 827. - Junxu Liu, Jian Lou, Li Xiong, Jinfei Liu, Xiaofeng Meng:
Projected Federated Averaging with Heterogeneous Differential Privacy. 828 - 840. - Daniel Haimovich, Dmytro Karamshuk, Thomas J. Leeper, Evgeniy Riabenko, Milan Vojnovic:
Popularity Prediction for Social Media over Arbitrary Time Horizons. 841 - 849. - Ishita Doshi, Dhritiman Das, Ashish Bhutani, Rajeev Kumar, Rushi Bhatt, Niranjan Balasubramanian:
LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System. 850 - 858. - Eduardo H. M. Pena, Eduardo Cunha De Almeida, Felix Naumann:
Fast Detection of Denial Constraint Violations. 859 - 871. - Bowen Yu, Guanyu Feng, Huanqi Cao, Xiaohan Li, Zhenbo Sun, Haojie Wang, Xiaowei Zhu, Weimin Zheng, Wenguang Chen:
Chukonu: A Fully-Featured Big Data Processing System by Efficiently Integrating a Native Compute Engine into Spark. 872 - 885. - Sian Jin, Chengming Zhang, Xintong Jiang, Yunhe Feng, Hui Guan, Guanpeng Li, Shuaiwen Song, Dingwen Tao:
COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression. 886 - 899. - Zitao Li, Bolin Ding, Ce Zhang, Ninghui Li, Jingren Zhou:
Federated Matrix Factorization with Privacy Guarantee. 900 - 913. - Chi Thang Duong, Dung Trung Hoang, Hongzhi Yin, Matthias Weidlich, Quoc Viet Hung Nguyen, Karl Aberer:
Scalable Robust Graph Embedding with Spark. 914 - 922. - Debjyoti Paul, Jie Cao, Feifei Li, Vivek Srikumar:
Database Workload Characterization with Query Plan Encoders. 923 - 935. - Abhishek Modi, Kaushik Rajan, Srinivas Thimmaiah, Prakhar Jain, Swinky Mann, Ayushi Agarwal, Ajith Shetty, Shahid K I, Ashit Gosalia, Partho Sarthi:
New Query Optimization Techniques in the Spark Engine of Azure Synapse. 936 - 948. - Phanwadee Sinthong, Dhaval Patel, Nianjun Zhou, Shrey Shrivastava, Arun Iyengar, Anuradha Bhamidipaty:
DQDF: Data-Quality-Aware Dataframes. 949 - 957. - Archita Agarwal, Marilyn George, Aaron R Jeyaraj, Malte Schwarzkopf:
Retrofitting GDPR Compliance onto Legacy Databases. 958 - 970. - Xinle Wu, Dalin Zhang, Chenjuan Guo, Chaoyang He, Bin Yang, Christian S Jensen:
AutoCTS: Automated Correlated Time Series Forecasting. 971 - 983. - Sivaprasad Sudhir, Michael Cafarella, Samuel Madden:
Replicated Layout for In-Memory Database Systems. 984 - 997.
Volume 15, No. 5, January 2022
- Fatma Ozcan, Juliana Freire, and Xuemin Lin:
Front Matter. i - vi. - Anupam Sanghi, Shadab Ahmed, Jayant R Haritsa:
Projection-Compliant Database Generation. 998 - 1010. - Guodong Jin, Semih Salihoglu:
Making RDBMSs Efficient on Graph Workloads Through Predefined Joins. 1011 - 1023. - Shaleen Deep, Xiao Hu, Paraschos Koutris:
Ranked Enumeration of Join Queries with Projections. 1024 - 1037. - Ahnjae Shin, Joo Seong Jeong, Do Yoon Kim, Soyoung Jung, Byung-gon Chun:
Hippo: Sharing Computations in Hyper-Parameter Optimization. 1038 - 1052. - Arik Rinberg, Tomer Solomon, Roee Shlomo, Guy Khazma, Gal Lushi, Idit Keidar, Paula Ta-shma:
DSON: JSON CRDT Using Delta-Mutations For Document Stores. 1053 - 1065. - Sepanta Zeighami, Ritesh Ahuja, Gabriel Ghinita, Cyrus Shahabi:
A Neural Database for Differentially Private Spatial Range Queries. 1066 - 1078. - Marcel Maltry, Jens Dittrich:
A Critical Analysis of Recursive Model Indexes. 1079 - 1091. - Zerui Ge, Dumitrel Loghin, Beng Chin Ooi, Pingcheng Ruan, Tianwen Wang:
Hybrid Blockchain Database Systems: Design and Performance. 1092 - 1104. - Angela Bonifati, Stefania Dumbrava, George Fletcher, Jan Hidders, Matthias Hofer, Wim Martens, Filip Murlak, Joshua Shinavier, Sławek Staworko, Dominik Tomaszuk:
Threshold Queries in Theory and in the Wild. 1105 - 1118. - Moritz Sichert, Thomas Neumann:
User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases. 1119 - 1131.