Volume 13, 2019-2020

Editors-in-Chief:
Magdalena Balazinska and Xiaofang Zhou
Associate Editors:
Azza Abouzied, Amr El Abbadi, Phil Bernstein, Xin Luna Dong, Zi (Helen) Huang, Nick Koudas, Georgia Koutrika, Guoliang Li, Alexandra Meliou, Felix Naumann, Dan Olteanu, M. Tamer Özsu, Aditya Parameswaran, Andy Pavlo, Xiaokui Xiao, Jeffrey Xu Yu, Meihui Zhang, Jingren Zhou
Review Board:

Volume 13, No. 1

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

1 - 14

Revenue Maximization for Query Pricing

Shuchi Chawla, Shaleen Deep, Paraschos Koutris, Yifeng Teng

15 - 28

Realtime Top-k Personalized PageRank over Large Graphs on GPUs

Jieming Shi, Renchi Yang, Tianyuan Jin, Xiaokui Xiao, Yin Yang

29 - 42

Fast Large-Scale Trajectory Clustering

Sheng Wang, Zhifeng Bao, J. Shane Culpepper, Timos Sellis, Xiaolin Qin

43 - 56

Automating Distributed Tiered Storage Management in Cluster Computing

Herodotos Herodotou, Elena Kakoulli

57 - 70

APOLLO: Automatic Detection and Diagnosis of Performance Regressions in Database Systems

Jinho Jung, Hong Hu, Joy Arulraj, Taesoo Kim, Woonhak Kang

71 - 85

Lowering the Latency of Data Processing Pipelines Through FPGA based Hardware Acceleration

Muhsen Owaida, Gustavo Alonso, Laura Fogliarini, Anthony Hock-Koon, Pierre-Etienne Melet

Volume 13, No. 3

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

226 - 238

Interleaved Multi-Vectorizing

Zhuhe Fang, Beilei Zheng, Chuliang Weng

239 - 251

A Unified Optimization Algorithm For Solving Regret-Minimizing Representative Problems

Suraj Shetiya, Abolfazl Asudeh, Sadia Ahmed, Gautam Das

252 - 265

Pushing Data-Induced Predicates Through Joins in Big-Data Clusters

Laurel Orr, Srikanth Kandula, Surajit Chaudhuri

266 - 278

Discovery of Approximate (and Exact) Denial Constraints

Eduardo Pena, Eduardo Cunha De Almeida, Felix Naumann

279 - 292

Deep Unsupervised Cardinality Estimation

Zongheng Yang, Eric Liang, Amog Kamsetty, Chenggang Wu, Yan Duan, Peter Chen, Pieter Abbeel, Joseph Hellerstein, Sanjay Krishnan, Ion Stoica

293 - 306

Free Gap Information from the Differentially Private Sparse Vector and Noisy Max Mechanisms

Zeyu Ding, Yuxin Wang, Danfeng Zhang, Dan Kifer

307 - 319

An End-to-End Learning-based Cost Estimator

Ji Sun, Guoliang Li

320 - 333

Last-Mile Delivery Made Practical: An Efficient Route Planning Framework with Theoretical Guarantees

Yuxiang Zeng, Yongxin Tong, Lei Chen

334 - 347

Database Processing-in-Memory: An Experimental Study

Tiago Kepe, Eduardo Cunha De Almeida, Marco Alves

348 - 361

Incorporating Super-Operators in Big-Data Query Optimizers

Jyoti Leeka, Kaushik Rajan

362 - 375

Efficient Progressive Minimum k-core Search

Conggai Li, Fan Zhang, Ying Zhang, Lu Qin, Wenjie Zhang, Xuemin Lin

376 - 389

Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection

Hang Zhu, Zhihao Bai, Jialin Li, Ellis Michael, Dan Ports, Ion Stoica, Xin Jin

390 - 402

Learning to Sample: Counting with Complex Queries

Brett Walenz, Stavros Sintos, Sudeepa Roy, Jun Yang

403 - 420

Return of the Lernaean Hydra: Experimental Evaluation of Data Series Approximate Similarity Search

Karima Echihabi, Kostas Zoumpatianos, Themis Palpanas, Houda Benbrahim

Volume 13, No. 4

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

421 - 434

DPTree: Differential Indexing for Persistent Memory

Xinjing Zhou, Lidan Shou, Ke Chen, Wei Hu, Gang Chen

435 - 448

AJoin: Ad-hoc Stream Joins at Scale

Jeyhun Karimov, Tilmann Rabl, Volker Markl

449 - 462

On Performance Stability in LSM-based Storage Systems

Chen Luo, Michael Carey

463 - 476

Hop-constrained s-t Simple Path Enumeration: Towards Bridging Theory and Practice

You Peng, Ying Zhang, Xuemin Lin, Wenjie Zhang, Lu Qin, Jingren Zhou

477 - 491

Panorama: A Data System for Unbounded Vocabulary Querying over Video

Yuhao Zhang, Arun Kumar

492 - 505

Planting Trees for scalable and efficient Canonical Hub Labeling

Kartik Lakhotia, Rajgopal Kannan, Qing Dong, Viktor Prasanna

506 - 518

Operationalizing Individual Fairness with Pairwise Fair Representations

Preethi Lahoti, Krishna Gummadi, Gerhard Weikum

519 - 532

Optimizing Databases by Learning Hidden Parameters of Solid State Drives

Aarati Kakaraparthy, Jignesh Patel, Kwanghyun Park, Brian Kroth

533 - 546

BlazeIt: Optimizing Declarative Aggregation and Limit Queries for Neural Network-Based Video Analytics

Daniel Kang, Peter Bailis, Matei Zaharia

547 - 560

Join on Samples: A Theoretical Guide for Practitioners

Dawei Huang, Dong Young Yoon, Seth Pettie, Barzan Mozafari

561 - 573

Mining an Anti-Knowledge Base from Wikipedia Updates with Applications to Fact Checking and Beyond

Georgios Karagiannis, Immanuel Trummer, Saehan Jo, Shubham Khandelwal, Xuezhi Wang, Cong Yu

574 - 587

Evaluating Persistent Memory Range Indexes

Lucas Lersch, Xiangpeng Hao, Ismail Oukid, Tianzheng Wang, Thomas Willhalm

Volume 13, No. 5

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

588 - 601

A.M.B.R.O.S.I.A: Providing Performant Virtual Resiliency for Distributed Applications

Jonathan Goldstein, Ahmed Abdelhamid, Mike Barnett, Sebastian Burckhardt, Badrish Chandramouli, Darren Gehring, Niel Lebeck, Christopher Meiklejohn, Umar Farooq Minhas, Ryan Newton, Rahee Peshawaria, Tal Zaccai, Irene Zhang

602 - 615

Efficient Shortest Path Index Maintenance on Dynamic Road Networks with Theoretical Guarantees

Dian Ouyang, Long Yuan, Lu Qin, Lijun Chang, Ying Zhang, Xuemin Lin

616 - 628

ParPaRaw: Massively Parallel Parsing of Delimiter-Separated Raw Data

Elias Stehle, Hans-Arno Jacobsen

629 - 642

Opportunities for Optimism in Contended Main-Memory Multicore Transactions

Yihe Huang, William Qian, Eddie Kohler, Barbara Liskov, Liuba Shrira

643 - 655

PM-LSH: A Fast and Accurate LSH Framework for High-Dimensional Approximate NN Search

Bolong Zheng, Xi Zhao, Lianggui Weng, Hung Nguyen Quoc Viet, Hang Liu, Christian Jensen

656 - 669

Hunting multiple bumps in graphs

Yahui Sun, Jun Luo, Theodoros Lappas, Xiaokui Xiao, Bin Cui

670 - 683

Homogeneous Network Embedding for Massive Graphs via Reweighted Personalized PageRank

Renchi Yang, Jieming Shi, Xiaokui Xiao, Yin Yang, Sourav S. Bhowmick

684 - 697

Pattern Functional Dependencies for Data Cleaning

Abdulhakim Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker

698 - 711

MEGA: Multi-View Semi-Supervised Clustering of Hypergraphs

Joyce Whang, Rundong Du, Sangwon Jung, Geon Lee, Barry Drake, Qingqing Liu, Seonggoo Kang, Haesun Park

712 - 725

MDedup: Duplicate Detection with Matching Dependencies

Ioannis Koumarelas, Thorsten Papenbrock, Felix Naumann

726 - 739

Programmable View Update Strategies on Relations

Van-Dang Tran, Hiroyuki Kato, Zhenjiang Hu

740 - 753

Amber: A Debuggable Dataflow System Based on the Actor Model

Avinash Kumar, Zuozhi Wang, Shengquan Ni, Chen Li

754 - 767

Dynamic Speculative Optimizations for SQL Compilation in Apache Spark

Filippo Schiavio, Daniele Bonetta, Walter Binder

768 - 782

Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series

Mourad Khayati, Alberto Lerner, Zakhar Tymchenko, Philippe Cudre-Mauroux

Volume 13, No. 6

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

783 - 797

Graphite: A NUMA-aware HPC System for Graph Analytics Based on a new MPI * X Parallelism Model

Mohammad Hasanzadeh Mofrad, Rami Melhem, Muhammad Yousuf Ahmad, Mohammad Hammoud

798 - 811

Personal Insights for Altering Decisions of Tree-based Ensembles over Time

Nave Frost, Naama Boer, Daniel Deutch, Tova Milo

812 - 825

Answering Billion-Scale Label-Constrained Reachability Queries within Microsecond

You Peng, Ying Zhang, Xuemin Lin, Lu Qin, Wenjie Zhang

826 - 839

Effective and Efficient Retrieval of Structured Entities

Ruihong Huang, Shaoxu Song, Yunsu Lee, Jungho Park, Soo-Hyung Kim, Sungmin Yi

840 - 853

Micro-architectural Analysis of OLAP: Limitations and Opportunities

Utku Sirin, Anastasia Ailamaki

854 - 867

Effective and Efficient Community Search over Large Heterogeneous Information Networks

Yixiang Fang, Yixing Yang, Wenjie Zhang, Xuemin Lin, Xin Cao

868 - 883

ResilientDB: Global Scale Resilient Blockchain Fabric

Suyash Gupta, Sajjad Rahnama, Jelle Hellings, Mohammad Sadoghi

884 - 897

Data-Parallel Query Processing on Non-Uniform Data

Henning Funke, Jens Teubner

898 - 911

Evaluating Memory-Hard Proof-of-Work Algorithms on Three Processors

Zonghao Feng, Qiong Luo

912 - 924

Approximate Summaries for Why and Why-not Provenance

Seokki Lee, Bertram Ludaescher, Boris Glavic

925 - 938

PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage

Hao Jiang, Chunwei Liu, Qi Jin, John Paparrizos, Aaron Elmore

939 - 952

On Detecting Cherry-picked Trendlines

Abolfazl Asudeh, H. Jagadish, You Wu, Cong Yu

Volume 13, No. 7

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

953 - 965

Data-Driven Domain Discovery for Structured Datasets

Masayo Ota, Heiko Mueller, Juliana Freire, Divesh Srivastava

966 - 978

Realtime Index-Free Single Source SimRank Processing on Web-Scale Graphs

Jieming Shi, Tianyuan Jin, Renchi Yang, Xiaokui Xiao, Yin Yang

979 - 991

Demand-Aware Route Planning for Shared Mobility Services

Jiachuan Wang, Peng Cheng, Libin Zheng, Chao Feng, Lei Chen, Xuemin Lin, Zheng Wang

992 - 1005

DeepDB: Learn from Data, not from Queries!

Benjamin Hilprecht, Andreas Schmidt, Moritz Kulessa, Alejandro Molina, Kristian Kersting, Carsten Binnig

1006 - 1019

Data Migration using Datalog Program Synthesis

Yuepeng Wang, Rushi Shah, Abby Criswell, Rong Pan, Isil Dillig

1020 - 1034

LiveGraph: A Transactional Graph Storage System with Purely Sequential Adjacency List Scans

Xiaowei Zhu, Marco Serafini, Xiaosong Ma, Ashraf Aboulnaga, Wenguang Chen, Guanyu Feng

1035 - 1049

KBPearl: A Knowledge Base Population System Supported by Joint Entity and Relation Linking

Xueling Lin, Haoyang Li, Hao Xin, Zijian Li, Lei Chen

1050 - 1063

Compression of Uncertain Trajectories in Road Networks

Tianyi Li, Ruikai Huang, Lu Chen, Christian Jensen, Torben Pedersen

1064 - 1077

Understanding and Benchmarking the Impact of GDPR on Database Systems

Supreeth Shastri, Vinay Banakar, Melissa Wasserman, Arun Kumar, Vijay Chidambaram

1078 - 1090

LB+-Trees: Optimizing Persistent Index Performance on 3DXPoint Memory

Jihang Liu, Shimin Chen, Lujun Wang

1091 - 1104

Enabling Low Tail Latency on Multicore Key-Value Stores

Lucas Lersch, Ivan Schreter, Ismail Oukid, Wolfgang Lehner

1105 - 1118

Approximate Analytics System over Compressed Time Series with Tight Deterministic Error Guarantees

Chunbin Lin, Etienne Boursier, Yannis Papakonstantinou

1119 - 1133

Traversing Large Graphs on GPUs with Unified Memory

Prasun Gera, Hyojong Kim, Piyush Sao, Hyesoon Kim, David Bader

1134 - 1146

Supporting Hard Queries over Probabilistic Preferences

Haoyue Ping, Julia Stoyanovich, Benny Kimelfeld

Volume 13, No. 8

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

1147 - 1161

Dash: Scalable Hashing on Persistent Memory

Baotong Lu, Xiangpeng Hao, Tianzheng Wang, Eric Lo

1162 - 1175

The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds

Paolo Ferragina, Giorgio Vinciguerra

1176 - 1189

Diagnosing Root Causes of Intermittent Slow Queries in Large-Scale Cloud Databases

Minghua Ma, Zheng Yin, Shenglin Zhang, Sheng Wang, Christopher Zheng, Xinhao Jiang, Hanwen Hu, Cheng Luo, Yilin Li, Nengjun Qiu, Feifei Li, Changcheng Chen, Dan Pei

1190 - 1205

Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU

Xuhao Chen, Roshan Dathathri, Gurbinder Gill, Keshav Pingali

1206 - 1220

Quantifying TPC-H Choke Points and Their Optimizations

Markus Dreseler, Martin Boissier, Tilmann Rabl, Matthias Uflacker

1221 - 1233

Efficient Algorithms for Crowd-Aided Categorization

Yuanbing Li, Xian Wu, Yifei Jin, Jian Li, Guoliang Li

1234 - 1247

Set-valued Data Publication with Local Privacy: Tight Error Bounds and Efficient Mechanisms

Shaowei Wang, Yuqiu Qian, Jiachun Du, Wei Yang, Liusheng Huang, Hongli Xu

1248 - 1260

Translation of Array-Based Loops to Distributed Data-Parallel Programs

Leonidas Fegaras, Md Hasanuzzaman Noor

1261 - 1274

Incrementalization of Graph Partitioning Algorithms

Wenfei Fan, Muyang Liu, Chao Tian, Ruiqi Xu, Jingren Zhou

1275 - 1289

Optimizing Item and Subgroup Configurations for Social-Aware VR Shopping

Shao-Heng Ko, Hsu-Chao Lai, Hong-Han Shuai, Wang-Chien Lee, Philip S. Yu, De-Nian Yang

1290 - 1303

Efficient Confidentiality-Preserving Data Analytics over Symmetrically Encrypted Datasets

Savvas Savvides, Darshika Khandelwal, Patrick Eugster

1304 - 1318

Single Machine Graph Analytics on Massive Datasets Using Intel Optane DC Persistent Memory

Gurbinder Gill, Roshan Dathathri, Loc Hoang, Ramesh Peri, Keshav Pingali

Volume 13, No. 9

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

1319 - 1331

Atomic Commitment Across Blockchains

Victor Zakhary, Divy Agrawal, Amr El Abbadi

1332 - 1345

HydraList: A Scalable In-Memory Index Using Asynchronous Updates and Partial Replication

Ajit Mathew, Changwoo Min

1346 - 1358

eXtreme Modelling in Practice

Judah Schvimer, A. Jesse Jiryu Davis, Max Hirschhorn

1359 - 1372

Maximum Biclique Search at Billion Scale

Bingqing Lyu, Lu Qin, Xuemin Lin, Ying Zhang, Zhengping Qian, Jingren Zhou

1373 - 1387

ARDA: Automatic Relational Data Augmentation for Machine Learning

Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David Karger

1388 - 1400

An LSM-based Tuple Compaction Framework for Apache AsterixDB

Wail Alkowaileet, Sattam Alsubaiee, Michael Carey

1401 - 1415

ADnEV: Cross-Domain Schema Matching using Deep Similarity Matrix Adjustment and Evaluation

Roee Shraga, Avigdor Gal, Haggai Roitman

1416 - 1428

Query Performance Prediction for Concurrent Queries using Graph Embedding

Xuanhe Zhou, Ji Sun, Guoliang Li, Jianhua Feng

1429 - 1442

Scalable, NearZero Loss Disaster Recovery for Distributed Data Stores

Ahmed Alquraan, Alex Kogan, Virendra Marathe, Samer Al-Kiswany

1443 - 1455

VHP: Approximate Nearest Neighbor Search via Virtual Hypersphere Partitioning

Kejing Lu, Hongya Wang, Wei Wang, Mineichi Kudo

1456 - 1468

IDAR: Fast Supergraph Search Using DAG Integration

Hyunjoon Kim, Seunghwan Min, Kunsoo Park, Xuemin Lin, Seok-Hee Hong, Wook-Shin Han

1469 - 1482

Guided Exploration of User Groups

Mariia Seleznova, Behrooz Omidvar-Tehrani, Sihem Amer-Yahia, Eric Simon

1483 - 1497f

iDEC: Indexable Distance Estimating Codes for Approximate Nearest Neighbor Search

Long Gong, Huayi Wang, Mitsunori Ogihara, Jun Xu

1498 - 1510

Efficient Algorithms for Budgeted Influence Maximization on Massive Social Networks

Song Bian, Qintian Guo, Sibo Wang, Jeffrey Xu Yu

1511 - 1524

Mining Top-k Pairs of Correlated Subgraphs in a Large Network

Arneish Prateek, Arijit Khan, Akshit Goyal, Sayan Ranu

1525 - 1539

FireLedger: A High Throughput Blockchain Consensus Protocol

Yehonatan Buchnik, Roy Friedman

1540 - 1554

Put an Elephant into a Fridge: Optimizing Cache Efficiency for In-memory Key-value Stores

Kefei Wang, Jian Liu, Feng Chen

1555 - 1567

Anytime Stochastic Routing with Hybrid Learning

Simon Aagaard Pedersen, Bin Yang, Christian Jensen

1568 - 1581

Understanding the Effect of Data Center Resource Disaggregation on Production DBMSs

Qizhen Zhang, Yifan Cai, Xinyi Chen, Sebastian Angel, Ang Chen, Vincent Liu, Boon Loo

1582 - 1597

Optimal Algorithms for Ranked Enumeration of Answers to Full Conjunctive Queries

Nikolaos Tziavelis, Deepak Ajwani, Wolfgang Gatterbauer, Mirek Riedewald, Xiaofeng Yang

1598 - 1613

Sage: Parallel Semi-Asymmetric Graph Algorithms for NVRAMs

Laxman Dhulipala, Charles Mcguffey, Hongbo Kang, Yan Gu, Guy Blelloch, Phillip Gibbons, Julian Shun

Volume 13, No. 10

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - vi

1614 - 1627

Pricing Influential Nodes in Online Social Networks

Yuqing Zhu, Jing Tang, Xueyan Tang

1628 - 1640

KClist++: A Simple Algorithm for Finding k-Clique Densest Subgraphs in Large Graphs

Bintao Sun, Maximilien Dansich, Hubert Chan, Mauro Sozio

1641 - 1653

Dynamic Interleaving of Content and Structure for Robust Indexing of Semi-Structured Hierarchical Data

Kevin Wellenzohn, Michael Böhlen, Sven Helmer

1654 - 1668

ChiSeL: Graph Similarity Search using Chi-Squared Statistics in Large Probabilistic Graphs

Shubhangi Agarwal, Sourav Dutta, Arnab Bhattacharya

1669 - 1681

Fast Incremental Discovery of Pointwise Order Dependencies

Zijing Tan, Ai Ran, Shuai Ma, Sheng Qin

1682 - 1695

Approximate Denial Constraints

Ester Livshits, Alireza Heidari, Ihab Ilyas, Benny Kimelfeld

1696 - 1708

Sharing Opportunities for OLTP Workloads in Different Isolation Levels

Robin Rehrmann, Carsten Binnig, Alexander Boehm, Kihong Kim, Wolfgang Lehner

1709 - 1722

Biclustering and Boolean Matrix Factorization in Data Streams

Stefan Neumann, Pauli Miettinen

1723 - 1736

Effective and Efficient Relational Community Detection and Search in Large Dynamic Heterogeneous Information Networks

Xun Jian, Yue Wang, Lei Chen

1737 - 1750

Natural language to SQL: Where are we today?

Hyeonji Kim, Byeong-Hoon So, Wook-Shin Han, Hongrae Lee

1751 - 1764

Accelerating Truss Decomposition on Heterogeneous Processors

Yulin Che, Zhuohang Lai, Shixuan Sun, Yue Wang, Qiong Luo

1765 - 1778

Searching a Database of Source Codes Using Contextualized Code Search

Rohan Mukherjee, Chris Jermaine, Swarat Chaudhuri

1779 - 1792

Data Stream Event Prediction Based on Timing Knowledge and State Transitions

Yan Li, Tingjian Ge, Cindy Chen

1793 - 1806

Shared Arrangements: practical inter-query sharing for streaming dataflows

Frank Mcsherry, Andrea Lattuada, Malte Schwarzkopf, Timothy Roscoe

Volume 13, No. 11

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - viii

1807 - 1820

SmartBench: A Benchmark For Data Management In Smart Spaces

Peeyush Gupta, Michael Carey, Sharad Mehrotra, Roberto Yus

1821 - 1834

Series2Graph: Graph-based Subsequence Anomaly Detection for Time Series

Paul Boniol, Themis Palpanas

1835 - 1848

Sato: Contextual Semantic Type Detection in Tables

Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan

1849 - 1862

TransNet: Training Privacy-Preserving Neural Network over Transformed Layer

Qijian He, Wei Yang, Bingren Chen, Yangyang Geng, Liusheng Huang

1863 - 1876

Capturing Associations in Graphs

Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Chao Tian, Jingren Zhou

1877 - 1890

Dynamic Parameter Allocation in Parameter Servers

Alexander Renz-Wieland, Rainer Gemulla, Steffen Zeuch, Volker Markl

1891 - 1904

Adopting Worst-Case Optimal Joins in Relational Database Systems

Michael Freitag, Maximilian Bandle, Tobias Schmidt, Alfons Kemper, Thomas Neumann

1905 - 1918

A workload-adaptive mechanism for linear queries under local differential privacy

Ryan Mckenna, Raj Kumar, Arya Mazumdar, Gerome Miklau

1919 - 1932

SPORES: Sum-Product Optimization via Relational Equality Saturation for Large Scale Linear Algebra

Yisu Wang, Shana Hutchison, Dan Suciu, Bill Howe, Jonathan Leang

1933 - 1947

Data Market Platforms: Trading Data Assets to Solve Data Problems

Raul Castro Fernandez, Pranav Subramaniam, Michael Franklin

1948 - 1961

Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning

Mohammad Mahdavi, Ziawasch Abedjan

1962 - 1975

Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration

Ju Fan, Tongyu Liu, Guoliang Li, Junyou Chen, Yuwei Shen, Xiaoyong Du

1976 - 1989

Leaper: A Learned Prefetcher for Cache Invalidation in LSM-tree based Storage Engines

Lei Yang, Hong Wu, Tieying Zhang, Xuntao Cheng, Feifei Li, Lei Zou, Yujie Wang, Rongyao Chen, Jianying Wang, Gui Huang

1990 - 2003

Approximate Selection with Guarantees using Proxies

Daniel Kang, Edward Gan, Peter Bailis, Tatsunori Hashimoto, Matei Zaharia

2004 - 2017

2R: Efficiently Isolating Cold Pages in Flash Storages

Minji Kang, Soyee Choi, Gihwan Oh, Sang Won Lee

2018 - 2032

Knowledge Translation

Bahar Ghadiri Bashardoost, Renée J. Miller, Kelly Lyons, Fatemeh Nargesian

2033 - 2046

Towards Scalable Dataframe Systems

Devin Petersohn, William Ma, Doris Lee, Stephen Macke, Doris Xin, Xiangxi Mo, Joseph Gonzalez, Joseph Hellerstein, Anthony Joseph, Aditya Parameswaran

2047 - 2060

Aria: A Fast and Practical Deterministic OLTP Database

Yi Lu, Xiangyao Yu, Lei Cao, Samuel Madden

2061 - 2074

The Computation of Optimal Subset Repairs

Dongjing Miao, Zhipeng Cai, Jianzhong Li, Xiangyu Gao, Xianmin Liu

2075 - 2089

Pytheas: Pattern-based Table Discovery in CSV Files

Christina Christodoulakis, Eric Munson, Moshe Gabel, Angela Demke Brown, Renée J. Miller

2090 - 2103

Privacy Preserving Vertical Federated Learning for Tree-based Models

Yuncheng Wu, Shaofeng Cai, Xiaokui Xiao, Gang Chen, Beng Chin Ooi

2104 - 2117

Topic-based Community Search over Spatial-Social Networks

Ahmed Al-Baghdadi, Xiang Lian

2118 - 2131

LOG-Means: Efficiently Estimating the Number of Clusters in Large Datasets

Manuel Fritz, Michael Behringer, Holger Schwarz

2132 - 2145

Efficient Oblivious Database Joins

Simeon Krastnikov, Florian Kerschbaum, Douglas Stebila

2146 - 2158

Evaluating Top-k Queries with Inconsistency Degrees

Ousmane Issa, Angela Bonifati, Farouk Toumani

2159 - 2173

Cerebro: A Data System for Optimized Deep Learning Model Selection

Supun Nakandala, Yuhao Zhang, Arun Kumar

2174 - 2187

CoopStore: Optimizing Precomputed Summaries for Aggregation

Edward Gan, Peter Bailis, Moses Charikar

2188 - 2201

Fast Subtrajectory Similarity Search in Road Networks under Weighted Edit Distance Constraints

Satoshi Koide, Chuan Xiao, Yoshiharu Ishikawa

2202 - 2214

SimTab: Accuracy-Guaranteed SimRank Queries through Tighter Confidence Bounds and Multi-Armed Bandits

Yu Liu, Lei Zou, Qian Ge, Zhewei Wei

2215 - 2228

Efficiently Approximating Selectivity Functions using Low Overhead Regression Models

Anshuman Dutt, Chi Wang, Vivek Narasayya, Surajit Chaudhuri

2229 - 2242

Identifying Insufficient Data Coverage in Databases with Multiple Relations

Yin Lin, Yifan Guan, Abolfazl Asudeh, H. Jagadish

2243 - 2255

Continuously Monitoring Alternative Shortest Paths on Road Networks

Lingxiao Li, Muhammad Aamir Cheema, Mohammed Eunus Ali, Hua Lu, David Taniar

2256 - 2269

Hypergraph Motifs: Concepts, Algorithms, and Discoveries

Geon Lee, Jihoon Ko, Kijung Shin

2270 - 2283

Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery

Johann Birnick, Thomas Bl√§sius, Tobias Friedrich, Felix Naumann, Thorsten Papenbrock, Martin Schirneck

2284 - 2296

SSTD: A Distributed System on Streaming Spatio-Textual Data

Yue Chen, Zhida Chen, Gao Cong, Ahmed Mahmood, Walid Aref

2297 - 2311

Continuous Prefetch for Interactive Data Applications

Haneen Mohammed, Ziyun Wei, Ravi Netravali, Eugene Wu

2312 - 2325

Efficient and Effective Similar Subtrajectory Search with Deep Reinforcement Learning

Zheng Wang, Cheng Long, Gao Cong, Yiding Liu

2326 - 2340

A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs

Zequn Sun, Qingheng Zhang, Wei Hu, Chengming Wang, Muhao Chen, Farahnaz Akrami, Chengkai Li

2341 - 2354

Effectively Learning Spatial Indices

Jianzhong Qi, Guanli Liu, Christian Jensen, Lars Kulik

2355 - 2367

Stable Learned Bloom Filters for Data Streams

Qiyu Liu, Libin Zheng, Yanyan Shen, Lei Chen

2368 - 2381

Auto-Transform: Learning-to-Transform by Patterns

Yeye He, Zhongjun Jin, Surajit Chaudhuri

2382 - 2395

Magic mirror in my hand, which is the best in the land? An Experimental Evaluation of Index Selection Algorithms

Jan Kossmann, Stefan Halfpap, Marcel Jankrift, Rainer Schlosser

2396 - 2410

MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model

Patrick Damme, Annett Ungethüm, Johannes Pietrzyk, Alexander Krause, Dirk Habich, Wolfgang Lehner

2411 - 2423

Fast and Effective Distribution-Key Recommendation for Amazon Redshift

Panos Parchas, Yonatan Naamad, Peter Van Bouwel, Christos Faloutsos, Michalis Petropoulos

2424 - 2437

Sieve: A Middleware Approach to Scalable Access Control for Database Management Systems

Primal Pappachan, Roberto Yus, Sharad Mehrotra, Johann-Christoph Freytag

2438 - 2452

Cloudburst: Stateful Functions-as-a-Service

Vikram Sreekanti, Chenggang Wu, Charles Lin, Johann Schleier-Smith, Joseph Gonzalez, Joseph Hellerstein, Alexey Tumanov

2453 - 2465

ODIN: Automated Drift Detection and Recovery in Video Analytics

Abhijit Suprem, Joy Arulraj, Calton Pu, Jo√£o Eduardo Ferreira

Volume 13, No. 12

Magdalena Balazinska and Xiaofang Zhou: Front Matter i - viii

2801 - 2804

Demand-based Sensor Data Gathering with Multi-Query Optimization

Julius Hülsmann, Jonas Traub, Volker Markl

2805 - 2808

CheetahVIS: A Visual Analytical System for Large Urban Bus Data

Wentao Ning, Qiandong Tang, Yi Zhao, Chuan Yang, Xiaofeng Wang, Teng Wang, Haotian Liu, Chaozu Zhang, Zhiyuan Zhou, Qiaomu Shen, Bo Tang

2809 - 2812

UNMASQUE: A Hidden SQL Query Extractor

Kapil Khurana, Jayant Haritsa

2813 - 2816

G3: When Graph Neural Networks Meet Parallel Graph Processing Systems on GPUs

Husong Liu, Shengliang Lu, Xinyu Chen, Bingsheng He

2817 - 2820

PiBench Online: Interactive Benchmarking of Persistent Memory Indexes

Xiangpeng Hao, Lucas Lersch, Tianzheng Wang, Ismail Oukid

2821 - 2824

VisClean: Interactive Cleaning for Progressive Visualization

Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li

2825 - 2828

IMO: A Toolbox for Simulating and Querying ``Infected'' Moving Objects

Jianqiu Xu, Hua Lu, Zhifeng Bao

2829 - 2832

COUNTATA: Dataset Labeling Using Pattern Counts

Yuval Moskovitch, H. Jagadish

2833 - 2836

A Demonstration of Willump: A Statistically-Aware End-to-end Optimizer for Machine Learning Inference

Peter Kraft, Daniel Kang, Deepak Narayanan, Shoumik Palkar, Peter Bailis, Matei Zaharia

2837 - 2840

Ease.ml/snoopy in Action: Towards Automatic Feasibility Analysis for Machine Learning Application Development

Cedric Renggli, Luka Rimanic, Luka Kolar, Wentao Wu, Ce Zhang

2841 - 2844

DeepTrack: Monitoring and Exploring Spatio-Temporal Data - A Case of Tracking COVID-19 -

Yuyu Luo, Wenbo Li, Tianyu Zhao, Xiang Yu, Lixi Zhang, Guoliang Li, Nan Tang

2845 - 2848

DF-Toolkit: Interacting with Low-Level Database Storage

James Wagner, Alexander Rasin, Karen Heart, Tanu Malik, Jonathan Grier

2849 - 2852

Like Water and Oil: With a Proper Emulsifier, Query Compilation and Data Parallelism Will Mix Well

Henning Funke, Jens Teubner

2853 - 2856

SQUARES : A SQL Synthesizer Using Query Reverse Engineering

Pedro Orvalho, Miguel Terra-Neves, Miguel Ventura, Ruben Martins, Vasco Manquinho

2857 - 2860

SuccinctEdge: A Succinct RDF Store for Edge Computing

Weiqin Xu, Olivier Cure, Philippe Calvez

2861 - 2864

SuDocu: Summarizing Documents by Example

Anna Fariha, Matteo Brucato, Peter Haas, Alexandra Meliou

2865 - 2868

CONCIERGE: Improving Constrained Search Results by Data Melioration

Ido Guy, Tova Milo, Slava Novgorodov, Brit Youngmann

2869 - 2872

Demonstrating the Voice-Based Exploration of Large Data Sets with CiceroDB-Zero

Immanuel Trummer

2873 - 2876

FASTS: A Satisfaction-Boosting Bus Scheduling Assistant

Songsong Mo, Zhifeng Bao, Baihua Zheng, Zhiyong Peng

2877 - 2880

Vaas: Video Analytics At Scale

Favyen Bastani, Oscar Moll, Samuel Madden

2881 - 2884

sPaQLTooLs: A Stochastic Package Query Interface for Scalable Constrained Optimization

Matteo Brucato, Miro Mannino, Azza Abouzied, Peter Haas, Alexandra Meliou

2885 - 2888

ActiveDeeper: A Model-based Active Data Enrichment System

Liang Zhao, Qingcan Li, Pei Wang, Jiannan Wang, Eugene Wu

2889 - 2892

RDFFrames: Knowledge Graph Access for Machine Learning Tools

Aisha Mohamed, Ghadeer Abuoda, Abdurrahman Ghanem, Zoi Kaoudi, Ashraf Aboulnaga

2893 - 2896

Scalable, Resilient and Configurable Permissioned Blockchain Fabric

Sajjad Rahnama, Suyash Gupta, Thamir Qadah, Jelle Hellings, Mohammad Sadoghi

2897 - 2900

BIRDS: Programming view update strategies in Datalog

Van-Dang Tran, Hiroyuki Kato, Zhenjiang Hu

2901 - 2904

Apache IoTDB: Time-series database for Internet of Things

Chen Wang, Huang Xiangdong, Jialin Qiao, Jianmin Wang, Jiaguang Sun, Kevin Mcgrail, Julian Feinauer, Jinrui Zhang, Peng Wang, Jinrui Zhang, Rong Kang, Tian Jiang, Lei Rui, Jun Yuan

2905 - 2908

Evaluating Ridesharing Algorithms using the Jargo Real-Time Stochastic Simulator

James Pan, Guoliang Li, Yong Wang

2909 - 2912

BitFun: Fast Answers to Queries with Tunable Functions in Geospatial Array DBMS

Ramon Antonio Rodriges Zalipynis

2913 - 2916

SPHINX: A System for Metapath-based Entity Exploration in Heterogeneous Information Networks

Serafeim Chatzopoulos, Kostas Patroumpas, Alexandros Zeakis, Thanasis Vergoulis, Dimitrios Skoutas

2917 - 2920

ExplainED: Explanations for EDA Notebooks

Daniel Deutch, Amir Gilad, Tova Milo, Amit Somech

2921 - 2924

MuSe: Multiple Deletion Semantics for Data Repair

Amir Gilad, Yihao Hu, Daniel Deutch, Sudeepa Roy

2925 - 2928

Tabula in Action: A Sampling Middleware for Interactive Geospatial Visualization Dashboards

Jia Yu, Kanchan Chowdhury, Mohamed Sarwat

2929 - 2932

X^2R^2: a Tool for Explainable and Explorative Reidentification Risk Analysis

Tom Rolandus Hagedoorn, Rohit Kumar, Francesco Bonchi

2933 - 2936

Obi-Wan: Ontology-Based RDF Integration of Heterogeneous Data

Maxime Buron, François Goasdoué, Ioana Manolescu, Marie-Laure Mugnier

2937 - 2940

CrocodileDB in Action: Resource-Efficient Query Execution by Exploiting Time Slackness

Dixin Tang, Zechao Shang, Aaron Elmore, Sanjay Krishnan, Michael Franklin

2941 - 2944

GraphAn: Graph-based Subsequence Anomaly Detection

Paul Boniol, Themis Palpanas, Mohammed Meftah, Emmanuel Remy

2945 - 2948

LMFAO: An Engine for Batches of Group-By Aggregates

Maximilian Schleich, Dan Olteanu

2949 - 2952

ESTOCADA: Towards Scalable Polystore Systems

Rana Alotaibi, Bogdan Cautis, Alin Deutsch, Moustafa Latrache, Ioana Manolescu, Yifei Yang

2953 - 2956

Demonstration of Interactive Runtime Debugging of Distributed Dataflows in Texera

Zuozhi Wang, Avinash Kumar, Shengquan Ni, Chen Li

2957 - 2960

DeepTRANS: A Deep Learning System for Public Bus Travel Time Estimation using Traffic Forecasting

Luan Tran, Min Young Mun, Matthew Lim, Jonah Yamato, Nathan Huh, Cyrus Shahabi

2961 - 2964

Demonstration of ScroogeDB: Getting More Bang For the Buck with Deterministic Approximation in the Cloud

Saehan Jo, Jialing Pei, Immanuel Trummer

2965 - 2968

Scrutinizer: Fact Checking Statistical Claims

Georgios Karagiannis, Mohammed Saeed, Paolo Papotti, Immanuel Trummer

2969 - 2972

SciLens News Platform: A System for Real-Time Evaluation of News Articles

Angelika Romanou, Panayiotis Smeros, Carlos Castillo, Karl Aberer

2973 - 2976

HDAG-Explorer: A System for Hierarchical DAG Summarization and Exploration

Xuliang Zhu, Xin Huang, Jinbin Huang, Byron Choi, Jianliang Xu

2977 - 2980

Orca-SR: A Real-Time Traffic Engineering Framework leveraging Similarity Joins

Jees Augustine, Suraj Shetiya, Abolfazl Asudeh, Saravanan Thirumuruganathan, Azade Nazi, Nan Zhang, Gautam Das, Divesh Srivastava

2981 - 2984

nKV in Action: Accelerating KV-Stores on NativeComputational Storage with Near-Data Processing

Tobias Vincon, Lukas Weber, Arthur Bernhard, Andreas Koch, Ilia Petrov, Christian Knoedler, Sergey Hardock, Sajjad Tamimi, Christian Riegger

2985 - 2988

Demonstration of Inferring Causality from Relational Databases with CaRL

Moe Kayali, Babak Salimi, Dan Suciu

2989 - 2992

SQL for Data Scientists: Designing SQL Tutorials for Scalable Online Teaching

Uwe Roehm, Lexi Brent, Tim Dawborn, Bryn Jeffries

2993 - 2996

Debugging Large-Scale Data Science Pipelines using Dagger

El Kindi Rezig, Ashrita Brahmaroutu, Nesime Tatbul, Mourad Ouzzani, Nan Tang, Timothy Mattson, Samuel Madden, Michael Stonebraker

2997 - 3000

I-Rex: An Interactive Relational Query Explainer for SQL

Zhengjie Miao, Tiangang Chen, Alexander Bendeck, Kevin Day, Sudeepa Roy, Jun Yang

3001 - 3004

PANDA: Policy-aware Location Privacy for Epidemic Surveillance

Yang Cao, Shun Takagi, Yonghui Xiao, Li Xiong, Masatoshi Yoshikawa

3005 - 3018

PyTorch Distributed: Experiences on Accelerating Data Parallel Training

Shen Li, Yanli Zhao, Rohan Varma, Omkar Salpekar, Pieter Noordhuis, Teng Li, Adam Paszke, Jeff Smith, Brian Vaughan, Pritam Damania, Soumith Chintala

3019 - 3031

Towards Multi-way Join Aware Optimizer in SAP HANA

Sungheun Wi, Wook-Shin Han, Chuho Chang, Kihong Kim

3032 - 3045

InvaliDB: Scalable Push-Based Real-Time Queries on Top of Pull-Based Databases (Extended)

Wolfram Wingerath, Felix Gessert, Norbert Ritter

3046 - 3058

Automated Generation of Materialized Views in Oracle

Rafi Ahmed, Randall Bello, Andrew Witkowski, Praveen Kumar

3059 - 3071

Native JSON Datatype Support: Maturing SQL and NoSQL convergence in Oracle Database

Zhen Hua Liu, Beda Hammerschmidt, Douglas Mcmahon, Hui Chang, Ying Lu, Josh Spiegel, Alfonso Colunga Sosa, Srikrishnan Suresh, Geeta Arora, Vikas Arora

3072 - 3084

TiDB: A Raft-based HTAP Database

Dongxu Huang, Qi Liu, Qiu Cui, Zhuhe Fang, Xiaoyu Ma, Fei Xu, Li Shen, Liu Tang, Yuxing Zhou, Menglong Huang, Wan Wei, Cong Liu, Jian Zhang, Jianjun Li, Xuelian Wu, Lingyu Song, Ruoxi Sun, Shuaipeng Yu, Lei Zhao, Nicholas Cameron, Liquan Pei, Xin Tang

3085 - 3098

A system design for elastically scaling transaction processing engines in virtualized servers

Angelos Christos Anadiotis, Raja Appuswamy, Anastasia Ailamaki, Ilan Bronshtein, Hillel Avni, David Dominguez-Sal, Shay Goikhman, Eliezer Levy

3099 - 3111

Industrial Strength OLTP Using Main Memory and Many Cores

Hillel Avni, Alisher Aliev, Oren Amor, Aharon Avitzur, Ilan Bronshtein, Eli Ginot, Shay Goikhman, Eliezer Levy, Idan Levy, Fuyang Lu, Liran Mishali, Yeqin Mo, Nir Pachter, Dima Sivov, Vinoth Veeraraghavan, Vladi Vexler, Lei Wang, Peng Wang

3112 - 3124

Asymmetric-Partition Replication for Highly Scalable Distributed Transaction Processing in Practice

Juchang Lee, Hyejeong Lee, Seongyun Ko, Kyu Hwan Kim, Mihnea Andrei, Friedrich Keller, Wook-Shin Han

3125 - 3137

AGL: A Scalable System for Industrial-purpose Graph Machine Learning

Dalong Zhang, Xin Huang, Ziqi Liu, Jun Zhou, Zhiyang Hu, Xianzheng Song, Zhibang Ge, Lin Wang, Zhiqiang Zhang, Yuan Qi

3138 - 3151

LedgerDB: A Centralized Ledger Database for Universal Audit and Verification

Xinying Yang, Yuan Zhang, Sheng Wang, Benquan Yu, Feifei Li, Yize Li, Wenyuan Yan

3152 - 3165

AnalyticDB-V: A Hybrid Analytical Engine Towards Query Fusion for Structured and Unstructured Data

Chuangxian Wei, Bin Wu, Sheng Wang, Renjie Lou, Chaoqun Zhan, Feifei Li, Yuanzhe Cai

3166 - 3180

Oracle AutoML: A Fast and Predictive AutoML Pipeline

Anatoly Yakovlev, Hesam Fathi Moghadam, Ali Moharrer, Jingxiao Cai, Nikan Chavoshi, Venkatanathan Varadarajan, Sandeep Agrawal, Tomas Karnagel, Sam Idicula, Sanjay Jinturkar, Nipun Agarwal

3181 - 3194

Monarch: Google's Planet-Scale In-Memory Time Series Database

Colin Adams, Luis Alonso, Benjamin Atkin, John Banning, Sumeer Bhola, Rick Buskens, Ming Chen, Xi Chen, Yoo Chung, Qin Jia, Nick Sakharov, George Talbot, Nick Taylor, Adam Tart

3195 - 3203

Concurrent Updates to Pages with Fixed-Size rows Using Lock-Free Algorithms

Raghavendra Thallam Kodandaramaih, Hanuma Kodavalla, Girish Venkataramanappa

3204 - 3216

POLARIS: The Distributed SQL Engine in Azure Synapse

Josep Aguilar Saborit, Raghu Ramakrishnan

3217 - 3230

MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph

Yoshinori Matsunobu, Siying Dong, Herman Lee

3231 - 3244

Helios: Hyperscale Indexing for the Cloud & Edge

Rahul Potharaju, Terry Kim, Wentao Wu, Vidip Acharya, Steve Suh, Andrew Fogarty, Apoorve Dave, Sinduja Ramanujam, Tomas Talius, Lev Novik, Raghu Ramakrishnan

3245 - 3257

Replication at the Speed of Change - a Fast, Scalable Replication Solution for Near Real-Time HTAP Processing

Dennis Butterstein, Daniel Martin, Knut Stolze, Felix Beier, Jia Zhong, Lingyun Wang

3258 - 3271

Exploiting Domain Knowledge to address Multi-Class Imbalance and a Heterogeneous Feature Space in Classification Tasks for Manufacturing Data

Vitali Hirsch, Peter Reimann, Bernhard Mitschang

3272 - 3284

Alibaba Hologres: A Cloud-Native Service for Hybrid Serving/Analytical Processing

Xiaowei Jiang, Yuejun Hu, Yu Xiang, Guangran Jiang, Xiaojun Jin, Chen Xia, Weihua Jiang, Jun Yu, Haitao Wang, Yuan Jiang, Jihong Ma, Li Su, Kai Zeng

3285 - 3298

DIAMetrics: Benchmarking Query Engines at Scale

Anja Gruenheid, Shaleen Deep, Kruthi Nagaraj, Hiro Naito, Jeff Naughton, Stratis Viglas

3299 - 3312

Db2 Event Store: A Purpose-Built IoT Database Engine

Christian Garcia-Arellano, Adam Storm, David Kalmuk, Hamdi Roumani, Ronald Barber, Yuanyuan Tian, Richard Sidle, Fatma Ozcan, Matthew Spilchen, Josh Tiefenbach, Daniel Zilio, Lan Pham, Kostas Rakopoulos, Alexander Cheung, Darren Pepper, Imran Sayyid, Gidon Gershinsky, Gal Lushi, Hamid Pirahesh

3313 - 3325

F1 Lightning: HTAP as a Service

Jiacheng Yang, Ian Rae, Jun Xu, Jeff Shute, Zhan Yuan, Kelvin Lau, Qiang Zeng, Xi Zhao, Jun Ma, Ziyang Chen, Yuan Gao, Qilin Dong, Junxiong Zhou, Jeremy Wood, Goetz Graefe, Jeff Naughton, John Cieslewicz

3326 - 3339

AutoToken: Predicting Peak Parallelism for Big Data Analytics at Microsoft

Rathijit Sen, Alekh Jindal, Hiren Patel, Shi Qiao

3340 - 3353

A Drop-in Middleware for Serializable DB Clustering across Geo-distributed Sites

Enrique Saurez, Bharath Balasubramanian, Richard Schlichting, Brendan Tschaen, Shankaranarayanan Puzhavakath Narayanan, Zhe Huang, Umakishore Ramachandran