Proceedings of the VLDB Endowment, Volume 12, 2018-2019
- Editors-in-Chief: Lei Chen and Fatma Özcan
- Founding Editor-in-Chief: H. V. Jagadish
- Managing Editor: Divesh Srivastava
- Advisory Committee: Peter Boncz, Xin Luna Dong, Juliana Freire, Jayant Haritsa, Wolfgang Lehner, Renée J. Miller, Tova Milo, M. Tamer Özsu
- Associate Editors: Azza Abouzied, Selcuk Candan, Surajit Chaudhuri, Amol Desphande, Johann-Christoph Freytag, Rainer Gemulla, Nick Koudas, Georgia Koutrika, Yunyao Li, Alexandra Meliou, Arnab Nandi, M. Tamer Özsu, Themis Palpanas, Alkis Polyzotis, Kyuseok Shim, Xiaokui Xiao, Meihui Zhang
- Review Board: see full list
Volume 12, No. 1, September 2018
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Sunghwan Kim, Taesung Lee, Seungwon Hwang, Sameh Elnikety:
List Intersection for Web Search: Algorithms, Cost Models, and Optimizations 1-13 - Michael Whittaker, Joseph M. Hellerstein:
Interactive Checks for Coordination Avoidance 14-27 - Jianbin Qin, Chuan Xiao:
Pigeonring: A Principle for Faster Thresholded Similarity Search 28-42 - Ahmet Erdem Sariyuce, C. Seshadhri, Ali Pinar:
Local Algorithms for Hierarchical Dense Subgraph Discovery 43-56 - Jingru Yang, Ju Fan, Zhewei Wei, Guoliang Li, Tongyu Liu, Xiaoyong Du:
Cost-Effective Data Annotation using Game-Based Crowdsourcing 57-70 - Enhui Huang, Liping Peng, Luciano Di Palma, Ahmed Abdelkafi, Anna Liu, Yanlei Diao:
Optimization for Active Learning-based Interactive Database Exploration 71-84
Volume 12, No. 2, October 2018
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Tobias Bleifuß, Leon Bornemann, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava:
Exploring Change - A New Dimension of Data Analytics 85-98 - Bishwamittra Ghosh, Mohammed Eunus Ali, Farhana M. Choudhury, Sajid Hasan, Timos Sellis, Jianxin Li:
The Flexible Socio Spatial Group Queries 99-111 - Karima Echihabi, Kostas Zoumpatianos, Themis Palpanas, Houda Benbrahim:
The Lernaean Hydra of Data Series Similarity Search: An Experimental Evaluation of the State of the Art 112-127 - Wei Wang, Sheng Wang, Jinyang Gao, Meihui Zhang, Gang Chen, Teck Khim Ng, Beng Chin Ooi, Jie Shao:
Rafiki: Machine Learning as an Analytics Service System 128-140 - Pavle Subotic, Herbert Jordan, Lijun Chang, Alan Fekete, Bernhard Scholz:
Automatic Index Selection for Large-Scale Datalog Computation 141-153 - Shuang Song, Xu Liu, Qinzhe Wu, Andreas Gerstlauer, Tao Li, Lizy K. John:
Start Late or Finish Early: A Distributed Graph Processing System with Redundancy Reduction 154-168 - Bailu Ding, Lucja Kot, Johannes Gehrke:
Improving Optimistic Concurrency Control Through Transaction Batching and Operation Reordering 169-182
Volume 12, No. 3, November 2018
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Ting Xie, Varun Chandola, Oliver Kennedy:
Query Log Compression for Workload Analytics 183-196 - Mohammed Eunus Ali, Shadman Saqib Eusuf, Kaysar Abdullah, Farhana M. Choudhury, J. Shane Culpepper, Timos Sellis:
The Maximum Trajectory Coverage Query in Spatial Databases 197-209 - Chenggang Wu, Alekh Jindal, Saeed Amizadeh, Hiren Patel, Wangchao Le, Shi Qiao, Sriram Rao:
Towards a Learning Optimizer for Shared Clouds 210-222 - Paroma Varma, Christopher Re:
Snuba: Automating Weak Supervision to Label Training Data 223-236 - Abolfazl Asudeh, H. Jagadish, Gerome Miklau, Julia Stoyanovich:
On Obtaining Stable Rankings 237-250 - Shuping Ji, Hans-Arno Jacobsen:
PS-Tree-based Efficient Boolean Expression Matching for High Dimensional and Dense Workloads 251-264 - Yizhou Yan, Lei Cao, Samuel Madden, Elke Rundensteiner:
SWIFT: Mining Representative Patterns from Large Event Streams 265-277 - Paul Suganthan G. C., Adel Ardalan, Anhai Doan, Aditya Akella:
Smurf: Self-Service String Matching Using Random Forests 278-291 - Feilong Liu, Ario Salmasi, Spyros Blanas, Anastasios Sidiropoulos:
Chasing Similarity: Distribution-aware Aggregation Scheduling 292-306 - Johes Bater, Xi He, William Ehrich, Ashwin Machanavajjhala, Jennie Rogers:
ShrinkWrap: Efficient SQL Query Processing in Differentially Private Data Federations 307-320
Volume 12, No. 4, December 2018
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Gurbinder Gill, Roshan Dathathri, Loc Hoang, Keshav Pingali:
A Study of Partitioning Policies for Graph Analytics on Large-scale Distributed Platforms 321-334 - K. Ashwin Kumar, Petros Efstathopoulos:
Utility-Driven Graph Summarization 335-347 - Kaan Kara, Ken Eguro, Ce Zhang, Gustavo Alonso:
ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation 348-361 - Yanying Li, Haipei Sun, Boxiang Dong, Hui (wendy) Wang:
Cost-efficient Data Acquisition on Online Data Marketplaces for Correlation Analysis 362-375 - Mohamad Dolatshah, Mathew Teoh, Jiannan Wang, Jian Pei:
Cleaning Crowdsourced Labels Using Oracles For Statistical Classification 376-389 - Matteo Lissandrini, Martin Brugnara, Yannis Velegrakis:
Beyond Macrobenchmarks: Microbenchmark-based Graph Database Evaluation 390-403 - Valter Balegas, Sérgio Duarte, Carla Ferreira, Rodrigo Rodrigues, Nuno Preguiça:
IPA: Invariant-preserving Applications for Weakly consistent Replicated Databases 404-418 - Firas Abuzaid, Peter Kraft, Sahaana Suri, Edward Gan, Eric Xu, Atul Shenoy, Asvin Anathanaraya, John Sheu, Erik Meijer, Xi Wu, Jeff Naughton, Peter Bailis, Matei Zaharia:
DIFF: A Relational Interface for Large-Scale Data Explanation 419-432 - Ran Ben Basat, Roy Friedman, Rana Shahout:
Stream Frequency Over Interval Queries 433-445 - Doris Xin, Stephen Macke, Litian Ma, Jialin Liu, Shuchen Song, Aditya Parameswaran:
Helix: Holistic Optimization for Accelerating Iterative Machine Learning 446-460
Volume 12, No. 5, January 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Cong Fu, Chao Xiang, Changxu Wang, Deng Cai:
Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph 461-474 - Qi Wang, Torsten Suel:
Document Reordering for Faster Intersection 475-487 - Xiaofei Zhang, Tamer Özsu:
Correlation Constraint Shortest Path over Large Multi-Relation Graphs 488-501 - Harald Lang, Thomas Neumann, Alfons Kemper, Peter Boncz:
Performance-Optimal Filtering: Bloom overtakes Cuckoo at High-Throughput 502-515 - Steffen Zeuch, Sebastian Breß, Tilmann Rabl, Bonaventura Del Monte, Jeyhun Karimov, Clemens Lutz, Manuel Renz, Jonas Traub, Volker Markl:
Analyzing Efficient Stream Processing on Modern Hardware 516-530 - Chen Luo, Michael Carey:
Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems 531-543 - Periklis Chrysogelos, Manos Karpathiotakis, Raja Appuswamy, Anastasia Ailamaki:
HetExchange: Encapsulating heterogeneous CPU–GPU parallelism in JIT compiled engines 544-556 - Paolo Atzeni, Luigi Bellomarini, Paolo Papotti, Riccardo Torlone:
Meta-Mappings for Schema Mapping Reuse 557-569 - Lijie Xu, Tian Guo, Wensheng Dou, Wei Wang, Jun Wei:
An Experimental Evaluation of Garbage Collectors on Big Data Applications 570-583 - Jinwei Guo, Peng Cai, Jiahao Wang, Weining Qian, Aoying Zhou:
Adaptive Optimistic Concurrency Control for Heterogeneous Workloads 584-596 - Yu-Shan Lin, Shao-Kan Pi, Meng-Kai Liao, Ching Tsai, Aaron Elmore, Shan-Hung Wu:
MgCrab: Transaction Crabbing for Live Migration in Deterministic Database Systems 597-610 - Sujaya Maiyya, Faisal Nawab, Divy Agrawal, Amr El Abbadi:
Unifying Consensus and Atomic Commitment for Effective Cloud Data Management 611-623
Volume 12, No. 6, February 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Chenggang Wu, Vikram Sreekanti, Joseph Hellerstein:
Autoscaling Tiered Cloud Storage in Anna 624-638 - Anton Dignös, Boris Glavic, Xing Niu, Johann Gamper, Michael Böhlen:
Snapshot Semantics for Temporal Multiset Relations 639-652 - Selasi Kwashie, Jixue Liu, Jiuyong Li, Lin Liu, Markus Stumptner, Lujing Yang:
Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 653-666 - Kai Han, Fei Gui, Xiaokui Xiao, Jing Tang, Yuntian He, Zongmai Cao, He Huang:
Efficient and Effective Algorithms for Clustering Uncertain Graphs 667-680 - Jia Zou, Arun Iyengar, Chris Jermaine:
Pangea: Monolithic Distributed Storage for Data Analytics 681-694 - Zhiwei Fan, Jianqiao Zhu, Zuyu Zhang, Aws Albarghouthi, Paraschos Koutris, Jignesh Patel:
Scaling-Up In-Memory Datalog Processing: Observations and Techniques 695-708 - Aaron Archer, Kevin Aydin, Mohammadhossein Bateni, Vahab Mirrokni, Aaron Schild, Ray Yang, Richard Zhuang:
Cache-aware load balancing of data center applications 709-723
Volume 12, No. 7, March 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Michael Borkowski, Christoph Hochreiner, Stefan Schulte:
Minimizing Cost by Reducing Scaling Operations in Distributed Stream Processing 724-737 - Yinjun Wu, Abdussalam Alawini, Daniel Deutch, Tova Milo, Susan Davidson:
ProvCite: Provenance-based Data Citation 738-751 - Wenfei Fan, Ping Lu, Chao Tian, Jingren Zhou:
Deducing Certain Fixes to Graphs 752-765 - Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci:
Solving k-center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially. 766-778 - Xiaolan Wang, Alexandra Meliou:
Explain3D: Explaining Disagreements in Disjoint Datasets 779-792 - Youjip Won, Sundoo Kim, Juseong Yun, Damquang Tuan, Jiwon Seo:
DASH: Database Shadowing for Mobile DBMS 793-806 - Zeke Wang, Kaan Kara, Hantian Zhang, Gustavo Alonso, Ce Zhang, Onur Mutlu:
Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-precision Learning 807-821 - Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine, Zekai Gao:
Declarative Recursive Computation on an RDBMS, or, Why You Should Use a Database For Distributed Machine Learning 822-835
Volume 12, No. 8, April 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Shahram Ghandeharizadeh, Hieu Nguyen:
Design, Implementation, and Evaluation of Write-Back Policy with Cache Augmented Data Stores 836-849 - Thanh Tam Nguyen, Hongzhi Yin, Matthias Weidlich, Bolong Zheng, Quoc Viet Hung Nguyen, Bela Stantic:
User Guidance for Efficient Fact Checking 850-863 - Xiangyu Ke, Arijit Khan, Leroy Lim:
An In-Depth Comparison of s-t Reliability Algorithms over Uncertain Graphs 864-876 - Wenfei Fan, Chunming Hu, Muyang Liu, Ping Lu, Qiang Yin, Jingren Zhou:
Dynamic Scaling for Parallel Graph Computations 877-890 - Yiming Zhang, Dongsheng Li, Jinyan Wang, Kian-Lee Tan:
TopoX: Topology Refactorization for Efficient Graph Partitioning and Processing 891-905 - Dmitrii Avdiukhin, Sergey Pupyrev, Grigory Yaroslavtsev:
Multi-Dimensional Balanced Graph Partitioning via Projected Gradient Descent 906-919 - Lei Cao, Yizhou Yan, Samuel Madden, Elke Rundensteiner, Mathan Gopalsamy:
Efficient Discovery of Sequence Outlier Patterns 920-932 - Dmytro Bogatov, George Kollios, Leo Reyzin:
A Comparative Evaluation of Order-Revealing Encryption Schemes and Secure Range-Query Protocols 933-947
Volume 12, No. 9, May 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Faisal Orakzai, Toon Calders, Torben Pedersen:
k/2-hop: Fast Mining of Convoy Patterns With Effective Pruning 948-960 - Ji Sun, Zeyuan Shang, Guoliang Li, Zhifeng Bao, Dong Deng:
Balance-Aware Distributed String Similarity-Based Query Processing System 961-974 - Pingcheng Ruan, Gang Chen, Anh Dinh, Qian Lin, Beng Chin Ooi, Meihui Zhang:
Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems 975-988 - Dalsu Choi, Chang-Sup Park, Yon Dohn Chung:
Progressive Top-k Subarray Query Processing in Array Databases 989-1001 - Moritz Hoffmann, Andrea Lattuada, Frank Mcsherry, Vasiliki Kalavri, John Liagouris, Timothy Roscoe:
Megaphone: Latency-conscious state migration for distributed streaming dataflows 1002-1015 - Thanh Tam Nguyen, Matthias Weidlich, Bolong Zheng, Hongzhi Yin, Quoc Viet Hung Nguyen, Bela Stantic:
From Anomaly Detection to Rumour Detection using Data Streams of Social Platforms 1016-1029 - Peeush Gupta, Yin Li, Sharad Mehrotra, Nisha Panwar, Shantanu Sharma, Sumaya Almanee:
Obscure: Information-Theoretic Oblivious and Verifiable Aggregation Queries 1030-1043 - Anshuman Dutt, Chi Wang, Azade Nazi, Srikanth Kandula, Vivek Narasayya, Surajit Chaudhuri:
Selectivity Estimation for Range Predicates using Lightweight Models 1044-1057
Volume 12, No. 10, June 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Ye Yuan, Xiang Lian, Guoren Wang, Yuliang Ma, Yishu Wang:
Constrained Shortest Path Query in a Large Time-Dependent Graph 1058-1070 - Lingyang Chu, Zhefeng Wang, Jian Pei, Yanyan Zhang, Yu Yang, Enhong Chen:
Finding Theme Communities from Database Networks 1071-1084 - James Pan, Guoliang Li, Juntao Hu:
Ridesharing: Simulator, Benchmark, and Evaluation 1085-1098 - Longbin Lai, Zhu Qing, Zhengyi Yang, Xin Jin, Zhengmin Lai, Ran Wang, Kongzhang Hao, Xuemin Lin, Lu Qin, Wenjie Zhang, Ying Zhang, Zhengping Qian, Jingren Zhou:
Distributed Subgraph Matching on Timely Dataflow 1099-1112 - Shi Qiao, Adrian Nicoara, Jin Sun, Marc Friedman, Hiren Patel, Jaliya Ekanayake:
Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in Scope 1113-1125 - Graham Cormode, Tejas Kulkarni, Divesh Srivastava:
Answering Range Queries Under Local Differential Privacy 1126-1138 - Kai Wang, Xuemin Lin, Lu Qin, Wenjie Zhang, Ying Zhang:
Vertex Priority Based Butterfly Counting for Large-scale Bipartite Networks 1139-1152 - Yang Cao, Wenfei Fan, Tengfei Yuan:
Block as a Value for SQL over NoSQL 1153-1166 - Kanat Tangwongsan, Martin Hirzel, Scott Schneider:
Optimal and General Out-of-Order Sliding-Window Aggregation 1167-1180 - Bo Tang, Kyriakos Mouratidis, Man Lung Yiu, Zhenyu Chen:
Creating Top Ranking Options in the Continuous Option and Preference Space 1181-1194 - Hanchao Ma, Morteza Alipourlangouri, Yinghui Wu, Fei Chiang, Jiaxing Pi:
Ontology-based Entity Matching in Attributed Graphs 1195-1207 - Lu Chen, Yunjun Gao, Ziquan Fang, Xiaoye Miao, Christian Jensen, Chenjuan Guo:
Real-time Distributed Co-Movement Pattern Detection on Streaming Trajectories 1208-1220 - Jian Tan, Tieying Zhang, Feifei Li, Jie Chen, Qixing Zheng, Ping Zhang, Honglin Qiao, Yue Shi, Wei Cao, Rui Zhang:
iBTune: Individualized Buffer Tuning for Large-scale Cloud Databases 1221-1234
Volume 12, No. 11, July 2019
- Lei Chen and Fatma Özcan:
Front Matter i-viii - Michael J. Whittaker, Nick Edmonds, Sandeep Tata, James B. Wendt, Marc Najork:
Online Template Induction for Machine-Generated Emails 1235-1248 - Yong Wang, Guoliang Li, Nan Tang:
Querying Shortest Paths on Time Dependent Road Networks 1249-1261 - Anna Fariha, Alexandra Meliou:
Example-Driven Query Intent Discovery: Abductive Reasoning using Semantic Similarity 1262-1275 - Qi Zhou, Joy Arulraj, Shamkant Navathe, William Harris, Dong Xu:
Automated Verification of Query Equivalence Using Satisfiability Modulo Theories 1276-1288 - Pengfei Xu, Jiaheng Lu:
Towards a Unified Framework for String Similarity Joins 1289-1302 - Susik Yoon, Jae-Gil Lee, Byung Suk Lee:
NETS: Extremely Fast Outlier Detection from a Data Stream via Set-Based Processing 1303-1315 - Yi Lu, Xiangyao Yu, Samuel Madden:
STAR: Scaling Transactions through Asymmetric Replication 1316-1329 - Yuliang Li, Aaron Feng, Jinfeng Li, Saran Mumick, Alon Halevy, Vivian Li, Wang-Chiew Tan:
Subjective Databases 1330-1343 - Xuguang Ren, Junhu Wang, Wook-Shin Han, Jeffrey Xu Yu:
Fast and Robust Distributed Subgraph Enumeration 1344-1356 - Fangcheng Fu, Jiawei Jiang, Yingxia Shao, Bin Cui:
An Experimental Evaluation of Large Scale GBDT Systems 1357-1370 - Ios Kotsogiannis, Yuchao Tao, Xi He, Maryam Fanaeepour, Ashwin Machanavajjhala, Michael Hay, Gerome Miklau:
PrivateSQL: A Differentially Private SQL Query Engine 1371-1384 - Mohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi:
CAPER: A Cross-Application Permissioned Blockchain 1385-1398 - Alexandros Koliousis, Pijika Watcharapichat, Matthias Weidlich, Luo Mai, Paolo Costa, Peter Pietzuch:
Crossbow: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers 1399-1413 - Kaiyu Feng, Gao Cong, Christian S. Jensen, Tao Guo:
Finding Attribute-Aware Similar Region for Data Analysis 1414-1426 - Dixin Tang, Zechao Shang, Aaron J. Elmore, Sanjay Krishnan, Michael J. Franklin:
Intermittent Query Processing 1427-1441 - Mihai Budiu, Parikshit Gopalan, Lalith Suresh, Udi Wieder, Han Kruiger, Marcos K. Aguilera:
Hillview: A trillion-cell spreadsheet for big data 1442-1457 - Ziheng Wei, Sebastian Link:
Embedded Functional Dependencies and Data-completeness Tailored Database Design 1458-1470 - Hua Fan, Wojciech Golab:
Ocean Vista: Gossip-Based Visibility Control for Speedy Geo-Distributed Transactions 1471-1484 - Xikui Wang, Michael Carey:
An IDEA: An Ingestion Framework for Data Enrichment in AsterixDB 1485-1498 - Alexey Karyakin, Kenneth Salem:
DimmStore: Memory Power Optimization for Database Systems 1499-1512 - Cong Yan, Alvin Cheung:
Generating Application-specific Data Layouts for In-memory Databases 1513-1525 - Rihan Hai, Christoph Quix:
Rewriting of Plain SO Tgds into Nested Tgds 1526-1538 - Senthil Nathan, Chander Govindarajan, Adarsh Saraf, Manish Sethi, Praveen Jayachandran:
Blockchain Meets Database: Design and Implementation of a Blockchain Relational Database 1539-1552 - Andreas Kunft, Asterios Katsifodimos, Sebastian Schelter, Sebastian Bress, Tilmann Rabl, Volker Markl:
An Intermediate Representation for Optimizing Machine Learning Pipelines 1553-1567 - Yuanwei Fang, Chen Zou, Andrew Chien:
Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture 1568-1582 - A. B. Siddique, Ahmed Eldawy, Vagelis Hristidis:
Comparing Synopsis Techniques for Approximate Spatial Data Analysis 1583-1596 - Muhammad El-Hindi, Carsten Binnig, Arvind Arasu, Donald Kossmann, Ravi Ramamurthy:
BlockchainDB - A Shared Database on Blockchains 1597-1609 - Ruoxi Jia, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gürel, Bo Li, Ce Zhang, Costas J. Spanos, Dawn Song:
Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms 1610-1623 - Hemant Saxena, Lukasz Golab, Ihab F. Ilyas:
Distributed Implementations of Dependency Discovery Algorithms 1624-1636 - Erfan Zamanian, Xiangyao Yu, Michael Stonebraker, Tim Kraska:
Rethinking Database High Availability with RDMA Networks 1637-1650 - Marco Bressan, Stefano Leucci, Alessandro Panconesi:
Motivo: Fast Motif Counting via Succinct Color Coding and Adaptive Sampling 1651-1663 - Rishabh Poddar, Tobias Boelter, Raluca Ada Popa:
Arx: An Encrypted Database using Semantically Secure Encryption 1664-1678 - Junyang Gao, Xian Li, Yifan Ethan Xu, Bunyamin Sisman, Xin Luna Dong, Jun Yang:
Efficient Knowledge Graph Accuracy Evaluation 1679-1691 - Amine Mhedhbi, Semih Salihoglu:
Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins 1692-1704 - Ryan C. Marcus, Parimarjan Negi, Hongzi Mao, Chi Zhang, Mohammad Alizadeh, Tim Kraska, Olga Papaemmanouil, Nesime Tatbul:
Neo: A Learned Query Optimizer 1705-1718 - Yixiang Fang, Kaiqiang Yu, Reynold Cheng, Laks V.s. Lakshmanan, Xuemin Lin:
Efficient Algorithms for Densest Subgraph Discovery 1719-1732 - Ryan C. Marcus, Olga Papaemmanouil:
Plan-Structured Deep Neural Network Models for Query Performance Prediction 1733-1746 - Kun Ren, Dennis Li, Daniel J. Abadi:
SLOG: Serializable, Low-latency, Geo-replicated Transactions 1747-1761 - John Paparrizos, Michael Franklin:
GRAIL: Efficient Time-Series Representation Learning 1762-1777
Volume 12, No. 12, August 2019
- Lei Chen and Fatma Özcan:
Front Matter i-xiii - Guilherme Damasio, Spencer Bryson, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Jaroslaw Szlichta, Calisto Zuzarte:
GALO: Guided Automated Learning for re-Optimization 1778-1781 - Yuanyuan Tian, Sui Jun Tong, Mir Hamid Pirahesh, Wen Sun, En Liang Xu, Wei Zhao:
Synergistic Graph and SQL Analytics Inside IBM Db2 1782-1785 - Xiaoou Ding, Hongzhi Wang, Jiaxuan Su, Zijue Li, Jianzhong Li, Hong Gao:
Cleanits: A Data Cleaning System for Industrial Time Series 1786-1789 - Yipeng Zhang, Zhifeng Bao, Songsong Mo, Yuchen Li, Yanghao Zhou:
ITAA: An Intelligent Trajectory-driven Outdoor Advertising Deployment Assistant 1790-1793 - Kun Qian, Lucian Popa, Prithviraj Sen:
SystemER: A Human-in-the-loop System for Explainable Entity Resolution 1794-1797 - Viet-Phi Huynh, Paolo Papotti:
Buckle: Evaluating Fact Checking Algorithms Built on Knowledge Bases 1798-1801 - Peng Gao, Xusheng Xiao, Zhichun Li, Kangkook Jee, Fengyuan Xu, Sanjeev R. Kulkarni, Prateek Mittal:
A Query System for Efficiently Investigating Complex Attack Behaviors for Enterprise Security 1802-1805 - Zhengjie Miao, Qitian Zeng, Chenjie Li, Boris Glavic, Oliver Kennedy, Sudeepa Roy:
CAPE: Explaining Outliers by Counterbalancing 1806-1809 - Karthik Ramachandra, Kwanghyun Park:
BlackMagic: Automatic Inlining of Scalar UDFs into SQL Queries with Froid 1810-1813 - Lukas Berg, Tobias Ziegler, Carsten Binnig, Uwe Röhm:
ProgressiveDB - Progressive Data Analytics as a Middleware 1814-1817 - Kaan Kara, Zeke Wang, Ce Zhang, Gustavo Alonso:
doppioDB 2.0: Hardware Techniques for Improved Integration of Machine Learning into Databases 1818-1821 - Cícero A. L. Pahins, Behrooz Omidvar-Tehrani, Sihem Amer-Yahia, Valérie Siroux, Jean-Louis Pepin, Jean-Christian Borel, João Comba:
COVIZ: A System for Visual Formation and Exploration of Patient Cohorts 1822-1825 - Martin Franke, Ziad Sehili, Erhard Rahm:
PRIMAT: A Toolbox for Fast Privacy-preserving Matching 1826-1829 - Ryan Marcus, Chi Zhang, Shuai Yu, Geoffrey Kao, Olga Papaemmanouil:
NashDB: Fragmentation, Replication, and Provisioning using Economic Methods 1830-1833 - Ibrahim Sabek, Mashaal Musleh, Mohamed F. Mokbel:
Flash in Action: Scalable Spatial Data Analysis Using Markov Logic Networks 1834-1837 - Lucas Kuhring, Zsolt István:
I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files 1838-1841 - Hyewon Choi, Erkang Zhu, Arsala Bangash, Renée J. Miller:
VISE: Vehicle Image Search Engine with Traffic Camera 1842-1845 - Stephan Goldberg, Tova Milo, Slava Novgorodov, Kathy Razmadze:
WiClean: A System for Fixing Wikipedia Interlinks Using Revision History Patterns 1846-1849 - Abhishek Roy, Alekh Jindal, Hiren Patel, Ashit Gosalia, Subru Krishnan, Carlo Curino:
SparkCruise: Handsfree Computation Reuse in Spark 1850-1853 - Sandeep Singh Sandha, Wellington Cabrera, Mohammed Al-Kateb, Sanjay Nair, Mani Srivastava:
In-database Distributed Machine Learning: Demonstration using Teradata SQL Engine 1854-1857 - Zhao Li, Xia Chen, Xuming Pan, Pengcheng Zou, Yuchen Li, Guoxian Yu:
SHOAL: Large-scale Hierarchical Taxonomy via Graph-based Query Coalition in E-commerce 1858-1861 - Min Xu, Tianhao Wang, Bolin Ding, Jingren Zhou, Cheng Hong, Zhicong Huang:
DPSAaS: Multi-Dimensional Data Sharing and Analytics as Services under Local Differential Privacy 1862-1865 - Yang Cao, Yonghui Xiao, Li Xiong, Liquan Bai, Masatoshi Yoshikawa:
PriSTE: Protecting Spatiotemporal Event Privacy in Continuous Location-Based Services 1866-1869 - Daniel Deutch, Evgeny Marants, Yuval Moskovitch:
Datalignment: Ontology Schema Alignment Through Datalog Containment 1870-1873 - Congcong Ge, Yunjun Gao, Xiaoye Miao, Lu Chen, Christian S. Jensen, Ziyuan Zhu:
IHCS: An Integrated Hybrid Cleaning System 1874-1877 - Constantinos Costa, Xiaoyu Ge, Panos K. Chrysanthis:
CAPRIO: Graph-based Integration of Indoor and Outdoor Data for Path Discovery 1878-1881 - Yingjun Wu, Jia Yu, Yuanyuan Tian, Richard Sidle, Ronald Barber:
HERMIT in Action: Succinct Secondary Indexing Mechanism via Correlation Exploration 1882-1885 - Julien Loudet, Iulian Sandu-Popa, Luc Bouganim:
DISPERS: Securing Highly Distributed Queries on Personal Data Management Systems 1886-1889 - Adil Akhter, Marios Fragkoulis, Asterios Katsifodimos:
Stateful Functions as a Service in Action 1890-1893 - Allen Ordookhanians, Xin Li, Supun Nakandala, Arun Kumar:
Demonstration of Krypton: Optimized CNN Inference for Occlusion-based Deep CNN Explanations 1894-1897 - Zhengjie Miao, Andrew Lee, Sudeepa Roy:
LensXPlain: Visualizing and Explaining Contributing Subsets for Aggregate Query Answers 1898-1901 - Yi Zhang, Zachary G. Ives:
Juneau: Data Lake Management for Jupyter 1902-1905 - Sona Hasani, Faezeh Ghaderi, Shohedul Hasan, Saravanan Thirumuruganathan, Abolfazl Asudeh, Nick Koudas, Gautam Das:
ApproxML: Efficient Approximate Ad-Hoc ML Models Through Materialization and Reuse 1906-1909 - Grégory Essertel, Ruby Y. Tahboub, Fei Wang, James Decker, Tiark Rompf:
Flare & Lantern: Efficiently Swapping Horses Midstream 1910-1913 - Ruben Martins, Jia Chen, Yanju Chen, Yu Feng, Isil Dillig:
Trinity: An Extensible Synthesis Framework for Data Science 1914-1917 - Zhiqi Huang, Ryan Mckenna, George Bissias, Gerome Miklau, Michael Hay, Ashwin Machanavajjhala:
PSynDB: Accurate and Accessible Private Data Generation 1918-1921 - Badrish Chandramouli, Dong Xie, Yinan Li, Donald Kossmann:
FishStore: Fast Ingestion and Indexing of Raw Data 1922-1925 - Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran:
Spade: A Modular Framework for Analytical Exploration of RDF Graphs 1926-1929 - Joseph Vinish D’silva, Florestan De Moor, Bettina Kemme:
Making an RDBMS Data Scientist Friendly: Advanced In-database Interactive Analytics with Visualization Support 1930-1933 - Khaled Zaouk, Fei Song, Chenghao Lyu, Arnab Sinha, Yanlei Diao, Prashant Shenoy:
UDAO: A Next-Generation Unified Data Analytics Optimizer 1934-1937 - Saehan Jo, Immanuel Trummer, Weicheng Yu, Xuezhi Wang, Cong Yu, Daniel Liu, Niyati Mehta:
AggChecker: A Fact-Checking System for Text Summaries of Relational Data Sets 1938-1941 - Hanzhang Wang, Phuong Nguyen, Jun Li, Selcuk Kopru, Gene Zhang, Sanjeev Katariya, Sami Ben-Romdhane:
GRANO: Interactive Graph-based Root Cause Analysis for Cloud-Native Distributed Data Platform 1942-1945 - Davide Frey, Marc X. Makkes, Pierre-Louis Roman, François Taïani, Spyros Voulgaris:
Dietcoin: Hardening Bitcoin Transaction Verification Process For Mobile Devices 1946-1949 - Samriddhi Singla, Ahmed Eldawy, Rami Alghamdi, Mohamed F. Mokbel:
Raptor: Large Scale Analysis of Big Raster and Vector Data 1950-1953 - El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang, Ahmed K. Elmagarmid:
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics 1954-1957 - Leonhard F. Spiegelberg, Tim Kraska:
Tuplex: Robust, Efficient Analytics When Python Rules 1958-1961 - Cedric Renggli, Frances Ann Hubis, Bojan Karlaš, Kevin Schawinski, Wentao Wu, Ce Zhang:
Ease.ml/ci and Ease.ml/meter in Action: Towards Data Management for Statistical Generalization 1962-1965 - Han Xueran, Jun Chen, Jiaheng Lu, Yueguo Chen, Xiaoyong Du:
PivotE: Revealing and Visualizing the Underlying Entity Structures for Exploration 1966-1969 - Jiaheng Lu, Yuxing Chen, Herodotos Herodotou, Shivnath Babu:
Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems 1970-1973 - Yu Meng, Jiaxin Huang, Jingbo Shang, Jiawei Han:
TextCube: Automated Construction and Multidimensional Exploration 1974-1977 - Sihem Amer-Yahia, Senjuti Basu Roy:
The Ever Evolving Online Labor Market: Overview, Challenges and Opportunities 1978-1981 - Ibrahim Sabek, Mohamed F. Mokbel:
Machine Learning Meets Big Spatial Data 1982-1985 - Fatemeh Nargesian, Erkang Zhu, Renée J. Miller, Ken Pu, Patricia C. Arocena:
Data Lake Management: Challenges and Opportunities 1986-1989 - Laks V.s. Lakshmanan, Michael Simpson, Saravanan Thirumuruganathan:
Combating Fake News: A Data Management and Mining Perspective 1990-1993 - Nicolas Anciaux, Luc Bouganim, Philippe Pucheral, Iulian Sandu Popa, Guillaume Scerri:
Personal Database Security and Trusted Execution Environments: A Tutorial at the Crossroads 1994-1997 - Stephan Kessler, Jens Hoff, Johann-Christoph Freytag:
SAP HANA goes private - From Privacy Research to Privacy Aware Enterprise Analytics 1998-2009 - Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alex Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte:
Guided automated learning for query workload re-optimization 2010-2021 - Biswapesh Chattopadhyay, Priyam Dutta, Weiran Liu, Ott Tinn, Andrew Mccormick, Aniket Mokashi, Paul Harvey, Hector Gonzalez, David Lomax, Sagar Mittal, Roee Ebenstein, Nikita Mikhaylin, Hung-Ching Lee, Xiaoyan Zhao, Tony Xu, Luis Perez, Farhad Shahmohammadi, Tran Bui, Neil Mckay, Selcuk Aya, Vera Lychagina, Brett Elliott:
Procella: Unifying serving and analytical data at YouTube 2022-2034 - Wei Lu, Zhanhao Zhao, Xiaoyu Wang, Haixiang Li, Zhenmiao Zhang, Zhiyu Shui, Sheng Ye, Anqun Pan, Xiaoyong Du:
A Lightweight and Efficient Temporal Database Management System in TDSQL 2035-2046 - Reza Sherkat, Colin Florendo, Mihnea Andrei, Rolando Blanco, Adrian Dragusanu, Amit Pathak, Pushkar Khadilkar, Neeraj Kulkarni, Christian Lemke, Sebastian Seifert, Sarika Iyer, Sasikanth Gottapu, Robert Schulze, Chaitanya Gottipati, Nirvik Basak, Yanhong Wang, Vivek Kandiyanallur, Santosh Pendap, Dheren Gala, Rajesh Almeida, Prasanta Ghosh:
Native Store Extension for SAP HANA 2047-2058 - Chaoqun Zhan, Maomeng Su, Chuangxian Wei, Xiaoqiang Peng, Liang Lin, Sheng Wang, Zhe Chen, Feifei Li, Yue Pan, Fang Zheng, Chengliang Chai:
AnalyticDB: Real-time OLAP Database System at Alibaba Cloud 2059-2070 - William Schultz, Tess Avitabile, Alyson Cabral:
Tunable Consistency in MongoDB 2071-2081 - Shaosheng Cao, Xinxing Yang, Cen Chen, Jun Zhou, Xiaolong Li, Yuan Qi:
TitAnt: Online Real-time Transaction Fraud Detection in Ant Financial 2082-2093 - Rong Zhu, Kun Zhao, Hongxia Yang, Wei Lin, Chang Zhou, Baole Ai, Yong Li, Jingren Zhou:
AliGraph: A Comprehensive Graph Neural Network Platform 2094-2105 - Zhimin Chen, Yue Wang, Vivek Narasayya, Surajit Chaudhuri:
Customizable and Scalable Fuzzy Join for Big Data 2106-2117 - Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao:
QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning 2118-2130 - Srikanth Kandula, Kukjin Lee, Surajit Chaudhuri, Marc Friedman:
Experiences with Approximating Queries in Microsoft's Production Big-Data Clusters 2131-2142 - Panagiotis Antonopoulos, Peter Byrne, Wayne Chen, Cristian Diaconu, Raghavendra Thallam Kodandaramaih, Hanuma Kodavalla, Prashanth Purnananda, Adrian-Leonard Radu, Chaitanya Sreenivas Ravella, Girish Mittur Venkataramanappa:
Constant Time Recovery in Azure SQL Database 2143-2154 - Yuzhen Huang, Yingjie Shi, Zheng Zhong, Yihui Feng, James Cheng, Jiwei Li, Haochuan Fan, Chao Li, Tao Guan, Jingren Zhou:
Yugong: Geo-Distributed Data and Job Placement at Scale 2155-2169 - Junjay Tan, Thanaa Ghanem, Matthew Perron, Xiangyao Yu, Michael Stonebraker, David Dewitt, Marco Serafini, Ashraf Aboulnaga, Tim Kraska:
Choosing A Cloud DBMS: Architectures and Tradeoffs 2170-2182 - Jingtian Zhang, Sai Wu, Zeyuan Tan, Gang Chen, Zhushi Cheng, Wei Cao, Yusong Gao, Xiaojie Feng:
S3: A Scalable In-memory Skip-List Index for Key-Value Store 2183-2194 - Charles Masson, Jee E. Rim, Homin K. Lee:
DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees 2195-2205 - Qiang Long, Wei Wang, Jinfu Deng, Song Liu, Wenhao Huang, Fangying Chen, Sifan Liu:
A Distributed System for Large-scale n-gram Language Models at Tencent 2206-2217 - Kayhan Dursun, Carsten Binnig, Ugur Cetintemel, Garret Swart, Weiwei Gong:
A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores 2218-2229 - Lei Cao, Wenbo Tao, Sungtae An, Jing Jin, Yizhou Yan, Xiaoyu Liu, Wendong Ge, Adam Sah, Leilani Battle, Jimeng Sun, Remco Chang, Brandon Westover, Samuel Madden, Michael Stonebraker:
Smile: A System to Support Machine Learning on EEG Data at Scale 2230-2241 - Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Martin Schuster, Petra Selmer, Hannes Voigt:
Updating Graph Databases with Cypher 2242-2253 - Asya Kamsky:
Adapting TPC-C Benchmark to Measure Performance of Multi-Document Transactions in MongoDB 2254-2262 - Feifei Li:
Cloud native database systems at Alibaba: Opportunities and Challenges 2263-2272 - Alexander Boehm:
In-Memory for the masses: Enabling cost-efficient deployments of in-memory data management platforms for business applications 2273-2274 - Murtadha Al Hubail, Ali Alsuliman, Michael Blow, Michael Carey, Dmitry Lychagin, Ian Maxon, Till Westmann:
Couchbase Analytics: NoETL for Scalable NoSQL Data Analysis 2275-2286 - Adrian Coyler:
Performance in the spotlight 2287-2289 - Azza Abouzied, Daniel J. Abadi, Kamil Bajda-Pawlikowski, Avi Silberschatz:
Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology 2290-2299 - Brian F. Cooper, P.p.s. Narayan, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, Ramana Yerneni:
PNUTS to Sherpa: Lessons from Yahoo!’s Cloud Database 2300-2307 - Wang-Chiew Tan:
What I probably did right and what I think I could have done better 2308-2308 - Aditya Parameswaran:
Enabling Data Science for the Majority 2309-2322 - Theodoros Rekatsinas, Sudeepa Roy, Manasi Vartak, Ce Zhang, Neoklis Polyzotis:
Opportunities for Data Management Research in the Era of Horizontal AI/ML 2323-2324
Volume 12, No. 13, September 2019
- Lei Chen and Fatma Özcan:
Front Matter i-vi - Claude Barthels, Ingo Müller, Konstantin Taranov, Gustavo Alonso, Torsten Hoefler:
Strong consistency is not hard to get: Two-Phase Locking and Two-Phase Commit on Thousands of Cores 2325-2338 - Ziheng Wei, Uwe Leck, Sebastian Link:
Discovery and Ranking of Embedded Uniqueness Constraints 2339-2352 - Lingyang Chu, Yanyan Zhang, Yu Yang, Lanjun Wang, Jian Pei:
Online Density Bursting Subgraph Detection from Temporal Graphs 2353-2365 - Pedro Holanda, Stefan Manegold, Hannes Mühleisen, Mark Raasveldt:
Progressive Indexes: Indexing for Interactive Data Analysis 2366-2378 - Masatoshi Hanai, Toyotaro Suzumura, Wen Jun Tan, Elvis Liu, Georgios Theodoropoulos, Wentong Cai:
Distributed Edge Partitioning for Trillion-edge Graphs 2379-2392 - Manos Athanassoulis, Kenneth Bøgh, Stratos Idreos:
Optimal Column Layout for Hybrid Workloads 2393-2407 - Stavros Sintos, Pankaj K. Agarwal, Jun Yang:
Selecting Data to Clean for Fact Checking: Minimizing Uncertainty vs. Maximizing Surprise 2408-2421
About Cookies On This Site
We use cookies to ensure that we give you the best experience on our website. We'll assume you're ok with this, but you can opt-out if you wish.
Read More
Cookie settingsGot It!Privacy & Cookies Policy