Detailed Program

Tuesday 25, 2009

08:45-09:00

Opening Ceremony

Room: Auditorium Lumière

09:00-10:30

Keynote 1

Chair: Patrick Valduriez (INRIA Bretagne) -- Room: Auditorium Lumière

  • Cloud Data Serving: Key-Value Stores to DBMSs
    Raghu Ramakrishnan (Yahoo! Research)

11:00-12:30

Research sessions - Scientific Databases and Provenance

Chair: Frank Neven (U Hasselt) -- Room: Rhône 1

  • Believe It or Not: Adding Belief Annotations to Databases - Slides
    Wolfgang Gatterbauer (Univ. of Washington), Magdalena Balazinska (Univ. of Washington), Nodira Khoussainova (Univ. of Washington), Dan Suciu (Univ. of Washington)
  • Similarity Search on Bregman Divergence: Towards Non-Metric Indexing - Slides
    Zhenjie Zhang (National Univ. of Singapore), Beng Chin Ooi (National Univ. of Singapore), Srinivasan Parthasarathy (Ohio State Univ.), Anthony Tung (National Univ. of Singapore)
  • Comparing Stars: On Approximating Graph Edit Distance
    Zhiping Zeng (Tsinghua Univ.), Anthony Tung (National Univ. of Singapore), Jianyong Wang (Tsinghua Univ.), Jianhua Feng (Tsinghua Univ.), Lizhu Zhou (Tsinghua Univ.)

Research sessions - Information Filtering and Dissemination

Chair: Alin Deutsch (UCSD) -- Room: Rhône 2

  • Indexing Boolean Expressions - Slides
    Steven Whang (Stanford Univ.), Chad Brower (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research), Serguei Vassilvitskii (Yahoo! Research), Erik Vee (Yahoo! Research), Ramana Yerneni (Yahoo! Research), Hector Garcia-Molina (Stanford Univ.)
  • Scalable Delivery of Stream Query Results
    Yongluan Zhou (Univ. of Southern Denmark), Ali Salehi (EPFL), Karl Aberer (EPFL)
  • Schema-Based Independence Analysis for XML Updates - Slides
    Michael Benedikt (Oxford Univ.), James Cheney (Univ. of Edinburgh)

Research sessions - Stream Processing I

Chair: Yanlei Diao (UMass Amherst) -- Room: Rhône 3A

  • Tagging Stream Data for Rich Real-Time Services
    Rimma Nehme (Microsoft Jim Gray Systems Lab), Elke Rundensteiner (WPI), Elisa Bertino (Purdue Univ.)
  • Randomized Multi-pass Streaming Skyline Algorithms - Slides
    Atish Das Sarma (Georgia Tech.), Ashwin Lall (Georgia Tech), Danupon Nanongkai (Georgia Tech), Jun Xu (Georgia Tech)
  • Managing Massive Time Series Streams with MultiScale Compressed Trickles - Slides
    Galen Reeves (Univ. of California, Berkeley), Jie Liu (Microsoft Research), Suman Nath (Microsoft Research), Feng Zhao (Microsoft Research)

Research sessions - Database Search and Ranking

Chair: Cong Yu (Yahoo! Research, USA) -- Room: Rhône 3B

  • Promotion Analysis in Multi-Dimensional Space - Slides
    Tianyi Wu (UIUC), Dong Xin (Microsoft Research), Qiaozhu Mei (UIUC), Jiawei Han (UIUC)
  • Measure-driven Keyword-Query Expansion - Slides
    Nikos Sarkas (Univ. of Toronto), Nilesh Bansal (Univ. of Toronto), Gautam Das (Univ. of Texas At Arlington), Nick Koudas (Univ. of Toronto)
  • Using Trees to Depict a Forest - Slides
    Bin Liu (Univ. of Michigan), H.V. Jagadish (Univ. of Michigan)

Industrial sessions - Automatic Data Management

Chair: Len Seligman (MITRE Corporation, USA) -- Room: Auditorium Lumière

  • Declarative Database Management in SQLServer
    Hongfei Guo (Microsoft), Dan Jones (Microsoft), Jennifer Beckmann (Microsoft), Praveen Seshadri (Microsoft)
  • StatAdvisor: Recommending Statistical Views
    Amr El-Helw (Univ. of Waterloo), Ihab Ilyas (Univ. of Waterloo), Calisto Zuzarte (IBM Toronto)
  • An Object Placement Advisor for DB2 Using Solid State Storage
    Mustafa Canim (Univ. of Texas at Dallas), Bishwaranjan Bhattacharjee (IBM T.J. Watson Research Center), George Mihaila (IBM T.J.Watson Research Center), Christian Lang (IBM T.J.Watson Research Center), Ken Ross (Columbia Univ.)

Demo sessions - Core DB Technology & System issues

Room: Rhône 4 and Rhône 5

  • Query Mesh: Multi-Route Query Processing Technology
    Rimma Nehme (Microsoft Jim Gray Systems Lab), Karen Works (WPI), Elke Rundensteiner (WPI), Elisa Bertino (Purdue Univ.)
  • A Demonstration of SciDB: A Science-Oriented DBMS
    Philippe Cudre-Mauroux (Massachusetts Institute of Technology), Hideaki Kimura (Brown Univ.), Kian-Tat Lim (SLAC), Jennie Rogers (Brown Univ.), Roman Simakov (NIISI, Russian Academy of Science), Emad Soroush (Univ. of Washington), Pavel Velikhov (NIISI, Russian Academy of Science), Daniel Wang (SLAC), Magdalena Balazinska (Univ. of Washington), Jacek Becla (SLAC), David DeWitt (Microsoft Research), Bobbi Heath (Vertica), David Maier (Portland State Univ.), Samuel Madden (Massachusetts Institute of Technology), Jignesh Patel (Univ. of Wisconsin), Michael Stonebraker (Massachusetts Institute of Technology), Stan Zdonik (Brown Univ.)
  • MOIR/MT: Monitoring Large-Scale Road Network Traffic in Real-Time
    Kuien Liu (Institute of Software, Chinese Academy of Science), Ke Deng (The Univ. of Queensland), Zhiming Ding (Institute of Software, Chinese Academy of Science), Mingshu Li (Institute of Software, Chinese Academy of Science), Xiaofang Zhou (The Univ. of Queensland)
  • Oracle Database Replay
    Romain Colle (Oracle), Leonidas Galanis (Oracle), Yujun Wang (Oracle), Supiti Buranawatanachoke (Oracle), Stratos Papadomanolakis (Oracle)
  • DIADS: A Problem Diagnosis Tool for Databases and Storage Area Networks
    Nedyalko Borisov (Duke Univ.), Shivnath Babu (Duke Univ.), Sandeep Uttamchandani (IBM Almaden Research Center), Ramani Routray (IBM Almaden Research Center), Aameek Singh (IBM Almaden Research Center)
  • Artemis: A System for Analyzing Missing Answers
    Melanie Herschel (Univ. of Tübingen, Germany), Mauricio Hernandez (IBM Almaden Research Center), Wang-Chiew Tan (UC Santa Cruz)
  • Demonstration of the TrajStore System
    Eugene Wu (Massachusetts Institute of Technology), Philippe Cudre-Mauroux (Massachusetts Institute of Technology), Samuel Madden (Massachusetts Institute of Technology)
  • Microsoft CEP Server and Online Behavioral Targeting
    Mohamed Ali (Microsoft), Ciprian Gerea (Microsoft), Balan Raman (Microsoft), Beysim Sezgin (Microsoft), Tiho Tarnavski (Microsoft), Tomer Verona (Microsoft), Ping Wang (Microsoft), Peter Zabback (Microsoft), Anton Kirilov (Microsoft), Asvin Ananthanarayan (Microsoft), Ming Lu (Microsoft), Alex Raizman (Microsoft), Ramkumar Krishnan (Microsoft), Roman Schindlauer (Microsoft), Torsten Grabs (Microsoft), Sharon Bjeletich (Microsoft), Badrish Chandramouli (Microsoft Research), Jonathan Goldstein (Microsoft Research), Sudin Bhat (Microsoft), Ying Li (Microsoft), Vincenzo Di Nicola (Microsoft), Xianfang Wang (Microsoft), David Maier (Portland State Univ.), Ivo Santos (European Microsoft Innovation Center, Aachen), Olivier Nano (Microsoft), Stephan Grell (Microsoft)
  • A Testbed for Managing Dynamic Mixed Workloads
    Stefan Krompaß (Technische Universität München), Harumi Kuno (Hewlett-Packard Laboratories), Janet Wiener (Hewlett-Packard Laboratories), Kevin Wilkinson (Hewlett-Packard Laboratories), Umeshwar Dayal (Hewlett-Packard Laboratories), Alfons Kemper (Technische Universität München)
  • DBToaster: A SQL Compiler for High-Performance Delta Processing in Main-Memory Databases
    Yanif Ahmad (Cornell Univ.), Christoph Koch (Cornell Univ.)

Tutorial 1

Room: Auditorium Pasteur

  • Data fusion - Resolving Data Conflicts for Integration
    Xin Luna Dong (AT&T Labs - Research), Felix Naumann (Hasso Plattner Institute)

14:00-15:30

Research sessions - Sensor Networks

Chair: Divesh Srivastava (AT&T Labs-Research) -- Room: Rhône 1

  • Online Piece-wise Linear Approximation of Numerical Streams with Precision Guarantees - Slides
    Hazem Elmeleegy (Purdue Univ.), Ahmed Elmagarmid (Purdue Univ.), Emmanuel Cecchet (UMass Amherst), Walid Aref (Purdue Univ.), Willy Zwaenepoel (EPFL)
  • A Wavelet Transform for Efficient Consolidation of Sensor Relations with Quality Guarantees - Slides
    Mirco Stern (Universität Karlsruhe (TH)), Erik Buchmann (Universität Karlsruhe (TH)), Klemens Böhm (Universität Karlsruhe (TH))
  • Enabling eps-Approximate Querying in Sensor Networks - Slides
    LIU Yu (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology), Hong Gao (Harbin Institute of Technology), Xiaolin Fang (Harbin Institute of Technology)

Research sessions - Information Integration

Chair: Xin Dong (AT&T Labs--Research) -- Room: Rhône 2

  • HAMSTER: Using Search Clicklogs for Schema and Taxonomy Matching - Slides
    Arnab Nandi (Univ. of Michigan), Philip Bernstein (Microsoft Research)
  • Cooperative Update Exchange in the Youtopia System - Slides
    Lucja Kot (Cornell Univ.), Christoph Koch (Cornell Univ.)
  • Reference-Based Alignment in Large Sequence Databases - Slides
    Panagiotis Papapetrou (Boston Univ.), Vassilis Athitsos (Univ. of Texas at Arlington), George Kollios (Boston Univ.), Dimitrios Gunopulos (UoA, UCR)

Research sessions - Stream Processing II

Chair: Nesime Tatbul (ETH Zurich) -- Room: Rhône 3A

  • Thread Cooperation in Multicore Architectures for Frequency Counting over Multiple Data Streams - Slides
    Sudipto Das (UC Santa Barbara), Shyam Antony (UC Santa Barbara), Divyakant Agrawal (UC Santa Barbara), Amr El Abbadi (UC Santa Barbara)
  • Streams on Wires - A Query Compiler for FPGAs - Slides
    Rene Mueller (ETH Zurich), Jens Teubner (ETH Zurich), Gustavo Alonso (ETH Zurich)
  • On-the-fly Progress Detection in Iterative Stream Queries
    Badrish Chandramouli (Microsoft Research), Jonathan Goldstein (Microsoft Research), David Maier (Portland State Univ.)

Research sessions - Cloud Computing and Data Warehousing

Chair: Daniel Abadi (Yale U., USA) -- Room: Rhône 3B

  • Consistency Rationing in the Cloud: Pay only when it matters - Slides
    Tim Kraska (ETH Zurich), Martin Hentschel (ETH Zurich), Gustavo Alonso (ETH Zurich), Donald Kossmann (ETH Zurich)
  • Locking Key Ranges with Unbundled Transaction Services - Slides
    David Lomet (Microsoft Research), Mohamed Mokbel (Univ. of Minnesota)
  • A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses - Slides
    George Candea (EPFL and Aster Data Systems), Neoklis Polyzotis (UC Santa Cruz), Radek Vingralek (Aster Data Systems)

Industrial sessions - XML Data Management

Chair: Ioana Manolescu (INRIA Saclay) -- Room: Auditorium Lumière

  • XPEDIA: XML ProcEssing for Data IntegrAtion
    Manish Bhide (IBM Research), Manoj Agarwal (IBM India Research Lab), Amir Bar-Or (IBM), Sriram Padmanabhan (IBM SVL), Srinivas Mittapalli (IBM), Girish Venkatachaliah (IBM)
  • Binary XML Storage and Query Processing in Oracle 11g - Slides
    Ning Zhang (Facebook), Nipun Agarwal (Oracle), Sivasankaran Chandrasekar (Oracle), Sam Idicula (Oracle), Vijay Medi (Oracle), Sabina Petride (Oracle), Balasubramanyam Sthanikam (Oracle)
  • XQuery Reloaded
    Vinayak Borkar (Univ. of California, Irvine), Matthias Brantner (28msec, Inc.), Peter Fischer (ETH Zurich), Daniela Florescu (Oracle), Donald Kossmann (ETH Zurich), Tim Kraska (ETH Zurich), Markos Zacharioudaki (Oracle), David Graf (28msec), Roger Bamford (Oracle), Dan Muresan (FLWOR Foundation), Sorin Nasoi (FLWOR Foundation)

Demo sessions - Web & Data Integration

Room: Rhône 4 and Rhône 5

  • ANGIE: Active Knowledge for Interactive Exploration
    Nicoleta Preda (Max-Planck Institute), Fabian Suchanek (Max-Planck Institute), Gjergji Kasneci (Max-Planck Institute), Thomas Neumann (Max-Planck Institute), Maya Ramanath (Max-Planck Institute), Gerhard Weikum (Max-Planck Institute)
  • Comparative evaluation of entity resolution approaches with FEVER
    Hanna Köpcke (Univ. of Leipzig), Andreas Thor (Univ. of Leipzig), Erhard Rahm (Univ. of Leipzig)
  • RankIE: Document Retrieval on Ranked Entity Graphs
    Falk Brauer (SAP Research), Wojciech Barczynski (SAP Research), Gregor Hackenbroich (SAP Research), Marcus Schramm (SAP Research), Adrian Mocan (SAP Research ), Felix Förster (SAP AG)
  • Concise and Expressive Mappings with +Spicy
    Giansalvatore Mecca (Unviersità della Basilicata), Paolo Papotti (Università Roma Tre), Salvatore Raunich (Università della Basilicata), Marcello Buoncristiano (Università della Basilicata)
  • AgreementMaker: Efficient Matching for Large Real-World Schemas and Ontologies
    Isabel Cruz (Univ. of Illinois at Chicago), Flavio Palandri Antonelli (Univ. of Illinois at Chicago), Cosmin Stroe (Univ. of Illinois at Chicago)
  • Linkage Query Writer
    Oktie Hassanzadeh (Univ. of Toronto), Reynold Xin (Univ. of Toronto), Renée Miller (Univ. of Toronto), Anastasios Kementsietsidis (IBM T. J. Watson Research Center), Lipyeow Lim (IBM T. J. Watson Research Center), Min Wang (IBM T. J. Watson Research Center )
  • SMDM: Enhancing Enterprise-Wide Master Data Management Using Semantic Web Technologies
    Xiaoyuan Wang (IBM China Research Lab), Xingzhi Sun (IBM China Research Lab), Feng Cao (IBM China Research Lab), Li Ma (IBM), Nick Kanellos (IBM Software Group), Kang Zhang (Shanghai Jiao Tong Univ.), Yue Pan (IBM China Research Lab), Yong Yu (Shanghai Jiao Tong Univ.)
  • IBM UFO Repository
    Michael Gubanov (Univ. of Washington), Lucian Popa (IBM Almaden Research Center), Howard Ho (IBM Almaden Research Center), Hamid Pirahesh (IBM Almaden Research Center), Jeng-Yih Chang (National Yang-Ming Univ.), Shr-Chang Chen (National Yang-Ming Univ.)
  • Mashup by Surfing a Web of Data APIs
    Huajun Chen (Zhejiang Univ.), Bin Lu (Zhejiang Univ.), Yuan Ni (Ibm), Guo tong Xie (Zhejiang Univ.), Chunyin Zhou (Zhejiang Univ.), Jinhua Mi (Zhejiang Univ.), Zhaohui Wu (Zhejiang Univ.)
  • DEMo: Data Exchange Modeling Tool
    Reinhard Pichler (TU Wien), Vadim Savenkov (TU Wien)

Tutorial 2

Room: Auditorium Pasteur

  • Data visualization & social data analysis I
    Jeffrey Heer (Stanford), Joseph M. Hellerstein (UC Berkeley)

16:00-17:30

Research sessions - XML and Unstructured Data

Chair: Marie-Christine Rousset (University of Grenoble) -- Room: Rhône 1

  • Answering Table Augmentation Queries from Unstructured Lists on the Web
    Rahul Gupta (IIT Bombay), Sunita Sarawagi (IIT Bombay)
  • Efficient Rewriting of XPath Queries Using Query Set Specifications
    Bogdan Cautis (Telecom ParisTech), Alin Deutsch (Univ. of California, San Diego), Nicola Onose (Univ. of California, San Diego), Vasilis Vassalos (AUEB)
  • Structured Search Result Differentiation - Slides
    Ziyang Liu (Arizona State Univ.), Peng Sun (Arizona State Univ.), Yi Chen (Arizona State Univ.)

Research sessions - Web Data Integration

Chair: Irini Fundulaki (ICS-Forth, Greece) -- Room: Rhône 2

  • A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration - Slides
    Thomas Kabisch (Humboldt Univ. Berlin), Eduard Constantin Dragut (Univ. of Illinois at Chicago), Clement Yu (Univ. of Illinois at Chicago), Ulf Leser (Humboldt Univ. Berlin)
  • Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects
    Gao Cong (Aalborg Univ.), Christian S. Jensen (Aalborg Univ. and Google Inc.), Dingming Wu (Aalborg Univ.)
  • Stop Word and Related Problems in Web Interface Integration - Slides
    Eduard Constantin Dragut (Univ. of Illinois at Chicago), Fang Fang (Univ. of Illinois at Chicago), Prasad Sistla (Univ. of Illinois at Chicago), Clement Yu (Univ. of Illinois at Chicago), Weiyi Meng (Binghamton)

Research sessions - Query Processing on Modern Hardware

Chair: Anastasia Ailamaki (EPFL) -- Room: Rhône 3A

  • Lazy-Adaptive Tree: An Optimized Index Structure for Flash Devices - Slides
    Devesh Agrawal (UMass Amherst), Deepak Ganesan (UMass Amherst), Ramesh Sitaraman (UMass Amherst), Yanlei Diao (UMass Amherst), Shashi Singh (UMass Amherst)
  • MCC-DB: Minimizing Cache Conflicts in Multi-core Processors for Databases - Slides
    Rubao Lee (The Ohio State Univ.), Xiaoning Ding (The Ohio State Univ.), Feng Chen (The Ohio State Univ.), Qingda Lu (The Ohio State Univ.), Xiaodong Zhang (The Ohio State Univ.)
  • SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units - Slides
    Thomas Willhalm (Intel GmbH), Nicolae Popovici (Intel GmbH), Yazan Boshmaf (SAP AG), Hasso Plattner (Hasso-Plattner-Institut), Alexander Zeier (Hasso-Plattner-Institut), Jan Schaffner (Hasso-Plattner-Institute)

Research sessions - Approximate Matching and Estimation

Chair: Amol Deshpande (University of Maryland) -- Room: Rhône 3B

  • Mining Document Collections to Facilitate Accurate Approximate Entity Matching
    Surajit Chaudhuri (Microsoft Research), Venkatesh Ganti (Microsoft Research), Dong Xin (Microsoft Research)
  • Reasoning about Record Matching Rules - Slides
    Wenfei Fan (Univ. of Edinburgh), Xibei Jia (Univ. of Edinburgh), Jianzhong Li (Harbin Institute of Technology), Shuai Ma (Univ. of Edinburgh)
  • Turbo-Charging Estimate Convergence in DBO
    Alin Dobra (Univ. of Florida), Chris Jermaine (Rice Univ.), Florin Rusu (Univ. of Florida), Fei Xu (Univ. of Florida)

Demo sessions - Warehouses & Workflows, IIS

Room: Rhône 4 and Rhône 5

  • Lahar Demonstration: Warehousing Markovian Streams
    Julie Letchner (Univ. of Washington), Christopher Re (Univ. of Wisconsin--Madison), Magdalena Balazinska (Univ. of Washington), Matthai Philipose (Intel)
  • WOLVES: Achieving Correct Provenance Analysis by Detecting and Resolving Unsound Workflow Views
    Peng Sun (Arizona State Univ.), Ziyang Liu (Arizona State Univ.), Sivaramakrishnan Natarajan (Arizona State Univ.), Susan Davidson (Univ. of Pennsylvania), Yi Chen (Arizona State Univ.)
  • TIAMAT: a Tool for Interactive Analysis of Microdata Anonymization Techniques
    Chenyun Dai (Purdue Univ.), Gabriel Ghinita (Purdue Univ.), Elisa Bertino (Purdue Univ.), Ji-Won Byun (Oracle), Ninghui Li (Purdue Univ.)
  • iNextCube: Information Network-Enhanced Text Cube
    Yintao Yu (UIUC), Cindy Lin (UIUC), Yizhou Sun (UIUC), Chen Chen (UIUC), Jiawei Han (UIUC), Binbin Liao (UIUC), Tianyi Wu (UIUC), ChengXiang Zhai (UIUC), Duo Zhang (UIUC), Bo Zhao (UIUC)
  • Hive - A Warehousing Solution Over a Map-Reduce Framework
    Ashish Thusoo (Facebook), Joydeep Sen Sarma (Facebook), Namit Jain (Facebook), Zheng Shao (Facebook), Prasad Chakka (Facebook), Suresh Anthony (Facebook), Hao Liu (Facebook), Pete Wyckoff (Facebook), Raghotham Murthy (Facebook)
  • Tolkien: An Event Based Storytelling System
    Arjun Satish (Univ. of California, Irvine), Ramesh Jain (Univ. of California, Irvine), Amarnath Gupta (Univ. of California, San Diego)
  • Enabling social networking in ad hoc networks of mobile phones
    Emre Sarigoel (ETH Zurich), Oriana Riva (ETH Zurich), Patrick Stuedi (Microsoft Research), Gustavo Alonso (ETH Zurich)
  • PDiffView: Viewing the Difference in Provenance of Workflow Results
    Zhuowei Bao (Univ. of Pennsylvania), Sarah Cohen-Boulakia (Universite Paris-Sud), Susan Davidson (Univ. of Pennsylvania), Pierrick Girard (Univ. of Pennsylvania)
  • Goal-Oriented Web-site Navigation for On-line Shoppers
    Daniel Deutch (Tel Aviv Univ.), Tova Milo (Tel Aviv Univ.), Tom Yam (Tel Aviv Univ.)

Tutorial 2

Room: Auditorium Pasteur

  • Data visualization & social data analysis II
    Jeffrey Heer (Stanford), Joseph M. Hellerstein (UC Berkeley)

Panel 1

Room: Auditorium Lumière

  • Answering Web Questions Using Structured Data – Dream or Reality?
    Fernando Pereira (Google Inc.), Anand Rajaraman (Kosmix Inc), Sunita Sarawagi (IIT Bombay), William Tunstall-Pedoe (True Knowledge), Gerhard Weikum (MPI Saarbruken). Moderator: Alon Halevy (Google Inc.)

Wednesday 26, 2009

08:45-09:00

Program Committee Report

Room: Auditorium Lumière

09:00-10:30

Keynote 2

Chair: Jignesh Patel (Univ. Wisconsin) -- Room: Auditorium Lumière

  • Bringing Database Research to Computer Games and Simulations
    Johannes Gehrke (Cornell Univ.)

11:00-12:30

Research sessions - P2P and Networked Data Management

Chair: Bernd Amann (LIP 6-Université de Pierre et Marie Curie) -- Room: Rhône 2

  • Composable, Scalable, and Accurate Weight Summarization of Unaggregated Data Sets - Slides
    Edith Cohen (AT&T Labs-Research), Nick Duffield (AT&T Labs-Research), Haim Kaplan (Tel Aviv Univ.), Carsten Lund (AT&T Labs Research), Mikkel Thorup (AT&T Labs-Research)
  • Distributed Online Aggregation
    Sai Wu (National Univ. of Singapore), Shouxu Jiang (Harbin Institute of Technology), Beng Chin Ooi (National Univ. of Singapore), Kian-Lee Tan (National Univ. of Singapore)
  • A Recall-Based Cluster Formation Game in Peer-to-Peer Systems - Slides
    Georgia Koloniari (Univ. of Ioannina, Greece), Evaggelia Pitoura (Univ. of Ioannina, Greece)

Research sessions - Transaction Processing

Chair: Paul Larson (Microsoft) -- Room: Rhône 1

  • Quantifying Isolation Anomalies - Slides
    Alan Fekete (Univ. of Sydney), Shirley Goldrei (Didco Systems), Jorge Perez Asenjo (Univ. of Sydney)
  • Improving OLTP Scalability using Speculative Lock Inheritance - Slides
    Ryan Johnson (EPFL & CMU), Ippokratis Pandis (Carnegie Mellon Univ.), Anastasia Ailamaki (EPFL)
  • Segment-based recovery: Write ahead logging revisited
    Russell Sears (UC Berkeley), Eric Brewer (UC Berkeley)

Research sessions - Probabilistic and Fuzzy Databases

Chair: Dan Suciu (University of Washington) -- Room: Rhône 3A

  • A Unified Approach to Ranking in Probabilistic Databases (Best Paper Award) - Slides
    Jian Li (Univ. of Maryland), Barna Saha (Univ. of Maryland), Amol Deshpande (Univ. of Maryland)
  • Learning String Transformations From Examples
    Arvind Arasu (Microsoft Research), Surajit Chaudhuri (Microsoft Research), Raghav Kaushik (Microsoft Research)
  • Probabilistic Histograms for Probabilistic Data - Slides
    Graham Cormode (AT&T Labs - Research), Antonios Deligiannakis (Technical Univ. of Crete), Minos Garofalakis (Technical Univ. of Crete), Andrew McGregor (UMass Amherst)

Research sessions - Data Integration I

Chair: Christoph Koch (Cornell University) -- Room: Rhône 3B

  • Autocompletion for Mashups - Slides
    Ohad Greenshpan (Tel Aviv Univ.), Tova Milo (Tel Aviv Univ.), Neoklis Polyzotis (UC Santa Cruz)
  • Integrating Conflicting Data: The Role of Source Dependence - Slides
    Xin Luna Dong (AT&T Labs - Research), Laure Berti-Equille (Univ. of Rennes 1 / AT&T Labs - Research), Divesh Srivastava (AT&T Labs - Research)
  • Truth Discovery and Copying Detection in a Dynamic World
    Xin Luna Dong (AT&T Labs--Research), Laure Berti-Equille (Univ. of Rennes 1, France / AT&T Labs - Research), Divesh Srivastava (AT&T Labs-Research)

Industrial sessions - Query Processing

Chair: Mohamed Ali (Microsoft) -- Room: Auditorium Lumière

  • Enhanced Subquery Optimizations in Oracle
    Srikanth Bellamkonda (Oracle), Rafi Ahmed (Oracle), Andrew Witkowski (Oracle), Mohamed Zait (Oracle), Angela Amor (Oracle), Chun Chieh Lin (Oracle)
  • Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs - Slides
    Changkyu Kim (Intel Corporation), Eric Sedlar (Oracle), Jatin Chhugani (Intel Corporation), Tim Kaldewey (Oracle), Anthony Nguyen (Intel), Andrea Di Blas (Oracle, UCSC), Victor Lee (Intel Corporation), Nadathur Satish (Intel Corporation), Pradeep Dubey (Intel Corporation)
  • Efficient Outer Join Data Skew Handling in Parallel DBMS
    Yu Xu (Teradata), Pekka Kostamaa (Teradata)

Demo sessions - Warehouses & Workflows, IIS

Room: Rhône 4 and Rhône 5

  • Lahar Demonstration: Warehousing Markovian Streams
    Julie Letchner (Univ. of Washington), Christopher Re (Univ. of Washington), Magdalena Balazinska (Univ. of Washington), Matthai Philipose (Intel)
  • WOLVES: Achieving Correct Provenance Analysis by Detecting and Resolving Unsound Workflow Views
    Peng Sun (Arizona State Univ.), Ziyang Liu (Arizona State Univ.), Sivaramakrishnan Natarajan (Arizona State Univ.), Susan Davidson (Univ. of Pennsylvania), Yi Chen (Arizona State Univ.)
  • TIAMAT: a Tool for Interactive Analysis of Microdata Anonymization Techniques
    Chenyun Dai (Purdue Univ.), Gabriel Ghinita (Purdue Univ.), Elisa Bertino (Purdue Univ.), Ji-Won Byun (Oracle), Ninghui Li (Purdue Univ.)
  • iNextCube: Information Network-Enhanced Text Cube
    Yintao Yu (UIUC), Cindy Lin (UIUC), Yizhou Sun (UIUC), Chen Chen (UIUC), Jiawei Han (UIUC), Binbin Liao (UIUC), Tianyi Wu (UIUC), ChengXiang Zhai (UIUC), Duo Zhang (UIUC), Bo Zhao (UIUC)
  • Hive - A Warehousing Solution Over a Map-Reduce Framework
    Ashish Thusoo (Facebook), Joydeep Sen Sarma (Facebook), Namit Jain (Facebook), Zheng Shao (Facebook), Prasad Chakka (Facebook), Suresh Anthony (Facebook), Hao Liu (Facebook), Pete Wyckoff (Facebook), Raghotham Murthy (Facebook)
  • Tolkien: An Event Based Storytelling System
    Arjun Satish (Univ. of California, Irvine), Ramesh Jain (Univ. of California, Irvine), Amarnath Gupta (Univ. of California, San Diego)
  • Enabling social networking in ad hoc networks of mobile phones
    Emre Sarigoel (ETH Zurich), Oriana Riva (ETH Zurich), Patrick Stuedi (Microsoft Research), Gustavo Alonso (ETH Zurich)
  • PDiffView: Viewing the Difference in Provenance of Workflow Results
    Zhuowei Bao (Univ. of Pennsylvania), Sarah Cohen-Boulakia (Universite Paris-Sud), Susan Davidson (Univ. of Pennsylvania), Pierrick Girard (Univ. of Pennsylvania)
  • Goal-Oriented Web-site Navigation for On-line Shoppers
    Daniel Deutch (Tel Aviv Univ.), Tova Milo (Tel Aviv Univ.), Tom Yam (Tel Aviv Univ.)

Tutorial 3

Room: Auditorium Pasteur

  • Keyword querying and Ranking in Databases
    Surajit Chaudhuri (Microsoft Research), Gautam Das (Univ. of Texas At Arlington)

14:00-15:30

Research sessions - Data Quality

Chair: Zoe Lacroix (ASU) -- Room: Rhône 1

  • Sequential Dependencies - Slides
    Lukasz Golab (AT&T Labs - Research), Howard Karloff (AT&T Labs - Research), Flip Korn (AT&T Labs - Research), Avishek Saha (Univ. of Utah), Divesh Srivastava (AT&T Labs-Research)
  • SHARC: Framework for Quality-Conscious Web Archiving - Slides
    Dimitar Denev (Max Planck Institute Informatik), Arturas Mazeika (Max Planck Institute Informatik), Marc Spaniol (Max Planck Institute Informatik), Gerhard Weikum (Max Planck Institut Informatik)
  • Modeling and Querying Possible Repairs in Duplicate Detection - Slides
    George Beskales (Univ. of Waterloo), Mohamed Soliman (Univ. of Waterloo), Ihab Ilyas (Univ. of Waterloo), Shai Ben-David (Univ. of Waterloo)

Research sessions - Data Mining I

Chair: Chris Jermaine (Rice University) -- Room: Rhône 2

  • Discovering Relative Importance of Skyline Attributes - Slides
    Denis Mindolin (SUNY at Buffalo), Jan Chomicki (SUNY at Buffalo)
  • A Particle-and-Density Based Evolutionary Clustering Method for Dynamic Networks - Slides
    Min-Soo Kim (UIUC), Jiawei Han (UIUC)
  • Summarizing Relational Databases - Slides
    Xiaoyan Yang (National Univ. of Singapore), Cecilia Procopiuc (AT&T Labs-Research), Divesh Srivastava (AT&T Labs-Research)

Research sessions - Query Estimation

Chair: Beng Chin Ooi (NUS) -- Room: Rhône 3A

  • Coordinated Weighted Sampling for Estimating Aggregates Over Multiple Weight Assignments - Slides
    Edith Cohen (AT&T Labs-Research), Haim Kaplan (Tel Aviv Univ.), Subhabrata Sen (AT&T Labs - Research)
  • Power-Law Based Estimation of Set Similarity Join Size - Slides
    Hongrae Lee (Univ. of British Columbia), Raymond Ng (Univ. of British Columbia), Kyuseok Shim (Seoul National Univ.)
  • Optimality and Scalability in Lattice Histogram Construction - Slides
    Panagiotis Karras (National Univ. of Singapore)

Research sessions - Parallelism

Chair: Peter Boncz (CWI) -- Room: Rhône 3B

  • Adaptively Parallelizing Distributed Range Queries - Slides
    Ymir Vigfusson (Cornell Univ.), Adam Silberstein (Yahoo! Research), Brian Cooper (Yahoo! Research), Rodrigo Fonseca (Yahoo! Research)
  • Mining Tree-Structured Data on Multicore Systems
    Shirish Tatikonda (Ohio State Univ.), Srinivasan Parthasarathy (Ohio State Univ.)
  • Predictable Performance for Unpredictable Workloads - Slides
    Philipp Unterbrunner (ETH Zurich), Georgios Giannikis (ETH Zurich), Gustavo Alonso (ETH Zurich), Dietmar Fauser (Amadeus IT Group SA), Donald Kossmann (ETH Zurich)

Industrial sessions - MapReduce

Chair: Praveen Rao (University of Missouri-KC) -- Room: Auditorium Lumière

  • SQL/MapReduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions
    John Cieslewicz (Aster Data Systems), Peter Pawlowski (Aster Data Systems), Eric Friedman (Aster Data Systems)
  • Building a HighLevel Dataflow System on top of MapReduce: The Pig Experience
    Alan Gates (Yahoo!), Olga Natkovich (Yahoo!), Shubham Chopra (Yahoo! Research), Pradeep Kamath (Yahoo!), Shravan Narayanam (Yahoo!), Christopher Olston (Yahoo! Research), Benjamin Reed (Yahoo! Research), Santhosh Srinivasan (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research)
  • PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce
    Biswanath Panda (Google Inc.), Joshua Herbach (Google Inc.), Sugato Basu (Google Inc.), Roberto Bayardo (Google Inc.)

Demo sessions - Core DB Technology & System issues

Room: Rhône 4 and Rhône 5

  • Query Mesh: Multi-Route Query Processing Technology
    Rimma Nehme (Purdue Univ.), Karen Works (WPI), Elke Rundensteiner (RPI), Elisa Bertino (Purdue Univ.)
  • A Demonstration of SciDB: A Science-Oriented DBMS
    Philippe Cudre-Mauroux (Massachusetts Institute of Technology), Hideaki Kimura (Brown Univ.), Kian-Tat Lim (SLAC), Jennie Rogers (Brown Univ.), Roman Simakov (NIISI), Emad Soroush (Univ. of Washington), Pavel Velikhov (NIISI), Daniel Wang (SLAC), Magdalena Balazinska (Univ. of Washington), Jacek Becla (SLAC), David DeWitt (Microsoft Research), Bobbi Heath (Vertica), David Maier (Portland State Univ.), Samuel Madden (Massachusetts Institute of Technology), Jignesh Patel (Univ. of Wisconsin), Michael Stonebraker (Massachusetts Institute of Technology), Stan Zdonik (Brown Univ.)
  • MOIR/MT: Monitoring Large-Scale Road Network Traffic in Real-Time
    Kuien Liu (The Univ. of Queensland), Ke Deng (The Univ. of Queensland), Zhiming Ding (Institute of Software, Chinese Academy of Science), Mingshu Li (Institute of Software, Chinese Academy of Science), Xiaofang Zhou (The Univ. of Queensland)
  • Oracle Database Replay
    Romain Colle (Oracle), Leonidas Galanis (Oracle), Yujun Wang (Oracle), Supiti Buranawatanachoke (Oracle), Stratos Papadomanolakis (Oracle)
  • DIADS: A Problem Diagnosis Tool for Databases and Storage Area Networks
    Nedyalko Borisov (Duke Univ.), Shivnath Babu (Duke Univ.), Sandeep Uttamchandani (IBM Almaden Research Center), Ramani Routray (IBM Almaden Research Center), Aameek Singh (IBM Almaden Research Center)
  • Artemis: A System for Analyzing Missing Answers
    Melanie Herschel (IBM Almaden Research Center), Mauricio Hernandez (IBM Almaden Research Center), Wang-Chiew Tan (UC Santa Cruz)
  • Demonstration of the TrajStore System
    Eugene Wu (Massachusetts Institute of Technology), Philippe Cudre-Mauroux (Massachusetts Institute of Technology), Samuel Madden (Massachusetts Institute of Technology)
  • Microsoft CEP Server and Online Behavioral Targeting
    Mohamed Ali (Microsoft), Ciprian Gerea (Microsoft), Balan Raman (Microsoft), Beysim Sezgin (Microsoft), Tiho Tarnavski (Microsoft), Tomer Verona (Microsoft), Ping Wang (Microsoft), Peter Zabback (Microsoft), Anton Kirilov (Microsoft), Asvin Ananthanarayan (Microsoft), Ming Lu (Microsoft), Alex Raizman (Microsoft), Ramkumar Krishnan (Microsoft), Roman Schindlauer (Microsoft), Torsten Grabs (Microsoft), Sharon Bjeletich (Microsoft), Badrish Chandramouli (Microsoft Research), Jonathan Goldstein (Microsoft Research), Sudin Bhat (Microsoft), Ying Li (Microsoft), Vincenzo Di Nicola (Microsoft), Xianfang Wang (Microsoft), David Maier (Portland State Univ.), Ivo Santos (Microsoft Inc.), Olivier Nano (Microsoft), Stephan Grell (Microsoft)
  • A Testbed for Managing Dynamic Mixed Workloads
    Stefan Krompaß (Technische Universität München), Harumi Kuno (Hewlett-Packard Laboratories), Janet Wiener (Hewlett-Packard Laboratories), Kevin Wilkinson (Hewlett-Packard Laboratories), Umeshwar Dayal (Hewlett-Packard Laboratories), Alfons Kemper (Technische Universität München)
  • DBToaster: A SQL Compiler for High-Performance Delta Processing in Main-Memory Databases
    Yanif Ahmad (Cornell Univ.), Christoph Koch (Cornell Univ.)

Tutorial 4

Room: Auditorium Pasteur

  • Efficient Approximate Search on String Collections I
    Marios Hadjieleftheriou (AT&T Labs - Research), Chen Li (Univ. of California, Irvine)

16:00-17:30

Research sessions - Graph Data Mining

Chair: H.V. Jagadish (University of Michigan) -- Room: Rhône 1

  • Graph Clustering Based on Structural/Attribute Similarities - Slides
    Yang Zhou (Chinese Univ. of HongKong), Hong Cheng (Chinese Univ. of Hong Kong), Jeffrey Yu (Chinese Univ. of Hong Kong)
  • Output Space Sampling for Graph Patterns - Slides
    Mohammad Hasan (Rensselaer Polytechnic Institute, Troy, New-York), Mohammed Zaki (Rensselaer Polytechnic Institute)
  • Mining Graph Patterns Efficiently via Randomized Summaries - Slides
    Chen Chen (UIUC), Cindy Lin (UIUC), Matt Fredrikson (Univ. of Wisconsin - Madison), Mihai Christodorescu (IBM T.J. Watson Research Center), Xifeng Yan (Univ. of California at Santa Barbara), Jiawei Han (UIUC)

Research sessions - Social Networks and Recommendations

Chair: Wang-Chiew Tan (UC Santa Cruz) -- Room: Rhône 2

  • Group Recommendation: Semantics and Efficiency - Slides
    Sihem Amer-Yahia (Yahoo! Research), Senjuti Basu Roy ( Univ. of Texas at Arlington), Ashish Chawla (Yahoo! Inc), Gautam Das (Univ. of Texas At Arlington), Cong Yu (Yahoo! Research)
  • Class-based graph anonymization for social network data - Slides
    Graham Cormode (AT&T Labs), Divesh Srivastava (AT&T Labs - Research), Smriti Bhagat (Rutgers Univ.), Balachander Krishnamurthy (AT&T Labs - Research)
  • Improved Search for Socially Annotated Data
    Nikos Sarkas (Univ. of Toronto), Gautam Das (Univ. of Texas), Nick Koudas (Univ. of Toronto)

Research sessions - Privacy I

Chair: Angela Bonifati (CNR, Italy) -- Room: Rhône 3A

  • Data Publishing against Realistic Adversaries - Slides
    Ashwin Machanavajjhala (Yahoo! Research), Johannes Gehrke (Cornell Univ.), Michaela Goetz (Cornell Univ.)
  • Scalable Verification for Outsourced Dynamic Databases - Slides
    HweeHwa Pang (Singapore Management Univ.), Jilian Zhang (Singapore Management Univ.), Kyriakos Mouratidis (Singapore Management Univ.)
  • Optimal Random Perturbation at Multiple Privacy Levels
    Xiaokui Xiao (Nanyang Technological Univ.), Yufei Tao (Chinese Univ. of Hong Kong), Minghua Chen (Chinese Univ. of Hong Kong)

Research sessions - Potpourri

Chair: Carlo Zaniolo (UCLA, USA) -- Room: Rhône 3B

  • Anticipatory DTW for Efficient Similarity Search in Time Series Databases - Slides
    Ira Assent (Aalborg Univ.), Marc Wichterich (RWTH Aachen Univ.), Ralph Krieger (RWTH Aachen Univ.), Hardy Kremer (RWTH Aachen Univ.), Thomas Seidl (RWTH Aachen Univ.)
  • Improving the Performance of List Intersection
    Dimitris Tsirogiannis (Univ. of Toronto), Sudipto Guha (Univ. of Pennsylvania), Nick Koudas (Univ. of Toronto)
  • Consistent Histograms In The Presence of Distinct Value Counts
    Raghav Kaushik (Microsoft Research), Dan Suciu (Univ. of Washington)

Demo sessions - Web & Data Integration

Room: Rhône 4 and Rhône 5

  • ANGIE: Active Knowledge for Interactive Exploration
    Nicoleta Preda (Max-Planck Institute), Fabian Suchanek (Max-Planck Institute), Gjergji Kasneci (Max-Planck Institute), Thomas Neumann (Max-Planck Institute), Maya Ramanath (Max-Planck Institute), Gerhard Weikum (Max-Planck Institute)
  • DEMo: Data Exchange Modeling Tool
    Reinhard Pichler (TU Wien), Vadim Savenkov (TU Wien)
  • Comparative evaluation of entity resolution approaches with FEVER
    Hanna Köpcke (Univ. of Leipzig), Andreas Thor (Univ. of Leipzig), Erhard Rahm (Univ. of Leipzig)
  • RankIE: Document Retrieval on Ranked Entity Graphs
    Falk Brauer (SAP Research), Wojciech Barczynski (SAP Research), Gregor Hackenbroich (SAP Research), Marcus Schramm (SAP Research), Adrian Mocan (SAP Research), Felix Förster (SAP AG)
  • Concise and Expressive Mappings with +Spicy
    Giansalvatore Mecca (Unviersità della Basilicata), Paolo Papotti (Università Roma Tre), Salvatore Raunich (Università della Basilicata), Marcello Buoncristiano (Università della Basilicata)
  • AgreementMaker: Efficient Matching for Large Real-World Schemas and Ontologies
    Isabel Cruz (Univ. of Illinois at Chicago), Flavio Palandri Antonelli (Univ. of Illinois at Chicago), Cosmin Stroe (Univ. of Illinois at Chicago)
  • Linkage Query Writer
    Renee Miller (Univ. of Toronto), Anastasios Kementsietsidis (IBM T. J. Watson Research Center), Lipyeow Lim (IBM T. J. Watson Research Center), Min Wang (IBM T. J. Watson Research Center)
  • SMDM: Enhancing Enterprise-Wide Master Data Management Using Semantic Web Technologies
    Xiaoyuan Wang (IBM China Research Lab), Xingzhi Sun (IBM China Research Lab), Feng Cao (IBM China Research Lab), Li Ma (IBM), Nick Kanellos (IBM Software Group), Kang Zhang (Shanghai Jiao Tong Univ.), Yue Pan (IBM China Research Lab), Yong Yu (Shanghai Jiao Tong Univ.)
  • IBM UFO Repository
    Michael Gubanov (Univ. of Washington), Lucian Popa (IBM Almaden Research Center), Howard Ho (IBM Almaden Research Center), Hamid Pirahesh (IBM Almaden Research Center), Jeng-Yih Chang (National Yang-Ming Univ.), Shr-Chang Chen (National Yang-Ming Univ.)
  • Mashup by Surfing a Web of Data APIs
    Huajun Chen (Zhejiang Univ.), Bin Lu (Zhejiang Univ.), Yuan Ni (Ibm), Guo tong Xie (Zhejiang Univ.), Chunyin Zhou (Zhejiang Univ.), Jinhua Mi (Zhejiang Univ.), Zhaohui Wu (Zhejiang Univ.)

Tutorial 4

Room: Auditorium Pasteur

  • Efficient Approximate Search on String Collections II
    Marios Hadjieleftheriou (AT&T Labs - Research), Chen Li (Univ. of California, Irvine)

Panel 2

Room: Auditorium Lumière

  • How Best to Build Web-Scale Data Managers? A Panel Discussion.
    Daniel J. Abadi (Yale), Michael J. Cafarella (U. of Washington), Joseph M. Hellerstein (U.C. Berkeley), Donald Kossmann (ETH Zürich), Samuel Madden (Massachusetts Institute of Technology). Moderator: Philip A. Bernstein (Microsoft)

Thursday 27, 2009

09:00-09:30

Awards and Business Session

Room: Auditorium Lumière

  • Proceedings of VLDB Endowment
    H.V. Jagadish (Univ. of Michigan)
  • Awards ceremony
    Susan Davidson (Univ. Pennsylvania), Johann-Christoph Freytag (Humboldt Univ. Berlin ), C. Mohan (IBM Almaden Research Center)
  • 10-year Best Paper Award: Database Architecture Optimized for the New Bottleneck: Memory Access
    Peter A. Boncz (CWI, Amsterdam), Stefan Manegold (CWI, Amsterdam), Martin L. Kersten (CWI, Amsterdam)
  • Best Paper Award: A Unified Approach to Ranking in Probabilistic Databases (presented in Session Probabilistic and Fuzzy Databases)
    Jian Li (Univ. of Maryland), Barna Saha (Univ. of Maryland), Amol Deshpande (Univ. of Maryland)
  • VLDB Journal certificates
    Gerhard Weikum (Max Plank Institute)

09:30-10:30

10-year Award Keynote

Chair: Tova Milo (Univ. Tel Aviv) -- Room: Auditorium Lumière

  • Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct
    Peter A. Boncz (CWI, Amsterdam), Stefan Manegold (CWI, Amsterdam), Martin L. Kersten (CWI, Amsterdam)

11:00-12:30

Research sessions - Data Mining II

Chair: Neoklis Polyzotis (UC Santa Cruz) -- Room: Rhône 1

  • GConnect: A Connectivity Index for Massive Disk-resident Graphs
    Charu Aggarwal (IBM T. J. Watson Research Ctr.), Yan Xie (Univ. of Illinois at Chicago), Philip Yu (Univ. of Illinois at Chicago)
  • A Shared Execution Strategy for Multiple Pattern Mining Requests over Streaming Data - Slides
    Di Yang (WPI), Elke Rundensteiner (WPI), Matthew Ward (WPI)
  • DistanceJoin: Pattern Match Query In a Large Graph Database
    Lei Zou (Huazhong Univ. of Science and Technology), Lei Chen (Honk-Kong Univ. of Science and Technology), M. Tamer Özsu (Univ. of Waterloo)

Research sessions - Novel/Advanced Applications

Chair: Val Tannen (Upenn) -- Room: Rhône 2

  • Creating Competitive Products - Slides
    Qian Wan (Hong Kong Univ. of Science and Technology), Raymond Chi-Wing Wong (Hong Kong Univ. of Science and Technology), Ihab Ilyas (Univ. of Waterloo), M. Tamer Özsu (Univ. of Waterloo), Yu Peng (Hong Kong Univ. of Science and Technology)
  • Data Processing on FPGAs - Slides
    Rene Mueller (ETH Zurich), Jens Teubner (ETH Zurich), Gustavo Alonso (ETH Zurich)
  • HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads - Slides
    Azza Abouzeid (Yale Univ.), Kamil Bajda-Pawlikowski (Yale Univ.), Daniel Abadi (Yale Univ.), Alexander Rasin (Brown Univ.), Avi Silberschatz (Yale Univ.)

Research sessions - Privacy II

Chair: Walid Aref (Purdue U., USA) -- Room: Rhône 3A

  • Anonymization of Set-Valued Data via Top-Down, Local Generalization - Slides
    Yeye He (Univ. of Wisconsin-Madison), Jeff Naughton (Univ. of Wisconsin-Madison)
  • K-Automorphism: A General Framework For Privacy Preserving Network Publication
    Lei Zou (Huazhong Univ. of Science and Technology), Lei Chen (Hong Kong Univ. of Science and Technology), M. Tamer Özsu (Univ. of Waterloo)
  • Distribution-based Microdata Anonymization - Slides
    Nick Koudas (Toronto), Divesh Srivastava (AT&T Labs-Research), Ting Yu (North Carolina State Univ.), Qing Zhang (Teradata)

Research sessions - Query Optimization

Chair: Torsten Grust (Universität Tübingen, Germany) -- Room: Rhône 3B

  • On Chase Termination Beyond Stratification - Slides
    Michael Meier (Univ. of Freiburg), Michael Schmidt (Univ. of Freiburg), Georg Lausen (Univ. of Freiburg)
  • Preventing Bad Plans by Bounding the Impact of Cardinality Estimation Errors - Slides
    Guido Moerkotte (Univ. of Mannheim), Thomas Neumann (MPI), Gabriele Steidl (Univ. of Mannheim)
  • Exact Cardinality Query Optimization for Optimizer Testing
    Surajit Chaudhuri (Microsoft Research), Vivek Narasayya (Microsoft Research), Ravi Ramamurthy (Microsoft Research)

Industrial sessions - Business Data Management

Chair: Klemens Böhm (Universität Karlsruhe (TH)) -- Room: Auditorium Lumière

  • Robust Distributed Top-N Frequent Pattern Mining Using the SAP BW Accelerator
    Thomas Legler (SAP), Wolfgang Lehner (TU Dresden), Jan Schaffner (Hasso-Plattner-Institute), Jens Krueger (Hasso-Plattner-Institut)
  • 1,000 Tables Inside the From
    Nicolas Dieu (SAP), Adrian Dragusanu (SAP), Francoise Fabret (SAP), Francois Llirbat (SAP), Eric Simon (SAP)
  • Efficient Index Compression in DB2 LUW
    Bishwaranjan Bhattacharjee (IBM T.J. Watson Research Center), Lipyeow Lim (IBM T. J. Watson Research Center), Timothy Malkemus (IBM T.J.Watson Research Center), George Mihaila (IBM T.J.Watson Research Center), Ken Ross (Columbia Univ.), Sherman Lau (IBM Toronto Lab), Cathy McCarthur (IBM Toronto Lab), Zoltan Toth (IBM Toronto Labs), Reza Sherkat (Univ. of Alberta)

14:00-15:30

Research sessions - Meta Data Management

Chair: Isabel Cruz (University of Illinois at Chicago) -- Room: Rhône 1

  • Laconic Schema Mappings: Computing the Core with SQL Queries - Slides
    Balder Ten Cate (INRIA and ENS Cachan), Laura Chiticariu (IBM Almaden Research Center), Phokion Kolaitis (UC Santa Cruz), Wang-Chiew Tan (UC Santa Cruz)
  • Inverting Schema Mappings: Bridging the Gap between Theory and Practice
    Marcelo Arenas (PUC Chile), Jorge Perez (PUC Chile), Juan Reutter (PUC Chile), Cristian Riveros (R&M Tech)
  • Full-Fidelity Flexible Object-Oriented XML Access - Slides
    James Terwilliger (Microsoft Research), Philip Bernstein (Microsoft Research), Sergey Melnik (Google Inc.)

Research sessions - Database Services and Preferences

Chair: Christopher Re (University of Wisconsin--Madison) -- Room: Rhône 2

  • Privacy-Aware Mobile Services over Road Networks
    Ting Wang (Georgia Institute of Technology), Ling Liu (Georgia Institute of Technology)
  • A Fair Assignment Algorithm for Multiple Preference Queries
    Leong Hou U (Univ. of Hong Kong), Nikos Mamoulis (Univ. of Hong Kong), Kyriakos Mouratidis (Singapore Management Univ.)
  • Pangea: An Eager Database Replication Middleware guaranteeing Snapshot Isolation without Modification of Database Servers - Slides
    Takeshi Mishima (Univ. of Tokyo), Hiroshi Nakamura (Univ. of Tokyo)

Research sessions - Data Integration II

Chair: Vasilis Vassalos (AUEB) -- Room: Rhône 3B

  • Harvesting Relational Tables from Lists on the Web - Slides
    Hazem Elmeleegy (Purdue Univ.), Jayant Madhavan (Google Inc.), Alon Halevy (Google Inc.)
  • Data Integration for the Relational Web - Slides
    Michael Cafarella (Univ. of Washington), Alon Halevy (Google Inc.), Nodira Khoussainova (Univ. of Washington)
  • Normalization and Optimization of Schema Mappings - Slides
    Reinhard Pichler (TU Wien), Georg Gottlob (Univ. of Oxford), Vadim Savenkov (TU Wien)

Research sessions - Nearest-Neighbor Processing

Chair: Xiaofang Zhou (U. of Queensland, Australia) -- Room: Rhône 3A

  • Continuous Monitoring of Nearest Neighbors on Land Surface - Slides
    Songhua Xing (Univ. of Southern California), Cyrus Shahabi (Univ. of Southern California), Bei Pan (Univ. of Southern California)
  • Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor - Slides
    Raymond Chi-Wing Wong (Hong Kong Univ. of Science and Technology), M. Tamer Özsu (Univ. of Waterloo), Philip Yu (Univ. of Illinois at Chicago), Ada Fu (Chinese Univ. of Hong Kong), Lian Liu ((Hong Kong Univ. of Science and Technology)
  • Lazy Updates: An Efficient Technique to Continuously Monitoring Reverse kNN
    Muhammad Cheema (UNSW), Xuemin Lin (UNSW & NICTA), Ying Zhang (UNSW), Wei Wang (UNSW), Wenjie Zhang (UNSW & NICTA)

Industrial sessions - Experiences and Lessons

Chair: Raghunath Nambiar (Hewlett-Packard) -- Room: Auditorium Lumière

  • Storing Scientific Workflows in a Database
    Zoé Lacroix (TGen), Christophe Legendre (ASU), Spyro Mousses (TGen)
  • MAD Skills: New Analysis Practices for Big Data
    Jeffrey Cohen (Greenplum), Brian Dolan (Fox Audience Network), Mark Dunlap (Evergreen Technologies), Joseph Hellerstein (UC Berkeley), Caleb Welton (Greenplum)
  • DBLP - Some Lessons Learned
    Michael Ley (Univ. of Trier)

Tutorial 5

Room: Rhône 4 and 5

  • Information Theory For Data Management I
    Divesh Srivastava (AT&T Labs-Research), Suresh Venkatasubramanian (Univ. of Utah)

Tutorial 6

Room: Auditorium Pasteur

  • Column oriented Database Systems I
    Daniel J. Abadi (Yale), Peter A. Boncz (CWI, Amsterdam), Stavros Harizopoulos (HP Labs)

16:00-17:30

Research sessions - Mining and Privacy

Chair: Bogdan Cautis (Telecom ParisTech) -- Room: Rhône 1

  • NEAR-Miner: Mining Evolution Associations of Web Site Directories for Efficient Maintenance of Web Archives
    Ling Chen (L3S, Univ. of Hannover), Sourav Bhowmick (NTU, Singapore), Wolfgang Nejdl (L3S Research Center)
  • An Audit Environment for Outsourcing of Frequent Itemset Mining
    Wai Kit Wong (Univ. of Hong Kong), David Wai Lok Cheung (Univ. of Hong Kong), Edward Hung (Hong Kong Polytechnic Univ.), Ben Kao (Univ. of Hong Kong), Nikos Mamoulis (Univ. of Hong Kong)
  • Publishing Naive Bayesian Classifiers: Privacy without Accuracy Loss
    Barzan Mozafari (UC Los Angeles), Carlo Zaniolo (UC Los Angeles)

Research sessions - Spatial Query Processing

Chair: Raymond Chi-Wing Wong (the Hong Kong University of Science and Technology) -- Room: Rhône 3B

  • Workload-Aware Indexing of Continuously Moving Objects - Slides
    Kostas Tzoumas (Aalborg Univ.), Man Lung Yiu (Aalborg Univ.), Christian S. Jensen (Aalborg Univ. and Google Inc.)
  • Effectively Indexing Uncertain Moving Objects for Predictive Queries - Slides
    Meihui Zhang (National Univ. of Singapore), Su Chen (National Univ. of Singapore), Christian S. Jensen (Aalborg Univ. and Google Inc.), Beng Chin Ooi (National Univ. of Singapore), Zhenjie Zhang (National Univ. of Singapore)
  • Path Oracles for Spatial Networks
    Jagan Sankaranarayanan (Univ. of Maryland), Hanan Samet (Univ. of Maryland), Houman Alborzi (Univ. of Maryland)

Research sessions - Index Interactions and Database Manageability

Chair: Alan Fekete (University of Sydney, Australia) -- Room: Rhône 3A

  • Correlation Maps: A Compressed Access Method for Exploiting Soft Functional Dependencies - Slides
    Hideaki Kimura (Brown Univ.), George Huo (Google, Inc), Alexander Rasin (Brown Univ.), Samuel Madden (Massachusetts Institute of Technology), Stan Zdonik (Brown Univ.)
  • Index Interactions in Physical Design Tuning: Modeling, Analysis, and Applications - Slides
    Karl Schnaitter (UC Santa Cruz), Neoklis Polyzotis (UC Santa Cruz), Lise Getoor (Univ. of Maryland, College Park)
  • Tuning Database Configuration Parameters with iTuned - Slides
    Songyun Duan (Duke Univ.), Vamsidhar Thummala (Duke Univ.), Shivnath Babu (Duke Univ.)

Research sessions - Experiments

Chair: Stefan Manegold (CWI, Amsterdam) -- Room: Rhône 2

  • An Evaluation of Checkpoint Recovery for Massively Multiplayer Online Games - Slides
    Marcos Vaz Salles (Cornell Univ.), Tuan Cao (Cornell Univ.), Benjamin Sowell (Cornell Univ.), Alan Demers (Cornell Univ.), Johannes Gehrke (Cornell Univ.), Christoph Koch (Cornell Univ.), Walker White (Cornell Univ.)
  • Evaluating Clustering in Subspace Projections of High Dimensional Data - Slides
    Emmanuel Müller (RWTH Aachen Univ.), Stephan Günnemann (RWTH Aachen Univ.), Ira Assent (Aalborg Univ.), Thomas Seidl (RWTH Aachen Univ.)
  • Framework for Evaluating Clustering Algorithms in Duplicate Detection - Slides
    Oktie Hassanzadeh (Univ. of Toronto), Fei Chiang (Univ. of Toronto), Renée Miller (Univ. of Toronto), Hyun Chul Lee (Thoora)

Industrial sessions - Non-Traditional Data Management

Chair: Paolo Papotti (Università Roma Tre) -- Room: Auditorium Lumière

  • Oracle SecureFiles: Prepared for the Digital Deluge - Slides
    Niloy Mukherjee (Oracle), Amit Ganesh (Oracle), V Djegaradjane (Oracle), Sujatha Muthulingam (Oracle), Wei Zhang (Oracle), Scott Lynn (Oracle), Krishna Kunchithapadam (Oracle), Bharath Aleti (Oracle), Kam Shergill (Oracle), Shaoyu Wang (Oracle)
  • Scalable Web Data Extraction for Online Market Intelligence
    Robert Baumgartner (Lixto Software GmbH), Georg Gottlob (Oxford Univ.), Marcus Herzog (Lixto Software GmbH)
  • Kosmix: Exploring the Deep Web using Taxonomies and Categorization
    Anand Rajaraman (Kosmix)

Tutorial 5

Room: Rhône 4 and 5

  • Information Theory For Data Management II
    Divesh Srivastava (AT&T Labs-Research), Suresh Venkatasubramanian (Univ. of Utah)

Tutorial 6

Room: Auditorium Pasteur

  • Column oriented Database Systems II
    Daniel J. Abadi (Yale), Peter A. Boncz (CWI, Amsterdam), Stavros Harizopoulos (HP Labs)