Programme

Monday, 25th August 2008

8:30 - 9:00

Opening Ceremony

9:00 - 10:15

Keynote

  • Is Transactional Memory an Oxymoron?

    Mark D. Hill (University of Wisconsin-Madison)

10:45 - 12:30

Research Session 1 Systems A

Session Chair: Ken Ross
  • Constrained Physical Design Tuning

    Nicolas Bruno (Microsoft Research, USA), Surajit Chaudhuri (Microsoft Research, USA).
  • Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases

    Dieter Van de Craen (Hasselt University), Frank Neven (Hasselt University), Anastasios Kementsietsidis (IBM T.J. Watson Research Center), Stijn Vansummeren (Hasselt University).
  • Clustera: An Integrated Computation and Data Management System

    David DeWitt (UW - Madison), Eric Robinson (UW - Madison), Srinath Shankar (UW - Madison), Erik Paulson (UW - Madison), Jeffrey Naughton (UW - Madison), Andrew Krioukov (UW - Madison), Joshua Royalty (UW - Madison).
  • Performance Profiling with EndoScope, an Acquisitional Software Monitoring Framework

    Alvin Cheung (MIT CSAIL, USA), Samuel Madden (MIT CSAIL, USA).

Research Session 2 Mining A

Session Chair: Phil Gibbons
  • Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries (New Date and Time!)

    Dominik Slezak (Infobright), Jakub Wroblewski (Infobright), Victoria Eastwood (Infobright), Piotr Synak (Infobright).
  • Plan-based Complex Event Detection across Distributed Sources

    Mert Akdere (Brown University), Ugur Cetintemel (Brown University), Nesime Tatbul (ETH Zurich).
  • Finding Relevant Patterns in Bursty Sequences

    Alexander Lachmann (Cornell University), Mirek Riedewald (Cornell University).
  • Constrained Locally Weighted Clustering

    Hao Cheng (University of Central Florida), Kien Hua (University of Central Florida), Khanh Vu (University of Central Florida).

Research Session 3 Privacy & Authentication

Session Chair: N.N.
  • Resisting Structural Re-identification in Anonymized Social Networks

    Michael Hay (University of Massachusetts Amherst), Gerome Miklau (University of Massachusetts Amherst), David Jensen (University of Massachusetts Amherst), Don Towsley (University of Massachusetts Amherst), Philipp Weis (University of Massachusetts Amherst).
  • Privacy-preserving Anonymization of Set-valued Data

    Manolis Terrovitis (University of Hong Kong), Nikos Mamoulis (University of Hong Kong), Panos Kalnis (National University of Singapore).
  • Authenticating Query Results for Text Search Engines

    HweeHwa Pang (Singapore Management University), Kyriakos Mouratidis (Singapore Management University).
  • Structural Signatures for Tree Data Structures

    Ashish Kundu (Purdue University, USA), Elisa Bertino (Purdue University, USA).

Tutorial Session 1 Business Process

Session Chair: Dinesh Das
  • Querying and Monitoring Distributed Business Processes

    Tova Milo (Tel Aviv University, Israel), Daniel Deutch (Tel Aviv University, Israel).

Demo Group 1 XML

  • eXtract: A Snippet Generation System for XML Search

    Yu Huang, Ziyang Liu, Yi Chen.
  • Language-Integrated Querying of XML Data in SQL Server

    James Terwilliger, Sergey Melnik, Philip Bernstein.
  • XTCcmp: XQuery Compilation on XTC

    Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen.
  • Periscope/GQ: A Graph Querying Toolkit

    Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler.
  • SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data

    Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena.
  • Process Spaceship: Process Views Discovery and Exploration

    Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul.

14:00 - 15:15

Research Session 4 Web

Session Chair: Jens Dittrich
  • Maintaining Dynamic Channel Profiles on the Web

    Haggai Roitman (IBM), David Carmel (IBM-Haifa Research Lab), Elad Yom-Tov (IBM-Haifa Research Lab).
  • WYSIWYG Development of Data Driven Web Applications

    Fan Yang (Yahoo), Chavdar Botev (Cornell University), Nitin Gupta (Cornell University), Elizabeth Churchill (Yahoo! Research), George Levchenko (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research).
  • Web Page Language Identification Based on URLs

    Eda Baykan (EPF Lausanne), Monika Henzinger (EPF Lausanne), Ingmar Weber (EPF Lausanne).

Research Session 5 Query Optimization

Session Chair: Shivnath Babu
  • Parallelizing Query Optimization

    Wook-Shin Han (Kyungpook National University), Wooseong Kwak (Kyungpook National University), Jinsoo Lee (Kyungpook National University), Guy Lohman (IBM Research Almaden, USA), Volker Markl (IBM Research Almaden, USA).
  • Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries

    Marios Hadjieleftheriou (AT&T Labs Inc.), Xiaohui Yu (York University), Nick Koudas (U of Toronto), Divesh Srivastava (AT&T, USA).
  • Tighter Estimation using Bottom-k Sketches

    Edith Cohen (AT&T, USA), Haim Kaplan (Tel Aviv University).

Research Session 6 Schema A

Session Chair: Peter Buneman
  • STBenchmark: Towards a Benchmark for Mapping Systems

    Bogdan Alexe (UC Santa Cruz), Wang-Chiew Tan (UC Santa Cruz), Yannis Velegrakis (University of Trento).
  • Interactive Source Registration in Community-oriented Information Integration

    Yannis Katsis (UC San Diego), Alin Deutsch (UC San Diego), Yannis Papakonstantinou (UC San Diego).
  • Data Exchange with Data-Metadata Translations

    Mauricio Hernandez (IBM Almaden Research Center), Paolo Papotti (Universita Roma Tre), Wang-Chiew Tan (UC Santa Cruz).

Tutorial Session 2 Dataspaces

Session Chair: Xiaofang Zhou
  • Dataspaces

    Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA).

Demo Group 2 P2P

  • P3N: Profiling the Potential of a Peer-based Data Management System

    Mihai Lupu, Y. C. Tay.
  • P2P Logging and Timestamping for Reconciliation

    Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia.
  • AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network

    Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer.
  • WebContent: Efficient P2P Warehousing of Web Data

    Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, Mohamed Ouazara, Aditya Somani, Nicolas Travers, Gabriel Vasile, Spyros Zoupanos.
  • DObjects: Enabling Distributed Data Services for Metacomputing Platforms

    Pawel Jurczyk, Li Xiong.
  • EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution

    Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis.

15:45 - 17:00

Industry Session 7 Web

Session Chair: Meichun Hsu
  • SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting

    Ahmed Metwally (UCSB), Fatih Emekci (UCSB), Divyakant Agrawal (UCSB), Amr El Abbadi (UCSB).
  • Energy Cost, The Key Challenge of Today's Data Centers: A Power Consumption Analysis of TPC-C Results

    Meikel Poess (Oracle USA), Raghunath Othayoth Nambiar (Hewlett-Packard).
  • Google's Deep-Web Crawl

    Jayant Madhavan (Google), David Ko (Google), Lucja Kot (Cornell University), Vignesh Ganapathy (Google), Alex Rasmussen (University of California - San Diego), Alon Halevy (Google).

Research Session 8 Stream Processing

Session Chair: Zack Ives
  • Out-of-Order Processing: A New Architecture for High-Performance Stream Systems

    Jin Li (Portland State University), Kristin Tufte (Portland State University), Vladislav Shkapenyuk (AT&T Labs - Research), Vassilis Papadimos (Portland State University), Theodore Johnson (AT&T Labs - Research), David Maier (Portland State University).
  • StreamTX: Extracting Tuples from Streaming XML Data

    Wook-Shin Han (Kyungpook National University), Haifeng Jiang (Google), Howard Ho (IBM Almaden Research Center), Quanzhong Li (IBM).
  • Sliding-Window Top-k Queries on Uncertain Streams

    Cheqing Jin (ECUST), Ke Yi (Hong Kong University of Science and Technology), Lei Chen (Hong Kong University of Science and Technology), Jeffrey Xu Yu (Chin. U. HK), Xuemin Lin (UNSW).

Research Session 9 Query Processing in Uncertain Databases

Session Chair: Reynold Cheng
  • Conditioning Probabilistic Databases

    Christoph Koch (Cornell University), Dan Olteanu (Oxford University).
  • Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases

    George Beskales (University of Waterloo), Mohamed Soliman (University of Waterloo), Ihab Francis Ilyas (University of Waterloo).
  • BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models

    Daisy Zhe Wang (UC Berkeley), Eirinaios Michelakis (UC Berkeley), Minos Garofalakis (Yahoo Research, USA), Joseph Hellerstein (UC Berkeley).

Tutorial Session 3

Session Chair: Yannis Velegrakis
  • Ontologies and Databases: Myths and Challenges

    Enrico Franconi (Free University of Bozen-Bolzano, Italy).

Demo Group 3 Web, Textual data

  • AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications

    Cristian Duda, Gianni Frey, Donald Kossmann, Chong Zhou.
  • ManyAspects: A System for Highlighting Diverse Concepts in Documents

    Kun Liu, Evimaria Terzi, Tyrone Grandison.
  • Large-Scale Collaborative Analysis and Extraction of Web Data

    Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke.
  • An Effective and Versatile Keyword Search Engine on Heterogenous Data

    Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou.
  • DBPubs: Multidimensional Exploration of Database Publications

    Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham.
  • Semandaq: A Data Quality System Based on Conditional Functional Dependencies

    Wenfei Fan, Floris Geerts, Xibei Jia.

Tuesday, 26th August 2008

9:00 - 10:15

Keynote

  • Databases and the Silification of Health

    Justin Zobel (NICTA, University of Melbourne).

10:45 - 12:30

Session 10 Experiments & Analyses

Session Chair: Volker Markl
  • Finding Frequent Items in Data Streams

    Graham Cormode (AT&T Labs, USA), Marios Hadjieleftheriou (AT&T Labs, USA).
  • Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures

    Hui Ding (Northwestern University), Goce Trajcevski (Northwestern University), Peter Scheuermann (Northwestern University), Xiaoyue Wang (University of California, Riverside), Eamonn Keogh (University of California, Riverside).
  • Column-Store Support for RDF Data Management: Not All Swans are White

    Lefteris Sidirourgos (CWI, Amsterdam, The Netherlands), Romulo Goncalves (CWI, Amsterdam, The Netherlands), Martin Kersten (CWI, Amsterdam, The Netherlands), Niels Nes (CWI, Amsterdam, The Netherlands), Stefan Manegold (CWI, Amsterdam, The Netherlands).
  • Prefix based numbering schemes for XML: Techniques, Applications and Performances

    Virginie Sans (ETIS - CNRS ENSEA Univ Cergy-Pontoise), Dominique Laurent (ETIS - CNRS ENSEA Univ Cergy-Pontoise).
  • A Benchmark for Evaluating Moving Objects Indexes

    Su Chen (National University of Singapore), Dan Lin (Purdue University), Christian Jensen (Aalborg University, Denmark).
  • Dwarfs in the Rearview Mirror: How Big are they Really?

    Jens Dittrich (ETH Zurich), Lukas Blunschi (ETH Zurich), Marcos Antonio Vaz Salles (ETH Zurich).

Research Session 11 Theory

Session Chair: Alin Deutsch
  • Type Inference and Type Checking for Queries on Execution Traces

    Daniel Deutch (Tel Aviv University), Tova Milo (Tel Aviv University).
  • Taming Verification Hardness: An Efficient Algorithm For Testing Subgraph Isomorphism

    Haichuan Shang (UNSW), Ying Zhang (UNSW), Xuemin Lin (UNSW), Jeffrey Xu Yu (Chin. U. HK).
  • On Generating Near-Optimal Tableaux for Conditional Functional Dependencies

    Lukasz Golab (AT&T Labs - Research), Howard Karloff (AT&T Labs - Research), Flip Korn (AT&T Labs - Research), Divesh Srivastava (AT&T Labs - Research), Bei Yu (Singapore-MIT Alliance (SMA), Singapore).
  • Propagating Functional Dependencies with Conditions

    Wenfei Fan (University of Edinburgh, UK), Shuai Ma (University of Edinburgh, UK), Yanli Hu (University of Edinburgh, UK), Jie Liu (Chinese Academy of Sciences, China), Yinghui Wu (University of Edinburgh, UK).

Research Session 12 Web Rank & PubSub

Session Chair: Alexandros Labrinidis
  • Simrank++: Query Rewriting through Link Analysis of the Click Graph

    Ioannis Antonellis (Stanford University), Hector Garcia-Molina (Stanford University), Chi-Chao Chang (Yahoo!).
  • Accuracy Estimate and Optimization Techniques for SimRank Computation

    Dmitry Lizorkin (ISP RAS), Pavel Velikhov (ISP RAS), Maxim Grinev (ISP RAS), Denis Turdakov (ISP RAS).
  • End-to-End Support for Joins in Large-Scale Publish/Subscribe Systems

    Badrish Chandramouli (Duke University), Jun Yang (Duke University).
  • Scalable Ranked Publish/Subscribe

    Ashwin Machanavajjhala (Cornell University), Erik Vee (Yahoo! Research), Minos Garofalakis (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research).

Tutorial Session 4 Probabilistic Data Management

Session Chair: Paolo Papotti
  • Systems Aspects of Probabilistic Data Management

    Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA).

Demo Group 4 Data integration, collaboration

  • RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration

    Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao.
  • Comparing and Evaluating Mapping Systems with STMark

    Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis.
  • Ad-Hoc Data Processing in the Cloud

    Dionysios Logothetis, Kenneth Yocum.
  • XTreeNet: Democratic Community Search

    Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis.
  • Making SENSE: Socially Enhanced Search and Exploration

    Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum.
  • AuditGuard: A system for database auditing under retention restrictions

    Wentian Lu, Gerome Miklau.

14:00 - 15:15

Industry Session 13 Massive Data

Session Chair: Neoklis Polyzotis
  • Industry-Scale Duplicate Detection

    Melanie Weis (Hasso-Plattner-Institut), Felix Naumann (Hasso-Plattner-Institut), Ulrich Jehle (Schufa), Holger Schuster (Schufa), Jens Lufter (Schufa).
  • SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets

    Ronnie Chaiken (Microsoft), Bob Jenkins (Microsoft), Paul Larson (Microsoft Research, USA), Bill Ramsey (Microsoft), Darren Shakib (Microsoft), Simon Weaver (Microsoft), Jingren Zhou (Microsoft Research, USA).
  • PNUTS: Yahoo!'s Hosted Data Serving Platform

    Brian Cooper (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Adam Silberstein (Yahoo! Research), Phil Bohannon (Yahoo!), Hans-Arno Jacobsen (Yahoo! Research and University of Toronto), Nick Puz (Yahoo! Research), Daniel Weaver (Yahoo! Research), Ramana Yerneni (Yahoo! Research).

Research Session 14 XML Databases

Session Chair: Yi Chen
  • Dependable Cardinality Forecasts for XQuery

    Jens Teubner (IBM T.J. Watson Research Center), Torsten Grust (Technische Universitat Munchen), Sebastian Maneth (NICTA), Sherif Sakr (NICTA).
  • Hash-based Subgraph Query Processing Method for Graph-structured XML Documents

    Hongzhi Wang (Harbin Institute of Technology), Jianzhong Li (Harbin Institute of Technology), Jizhou Luo (Harbin Institute of Technology), Hong Gao (Harbin Institute of Technology).
  • Generating XML Structure Using Examples and Constraints

    Sara Cohen (Hebrew University of Jerusalem).

Research Session 15 DB Performance & Evaluation

Session Chair: Nick Koudas
  • Read-Optimized Databases, In Depth

    Allison Holloway (University of Wisconsin), David DeWitt (University of Wisconsin).
  • Flashing Up The Storage Layer

    Ioannis Koltsidas (University of Edinburgh), Stratis Viglas (University of Edinburgh).
  • Rose: Compressed, Log-Structured Replication

    Russell Sears (UC Berkeley), Mark Callaghan (Google), Eric Brewer (UC Berkeley).

Tutorial Session 5 Dataspaces

Session Chair: Xiaofang Zhou
  • Dataspaces

    Michael Franklin (University of California, Berkeley, USA), Alon Halevy (Google), David Maier (Portland State University, USA).

Demo Group 5 Tuning, systems, optimization, etc

  • QueryScope: Visualizing Queries for Repeatable Database Tuning

    Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang.
  • When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?

    Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler.
  • H-Store: A High-Performance, Distributed Main Memory Transaction Processing System

    Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jones, Samuel Madden, Michael Stonebraker, Daniel Abadi.
  • Organizing and Indexing Non-Convex Regions

    Eric Perlman, Randal Burns, Michael Kazhdan.
  • Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View

    Eric Paquet, Herna Viktor.
  • C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases

    Fan Guo, Lei Li, Eric Xing, Christos Faloutsos.

15:45 - 17:00

Industry Session 16 Storage & Sorting

Session Chair: Brian Cooper
  • Relational Support for Flexible Schema Scenarios

    Srini Acharya (Microsoft Corp.), Peter Carlin (Microsoft Corp.), Cesar Galindo-Legaria (Microsoft Corp.), Krzysztof Kozielczyk (Microsoft Corp.), Pawel Terlecki (Microsoft Corp.), Peter Zabback (Microsoft Corp.).
  • Oracle Securefiles System

    Niloy Mukherjee (Oracle), Bharath Aleti (Oracle), Amit Ganesh (Oracle), Krishna Kunchithapadam (Oracle), Scott Lynn (Oracle), Sujatha Muthulingam (Oracle), Kam Shergill (Oracle), Shaoyu Wang (Oracle), Wei Zhang (Oracle).
  • Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture

    Jatin Chhugani (Intel Corporation), Skip Macy (Intel Corporation), Akram Baransi (Intel Corporation), Anthony Nguyen (Intel Corporation), Mostafa Hagog (Intel Corporation), Sanjeev Kumar (Intel Corporation), Victor Lee (Intel Corporation), Yen-Kuang Chen (Intel Corporation), Pradeep Dubey (Intel Corporation).

Research Session 17 Web Queries

Session Chair: Nicolas Bruno
  • WebTables: Exploring the Power of Tables on the Web

    Michael Cafarella (University of Washington), Alon Halevy (Google, Inc.), Daisy Zhe Wang (UC Berkeley), Eugene Wu (MIT), Yang Zhang (MIT).
  • Scalable Query Result Caching for Web Applications

    Charles Garrod (Carnegie Mellon University), Amit Manjhi (Google), Bruce Maggs (Carnegie Mellon University), Todd Mowry (Carnegie Mellon University), Anthony Tomasic (Carnegie Mellon University), Christopher Olston (Yahoo! Research), Anastasia Ailamaki (Carnegie Mellon University).
  • Optimization of Multi-Domain Queries on the Web

    Daniele Braga (Politecnico di Milano), Stefano Ceri (Politecnico di Milano), Florian Daniel (Politecnico di Milano), Davide Martinenghi (Politecnico di Milano).

Research Session 18 Distributed Systems Processing

Session Chair: Paul Larson
  • Fault-tolerant Stream Processing using a Distributed, Replicated File System

    YongChul Kwon (University of Washington), Magdalena Balazinska (University of Washington), Albert Greenberg (Microsoft Research).
  • LEEWAVE: Level-Wise Distribution of Wavelet Coefficients for Processing kNN Queries over Distributed Streams

    Mi-Yen Yeh (National Taiwan University), Kun-Lung Wu (IBM T. J. Watson Research Center), Philip Yu (University of Illinois at Chicago), Ming-Syan Chen (National Taiwan University).
  • A Practical Scalable Distributed B-Tree

    Marcos Aguilera (HP Labs), Wojciech Golab (University of Toronto), Mehul Shah (HP Labs).

Tutorial Session 6 Data Cleaning

Session Chair: Anastasios Kementsietsidis
  • A Revival of Integrity Constraints for Data Cleaning

    Wenfei Fan (University of Edinburgh, UK and Bell Labs, USA), Floris Geerts (University of Edinburgh, UK).

Demo Group 1 XML

  • Xnippet: Generating Query Biased Result Snippet for XML Search

    Yu Huang, Ziyang Liu, Yi Chen.
  • Language-Integrated Querying of XML Data in SQL Server

    James Terwilliger, Sergey Melnik, Philip Bernstein.
  • XTCcmp: XQuery Compilation on XTC

    Christian Mathis, Andreas Weiner, Theo Harder, Caesar Ralf Franz Hoppen.
  • Periscope/GQ: A Graph Querying Toolkit

    Yuanyuan Tian, Jignesh Patel, Viji Nair, Sebastian Martini, Matthias Kretzler.
  • SEDA: A System for Search, Exploration, Discovery, and Analysis of XML Data

    Andrey Balmin, Latha Colby, Emiran Curtmola, Quanzhong Li, Fatma Ozcan, Sharath Srinivas, Zografoula Vagena.
  • Process Spaceship: Process Views Discovery and Exploration

    Hamid Reza Motahari Nezhad, Boualem Benatallah, Fabio Casati, Periklis Andritsos, Regis Saint-Paul.

Wednesday, 27th August 2008

9:00 - 10:15

10 Year Best Paper Award Session

  • A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces

    Roger Weber, Hans-Jorg Schek, Stephen Blott.

10:45 - 12:30

Research Session 19 System Centric Optimization

Session Chair: Timos Sellis
  • Main-Memory Scan Sharing For Multi-Core CPUs

    Lin Qiao (IBM Almaden Research Lab), Vijayshankar Raman (IBM Almaden Research Lab), Frederick Reiss (IBM Almaden Research Lab), Peter Haas (IBM Almaden Research Lab), Guy Lohman (IBM Almaden Research Lab).
  • Row-wise Parallel Predicate Evaluation

    Ryan Johnson (Carnegie Mellon University), Vijayshankar Raman (IBM Almaden Research Lab), Richard Sidle (IBM Almaden Research Lab), Garret Swart (Oracle).
  • Dynamic Partitioning of the Cache Hierarchy in Shared Data Centers

    Gokul Soundararajan (University of Toronto), Jin Chen (University of Toronto), Mohamed Sharaf (University of Toronto), Cristiana Amza (University of Toronto).
  • RDF-3X: a RISC-style Engine for RDF

    Thomas Neumann (Max-Planck-Institut Informatik), Gerhard Weikum (MPI).

Research Session 20 IR & Forms

Session Chair: Justin Zobel
  • Multidimensional Content eXploration

    Alkis Simitsis (IBM Research Almaden, USA), Akanksha Baid (University of Wisconsin-Madison), Yannis Sismanis (IBM Research Almaden, USA), Berthold Reinwald (IBM Research Almaden, USA).
  • Relaxation in Text Search using Taxonomies

    Marcus Fontoura (Yahoo! Research), Vanja Josifovski (Yahoo! Research), Ravi Kumar (Yahoo! Research), Christopher Olston (Yahoo! Research), Sergei Vassilvitskii (Yahoo! Research), Andrew Tomkins (Yahoo! Research).
  • Learning to Extract Form Labels

    Hoa Nguyen (University of Utah), Thanh Nguyen (University of Utah), Juliana Freire (University of Utah).
  • Automated Creation of a Forms-based Database Query Interface

    Magesh Jayapandian (University of Michigan), H V Jagadish (University of Michigan).

Research Session 21 New Topics

Session Chair: Xiaofang Zhou
  • Efficient Network-Aware Search in Collaborative Tagging Sites

    Michael Benedikt (Oxford University), Sihem Amer Yahia (Yahoo Research, USA), Laks Lakshmanan (University of British Columbia), Julia Stoyanovich (Columbia University).
  • Cleaning Uncertain Data with Quality Guarantees

    Reynold Cheng (Hong Kong Polytechnic University, China), Jinchuan Chen (Hong Kong Polytechnic University, China), Xike Xie (Hong Kong Polytechnic University, China).
  • On the Provenance of Non-Answers to Queries over Extracted Data

    Jiansheng Huang (Univ. of Wisconsin-Madison), Ting Chen (Univ. of Wisconsin-Madison), AnHai Doan (Univ. of Wisconsin-Madison), Jeffrey Naughton (Univ. of Wisconsin-Madison).
  • Dynamic Active Probing of Helpdesk Databases

    Shenghuo Zhu (NEC Lab), Tao Li (Florida International University), Zhiyuan Chen (UMBC), Dingding Wang (Florida International University), Yihong Gong (NEC Lab).

Tutorial Session 7 XML Structural Summaries

Session Chair: Marios Hadjieleftheriou
  • XML Structural Summaries

    Mirella M. Moro (Univ. Fed. Rio Grande do Sul, Brazil), Zografoula Vagena (Microsoft Research, UK), Vassilis J. Tsotras (University of California Riverside, USA).

Demo Group 2 P2P

  • P3N: Profiling the Potential of a Peer-based Data Management System

    Mihai Lupu, Y. C. Tay.
  • P2P Logging and Timestamping for Reconciliation

    Mounir Tlili, Kokou Dedzoe, Esther Pacitti, Patrick Valduriez, Reza Akbarinia.
  • AlvisP2P: Scalable Peer-to-Peer Text Retrieval in a Structured P2P Network

    Toan Luu, Gleb Skobeltsyn, Fabius Klemm, Maroje Puh, Ivana Podnar Zarko, Martin Rajman, Karl Aberer.
  • WebContent: Efficient P2P Warehousing of Web Data

    Serge Abiteboul, Tristan Allard, Philippe Chatalic, Georges Gardarin, Anca Ghitescu, Francois Goasdoue, Ioana Manolescu, Benjamin Nguyen, Mohamed Ouazara, Aditya Somani, Nicolas Travers, Gabriel Vasile, Spyros Zoupanos.
  • DObjects: Enabling Distributed Data Services for Metacomputing Platforms

    Pawel Jurczyk, Li Xiong.
  • EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution

    Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis.

14:00 - 15:15

Industry Session 22 Query Optimization

Session Chair: N.N.
  • Efficiently Approximating Query Optimizer Plan Diagrams

    Atreyee Dey (Indian Institute of Science), Sourjya Bhaumik (Indian Institute of Science), Harish D (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science).
  • Mining Search Engine Query Logs via Suggestion Sampling (New Date and Time!)

    Maxim Gurevich (Technion), Ziv Bar-Yossef (Google and Technion).
  • Optimizer Plan Change Management: Improved Stability and Performance in Oracle 11g

    Mohamed Ziauddin (Oracle), Dinesh Das (Oracle), Hong Su (Oracle), Yali Zhu (Oracle), Khaled Yagoub (Oracle).

Research Session 23 Schema B

Session Chair: Ralf Schenkel
  • Graceful Database Schema Evolution: the PRISM Workbench

    Carlo Curino (Politecnico di Milano), Hyun Moon (UCLA), Carlo Zaniolo (UCLA).
  • Analyzing and Revising Data Integration Schemas to Improve Their Matchability

    Xiaoyong Chai (University of Wisconsin-Madison), Mayssam Sayyadian (University of Wisconsin-Madison), AnHai Doan (University of Wisconsin-Madison), Arnon Rosenthal (The MITRE Corporation), Len Seligman (The MITRE Corporation).
  • Learning to Create Data-Integrating Queries

    Partha Talukdar (University of Pennsylvania), Marie Jacob (University of Pennsylvania), Mohammad Mehmood (University of Pennsylvania), Koby Crammer (University of Pennsylvania), Zachary Ives (University of Pennsylvania), Fernando Pereira (University of Pennsylvania), Sudipto Guha (University of Pennsylvania).

Research Session 24 Uncertain DB B (Rel & AC)

Session Chair: Lei Chen
  • Approximate Lineage for Probabilistic Databases

    Chris Re (University of Washington), Dan Suciu (University of Washington and Microsoft).
  • Exploiting Shared Correlations in Probabilistic Databases

    Prithviraj Sen (University of Maryland), Amol Deshpande (University of Maryland), Lise Getoor (University of Maryland).
  • Access Control over Uncertain Data

    Vibhor Rastogi (University of Washington), Dan Suciu (University of Washington and Microsoft), Evan Welbourne (University of Washington).

Tutorial Session 8

Session Chair: N.N.
  • Ontologies and Databases: Myths and Challenges

    Enrico Franconi (Free University of Bozen-Bolzano, Italy).

Demo Group 4 Data integration, collaboration

  • RIDE: A Tool for Interactive Source Registration in GLAV-based Information Integration

    Yannis Katsis, Alin Deutsch, Yannis Papakonstantinou, Keliang Zhao.
  • Comparing and Evaluating Mapping Systems with STMark

    Bogdan Alexe, Wang-Chiew Tan, Yannis Velegrakis.
  • Ad-Hoc Data Processing in the Cloud

    Dionysios Logothetis, Kenneth Yocum.
  • XTreeNet: Democratic Community Search

    Emiran Curtmola, Alin Deutsch, Kadangode Ramakrishnan, Divesh Srivastava, Kenneth Yocum, Dionysios Logothetis.
  • Making SENSE: Socially Enhanced Search and Exploration

    Tom Crecelius, Mouna Kacimi, Sebastian Michel, Thomas Neumann, Josiane Xavier Parreira, Ralf Schenkel, Gerhard Weikum.
  • AuditGuard: A system for database auditing under retention restrictions

    Wentian Lu, Gerome Miklau.

15:45 - 17:00

Industry Session 25 Query Processing

Session Chair: Jayant Haritsa
  • Towards a Physical XML independent XQuery/SQL/XML Engine

    Zhen Hua Liu (Oracle), Thomas Baby (Oracle), Sivasankaran Chandrasekar (Oracle), Hui Chang (Oracle).
  • Closing The Query Processing Loop in Oracle 11g

    Mohamed Zait (Oracle), Allison Lee (Oracle).
  • Towards a Streaming SQL Standard

    Stan Zdonik (Streambase, Inc.), Namit Jain (Oracle), Shailendra Mishra (Oracle), Anand Srinivasan (Oracle), Johannes Gehrke (Cornell University, USA), Jennifer Widom (Stanford University), Hari Balakrishnan (Streambase, Inc.), Mitch Cherniack (Streambase, Inc.), Ugur Cetintemel (Streambase, Inc.), Richard Tibbetts (Streambase, Inc.).

Research Session 26 Privacy Preservation

Session Chair: Elisa Bertino
  • Anonymizing Bipartite Graph Data using Safe Groupings

    Graham Cormode (AT&T Labs, USA), Divesh Srivastava (AT&T Labs, USA), Ting Yu (North Carolina State University), Qing Zhang (North Carolina State University).
  • Privacy Preserving Serial Data Publishing By Role Composition

    Yingyi Bu (The Chinese University of HK), Ada WaiChee Fu (The Chinese University of Hong Kong), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Lei Chen (The Hong Kong University of Science and Technology), Jiuyong Li (University of South Australia).
  • Output Perturbation with Query Relaxation

    Xiaokui Xiao (The Chinese University of Hong Kong), Yufei Tao (The Chinese University of Hong Kong).

Research Session 27 Temporal Indexing & Searching

Session Chair: Cyrus Shahabi
  • Transaction Time Indexing with Version Compression

    David Lomet (Microsoft Research, USA), Mingsheng Hong (Cornell University), Rimma Nehme (Purdue University), Rui Zhang (University of Melbourne).
  • Managing and Querying Transaction-time Databases under Schema Evolution

    Hyun Moon (UCLA), Carlo Curino (Politecnico di Milano), Alin Deutsch (UCSD), Chien-Yi Hou (UCSD), Carlo Zaniolo (UCLA).
  • On Efficiently Searching Trajectories and Archival Data for Historical Similarities

    Reza Sherkat (IBM Toronto Lab.), Davood Rafiei (University of Alberta).

Tutorial Session 9 Probabilistic Data Management

Session Chair: Paolo Papotti
  • Systems Aspects of Probabilistic Data Management

    Magdalena Balazinska (University of Washington, USA), Christopher Re (University of Washington, USA), Dan Suciu (University of Washington, USA).

Thursday, 28th August 2008

9:00 - 10:15

Research Session 28 Text & Keyword Query Processing

Session Chair: Jayant Madhavan
  • Keyword Query Cleaning

    Ken Pu (UOIT), Xiaohui Yu (York University).
  • Reasoning and Identifying Relevant Matches for XML Keyword Search

    Ziyang Liu (Arizona State University, USA), Yi Chen (Arizona State University, USA).
  • Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints

    Chuan Xiao (University of New South Wales), Wei Wang (University of New South Wales), Xuemin Lin (University of New South Wales).
  • Scalable Ad-hoc Entity Extraction from Text Collections

    Sanjay Agrawal (Microsoft Research), Kaushik Chakrabarti (Microsoft Research), Surajit Chaudhuri (Microsoft Research), Venkatesh Ganti (Microsoft Research).

Research Session 29 Systems B

Session Chair: Jingren Zhou
  • Scheduling Shared Scans of Large Data Files

    Parag Agrawal (Stanford University), Daniel Kifer (Yahoo! Research), Christopher Olston (Yahoo! Research).
  • Online Maintenance of Very Large Random Samples on Flash Storage

    Suman Nath (Microsoft Research), Phillip Gibbons (Intel Research).
  • A Skip-list Approach for Efficiently Processing Forecasting Queries

    Tingjian Ge (Brown University), Stan Zdonik (Brown University).
  • A Request-Routing Framework for SOA-Based Enterprise Computing

    Thomas Phan (IBM Almaden), Wen-Syan Li (SAP Research Center - China).

Research Session 30 Indexing & Query Processing

Session Chair: Chen Li
  • Hexastore: Sextuple Indexing for Semantic Web Data Management

    Cathrin Weiss (University of Zurich), Panagiotis Karras (National University of Singapore), Abraham Bernstein (University of Zurich).
  • Indexing Land Surface for Efficient kNN Query

    Cyrus Shahabi (Univ. of Southern California), Lu-An Tang (Univ. of Southern California), Songhua Xing (Univ. of Southern California).
  • Efficient Skyline Querying with Variable User Preferences on Nominal Attributes

    Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Ada WaiChee Fu (The Chinese University of Hong Kong), Jian Pei (Simon Fraser University), Yip Sing Ho (The Chinese University of Hong Kong), Tai Wong (The Chinese University of Hong Kong), Yubao Liu (Sun Yat-Sen University, China).
  • Efficient Top-K Processing over Query-Dependent Functions

    Lin Guo (Yahoo! Research), Sihem Amer-Yahia (Yahoo! Research), Raghu Ramakrishnan (Yahoo! Research), Jayavel Shanmugasundaram (Yahoo! Research), Utkarsh Srivastava (Yahoo! Research), Erik Vee (Yahoo! Research).

Tutorial Session 10 Continuous Queries

  • Scheduling Continuous Queries in Data Stream Management Systems

    Mohamed A. Sharaf (University of Toronto, Canada), Alexandros Labrinidis (University of Pittsburgh, USA), Panos K. Chrysanthis (University of Pittsburgh, USA).

Demo Group 3 Web, Textual data

  • AJAXSearch: Crawling, Indexing and Searching Web 2.0 Applications

    Cristian Duda, Gianni Frey, Donald Kossmann, Chong Zhou.
  • ManyAspects: A System for Highlighting Diverse Concepts in Documents

    Kun Liu, Evimaria Terzi, Tyrone Grandison.
  • Large-Scale Collaborative Analysis and Extraction of Web Data

    Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke.
  • An Effective and Versatile Keyword Search Engine on Heterogenous Data

    Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou.
  • DBPubs: Multidimensional Exploration of Database Publications

    Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank Van Ham.
  • Semandaq: A Data Quality System Based on Conditional Functional Dependencies

    Wenfei Fan, Floris Geerts, Xibei Jia.

11:00 - 13:00

Research Session 31 Spatial and Motion Data

Session Chair: Xuemin Lin
  • FINCH: Evaluating Reverse k-Nearest-Neighbor Queries on Location Data

    Wei Wu (National University of Singapore), Fei Yang (National University of Singapore), Chee-Yong Chan (National University of Singapore), Kian-Lee Tan (National University of Singapore).
  • Discovery of Convoys in Trajectory Databases

    Hoyoung Jeung (The University of Queensland), Man Lung Yiu (Aalborg University), Xiaofang Zhou (The University of Queensland), Christian Jensen (Aalborg University), Heng Tao Shen (The University of Queensland).
  • TraClass: Trajectory Classification Using Hierarchical Region-Based and Trajectory-Based Clustering

    Jae-Gil Lee (UIUC), Jiawei Han (UIUC), Xiaolei Li (UIUC), Hector Gonzalez (UIUC).
  • The V*-Diagram: a Query-Dependent Method for Moving kNN Queries

    Sarana Nutanong (The University of Melbourne), Rui Zhang (The University of Melbourne), Egemen Tanin (The University of Melbourne), Lars Kulik (The University of Melbourne).

Research Session 32 Query Processing

Session Chair: Divesh Srivastava
  • Rewriting Procedures for Batched Bindings

    Ravindra Guravannavar (IIT Bombay), S. Sudarshan (IIT Bombay).
  • Identifying Robust Plans through Plan Diagram Reduction

    Harish D (Indian Institute of Science), Pooja Darera (Indian Institute of Science), Jayant Haritsa (Indian Institute of Science).
  • A Pay-As-You-Go Framework for Query Execution Feedback

    Surajit Chaudhuri (Microsoft Research), Vivek Narasayya (Microsoft Research), Ravishankar Ramamurthy (Microsoft Research).
  • Evita Raced: Metacompilation for Declarative Networks

    Tyson Condie (UC Berkeley), Joseph Hellerstein (UC Berkeley), Petros Maniatis (UC Berkeley), David Chu (UC Berkeley).

Research Session 33 Mining B & External Memory

Session Chair: Shinichi Morishita
  • Discovering Data Quality Rules

    Fei Chiang (University of Toronto), Renee Miller (University of Toronto).
  • Mining Non-Redundant High Order Correlations in Binary Data

    Xiang Zhang (University of North Carolina), Feng Pan (University of North Carolina), Wei Wang (University of North Carolina), Andrew Nobel (University of North Carolina).
  • Keyword Search on External Memory Data Graphs

    Bhavana Dalvi (IIT Bombay, India), Meghana Kshirsagar (IIT Bombay, India), S. Sudarshan (IIT Bombay, India).
  • Sorting Hierarchical Data in External Memory for Archiving

    Ioannis Koltsidas (University of Edinburgh), Heiko Mueller (University of Edinburgh), Stratis Viglas (University of Edinburgh).

Tutorial Session 11 Clusters in High Dimensions

Session Chair: Stratis Viglas
  • Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering

    Hans-Peter Kriegel (Ludwig-Maximilians-Universitat Munchen, Germany), Peer Kroger (Ludwig-Maximilians-Universitat Munchen, Germany), Arthur Zimek (Ludwig-Maximilians-Universitat Munchen, Germany).

Demo Group 5 Tuning, systems, optimization, etc.

  • QueryScope: Visualizing Queries for Repeatable Database Tuning

    Ling Hu, Yuan-chi Chang, Christian Lang, Kenneth Ross, Donghui Zhang.
  • When is it Time to Rethink the Aggregate Configuration of Your OLAP Server?

    Katja Hose, Daniel Klan, Matthias Marx, Kai-Uwe Sattler.
  • H-Store: A High-Performance, Distributed Main Memory Transaction Processing System

    Robert Kallman, Jonathan Natkins, Hideaki Kimura, Andrew Pavlo, Alexander Rasin, Stan Zdonik, Evan Jones, Samuel Madden, Michael Stonebraker, Daniel Abadi.
  • Organizing and Indexing Non-Convex Regions

    Eric Perlman, Randal Burns, Michael Kazhdan.
  • Capri/MR: Exploring Protein Databases from a Structural and Physicochemical Point of View

    Eric Paquet, Herna Viktor.
  • C-DEM: A Multi-Modal Query System for Drosophila Embryo Databases

    Fan Guo, Lei Li, Eric Xing, Christos Faloutsos.