Bulk Loading into an OODB: A Performance Study.

Janet L. Wiener, Jeffrey F. Naughton: Bulk Loading into an OODB: A Performance Study. VLDB 1994: 120-131
@inproceedings{DBLP:conf/vldb/WienerN94,
  author    = {Janet L. Wiener and
               Jeffrey F. Naughton},
  editor    = {Jorge B. Bocca and
               Matthias Jarke and
               Carlo Zaniolo},
  title     = {Bulk Loading into an OODB: A Performance Study},
  booktitle = {VLDB'94, Proceedings of 20th International Conference on Very
               Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile},
  publisher = {Morgan Kaufmann},
  year      = {1994},
  isbn      = {1-55860-153-8},
  pages     = {120-131},
  ee        = {db/conf/vldb/vldb94-120.html},
  crossref  = {DBLP:conf/vldb/94},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Object-oriented database (OODB) users bring with them large quantities of legacy data (megabytes and even gigabytes). In addition, scientific OODB users continually generate new data. All this data must be loaded into the OODB. Every relational database system has a load utility, but most OODBs do not. The process of loading data into an OODB is complicated by inter-object references, or relationships, in the data. These relationships are expressed in the OODB as object identifiers, which are not known at the time the load data is generated; they may contain cycles; and there may be implicit system-maintained inverse relationships that must also be stored.

We introduce seven algorithms for loading data into an OODB that examine different techniques for dealing with circular and inverse relationships. We present a performance study based on both an analytic model and an implementation of all seven algorithms on top of the Shore object repository. Our study demonstrates that it is important to choose a load algorithm carefully; in some cases the best algorithm achieved an improvement of one to two orders of magnitude over the naive algorithm.
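To make the loading problem concrete, the following is a minimal sketch of a simple two-pass load. It is not taken from the paper and is not claimed to be one of its seven algorithms. It assumes the load file names objects by surrogate keys, that OIDs are assigned only when objects are created, and that an in-memory dictionary can hold the surrogate-to-OID map. All names (StoredObject, two_pass_load, the relationship names) are hypothetical.

```python
from dataclasses import dataclass, field
from itertools import count


@dataclass
class StoredObject:
    """Hypothetical stand-in for an object stored in the OODB."""
    oid: int
    # relationship name -> set of target OIDs
    refs: dict = field(default_factory=dict)


def two_pass_load(records, inverses):
    """Load records whose references use surrogate keys instead of OIDs.

    records:  list of (surrogate, {relationship: [target surrogates]})
    inverses: {relationship: name of its system-maintained inverse}
    """
    next_oid = count(1)
    oid_of = {}   # surrogate key -> assigned OID
    store = {}    # OID -> StoredObject

    # Pass 1: create every object first, so each surrogate has an OID
    # before any reference is resolved. This makes cycles harmless.
    for surrogate, _ in records:
        oid = next(next_oid)
        oid_of[surrogate] = oid
        store[oid] = StoredObject(oid)

    # Pass 2: turn surrogate references into OIDs and store the
    # inverse side of each relationship that declares one.
    for surrogate, rels in records:
        src = oid_of[surrogate]
        for rel, targets in rels.items():
            for target in targets:
                dst = oid_of[target]
                store[src].refs.setdefault(rel, set()).add(dst)
                inv = inverses.get(rel)
                if inv is not None:
                    store[dst].refs.setdefault(inv, set()).add(src)

    return oid_of, store


# Two objects that reference each other (a cycle), one inverse relationship.
records = [
    ("a", {"advises": ["b"]}),
    ("b", {"cites": ["a"]}),
]
oid_of, store = two_pass_load(records, {"advises": "advised_by"})
```

The sketch keeps the whole surrogate-to-OID map in memory and revisits each target object to write its inverse reference. Both choices are conveniences of the illustration and would be the first things to reconsider for a load that does not fit in memory.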

Copyright © 1994 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.

Printed Edition

Jorge B. Bocca, Matthias Jarke, Carlo Zaniolo (Eds.): VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12-15, 1994, Santiago de Chile, Chile. Morgan Kaufmann 1994, ISBN 1-55860-153-8

References

[Cat93]
R. G. G. Cattell: The Object Database Standard: ODMG-93. Morgan Kaufmann 1993, ISBN 1-55860-302-6
[CDF+94]
Michael J. Carey, David J. DeWitt, Michael J. Franklin, Nancy E. Hall, Mark L. McAuliffe, Jeffrey F. Naughton, Daniel T. Schuh, Marvin H. Solomon, C. K. Tan, Odysseas G. Tsatalos, Seth J. White, Michael J. Zwilling: Shoring Up Persistent Applications. SIGMOD Conference 1994: 383-394
[CMR92]
...
[DLP+93]
...
[Mai]
...
[Moh93a]
C. Mohan: A Survey of DBMS Research Issues in Supporting Very Large Tables. FODO 1993: 279-300
[Moh93b]
C. Mohan: IBM's Relational DBMS Products: Features and Technologies. SIGMOD Conference 1993: 445-448
[Nel91]
...
[Obj92]
...
[OHMS92]
Jack A. Orenstein, Sam Haradhvala, Benson Margulies, Don Sakahara: Query Processing in the ObjectStore Database System. SIGMOD Conference 1992: 403-412
[Ont92]
...
[PG88]
Norman W. Paton, Peter M. D. Gray: Identification of Database Objects by Key. OODBS 1988: 280-285
[Sho93]
Arie Shoshani: A Layered Approach to Scientific Data Management at Lawrence Berkeley Laboratory. IEEE Data Eng. Bull. 16(1): 4-8 (1993)
[Sno89]
...
[Veg86]
Steven R. Vegdahl: Moving Structures between Smalltalk Images. OOPSLA 1986: 466-471
[Ver93]
...
[WI93]
Janet L. Wiener, Yannis E. Ioannidis: A Moose and a Fox Can Aid Scientists with Data Management Problems. DBPL 1993: 376-398