ACM SIGMOD Anthology VLDB dblp.uni-trier.de

A New Compression Method with Fast Searching on Large Databases.

Jianzhong Li, Doron Rotem, Harry K. T. Wong: A New Compression Method with Fast Searching on Large Databases. VLDB 1987: 311-318
@inproceedings{DBLP:conf/vldb/LiRW87,
  author    = {Jianzhong Li and
               Doron Rotem and
               Harry K. T. Wong},
  editor    = {Peter M. Stocker and
               William Kent and
               Peter Hammersley},
  title     = {A New Compression Method with Fast Searching on Large Databases},
  booktitle = {VLDB'87, Proceedings of 13th International Conference on Very
               Large Data Bases, September 1-4, 1987, Brighton, England},
  publisher = {Morgan Kaufmann},
  year      = {1987},
  isbn      = {0-934613-46-X},
  pages     = {311-318},
  ee        = {db/conf/vldb/LiRW87.html},
  crossref  = {DBLP:conf/vldb/87},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

In this paper, a new compression method for constant removal from very large scientific and statistical databases is presented. The new method combines the best features from several classical constant removal compression methods. The result, both analytical and experimental, shows that the method is superior to these popular methods in terms of compression effectiveness and eficient searching on the compressed data. In addition to the development, analysis and validation of this new method, this paper also presents analysis of several traditional constant removal methods for the purpose of analytic comparison. A large collection of experiments have been designed and run to observe and validate the behavior of the compression methods. Another contribution of the paper is that performance characteristics are identified for different compression methods under different data properties assumptions. The result can be used as a basis of selecting compression methods by matching the properties of the database at hand to the data properties experimented in the paper.

Copyright © 1987 by the VLDB Endowment. Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by the permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.


Online Paper

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 1 Issue 4, VLDB '75-'88" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

Peter M. Stocker, William Kent, Peter Hammersley (Eds.): VLDB'87, Proceedings of 13th International Conference on Very Large Data Bases, September 1-4, 1987, Brighton, England. Morgan Kaufmann 1987, ISBN 0-934613-46-X
Contents CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

References

[1]
...
[2]
Susan J. Eggers, Arie Shoshani: Efficient Access of Compressed Data. VLDB 1980: 205-211 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[3]
Susan J. Eggers, Frank Olken, Arie Shoshani: A Compression Technique for Large Statistical Data-Bases. VLDB 1981: 424-434 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[4]
Arie Shoshani, Frank Olken, Harry K. T. Wong: Characteristics of Scientific Databases. VLDB 1984: 147-160 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[5]
Arie Shoshani: Statistical Databases: Characteristics, Problems, and some Solutions. VLDB 1982: 208-222 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[6]
...
[7]
...
[8]
...
[9]
...
[10]
Bruce Hahn: A New Technique for Compression and Storage of Data. Commun. ACM 17(8): 434-436(1974) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[11]
Donald E. Knuth: The Art of Computer Programming, Volume III: Sorting and Searching. Addison-Wesley 1973, ISBN 0-201-03803-X
CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[12]
Robert Endre Tarjan, Andrew Chi-Chih Yao: Storing a Sparse Table. Commun. ACM 22(11): 606-611(1979) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[13]
Mostafa A. Bassiouni: Data Compression in Scientific and Statistical Databases. IEEE Trans. Software Eng. 11(10): 1047-1058(1985) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[14]
...
[15]
Jukka Teuhola: A Compression Method for Clustered Bit-Vectors. Inf. Process. Lett. 7(6): 308-311(1978) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[16]
...
[17]
Harry K. T. Wong, J. Z. Li: Transposition Algorithms on Very Large Compressed Databases. VLDB 1986: 304-311 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Copyright © Tue Mar 16 02:21:59 2010 by Michael Ley (ley@uni-trier.de)