The stability of symbol sets produced by variety generation from bibliographic data
Program: electronic library and information systems
ISSN: 0033-0337
Article publication date: 1 February 1978
Abstract
Variety Generation involves the selection of sets of character strings, or symbols, which are intended to occur with equal probabilities in bodies of text or sets of text units from a particular source. It is important that the sample used to generate the symbol set should be representative of the data with which the set will be used. An assessment is given here of the amount of variation in symbol sets generated from files of titles and author names from BNB MARC data over a five-year period, and a comparison is made with LC MARC data. Some of the BNB symbol sets are compared directly, and equifrequency statistics are obtained for the assignment of each symbol set to each file. The differences between the equifrequency statistics are examined by means of analysis of variance.
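As an illustration only, the idea of assigning a symbol set to a file and measuring how evenly the symbols are used can be sketched as follows. This is not the authors' procedure: the greedy longest-match assignment, the relative-entropy measure, and the function names `assign_symbols` and `relative_entropy` are all assumptions made for this sketch, since the abstract does not define the equifrequency statistic used in the paper.

```python
import math
from collections import Counter


def assign_symbols(text, symbols):
    """Segment text into symbols by greedy longest match (an assumed scheme).

    Characters not covered by any symbol are emitted as one-character tokens.
    """
    # Try longer symbols first; ignore empty strings, which could never advance.
    by_length = sorted((s for s in symbols if s), key=len, reverse=True)
    tokens, i = [], 0
    while i < len(text):
        for s in by_length:
            if text.startswith(s, i):
                tokens.append(s)
                i += len(s)
                break
        else:
            # No symbol matches at this position: fall back to a single character.
            tokens.append(text[i])
            i += 1
    return tokens


def relative_entropy(counts):
    """Shannon entropy of the observed symbol counts divided by its maximum.

    A value of 1.0 means the observed symbols are used equally often;
    lower values mean a more skewed distribution. Only symbols that
    actually occur are counted.
    """
    total = sum(counts.values())
    n = len(counts)
    if n < 2:
        return 1.0
    h = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return h / math.log2(n)


# Hypothetical symbol set and title fragment, for illustration only.
tokens = assign_symbols("THE THEORY", ["THE", " ", "OR", "Y"])
score = relative_entropy(Counter(tokens))
```

Applying one symbol set to files from different years and comparing the resulting scores gives a rough sense of how stable the set is, which is the kind of comparison the abstract describes.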
Citation
Verity Brack, E., Cooper, D. and Lynch, M.F. (1978), "The stability of symbol sets produced by variety generation from bibliographic data", Program: electronic library and information systems, Vol. 12 No. 2, pp. 64-77. https://doi.org/10.1108/eb046772
Publisher
MCB UP Ltd
Copyright © 1978, MCB UP Limited