Cheshire II Commands

read_datastore - read (or count) parsed SGML/XML documents in a CheshireII DataStore file


read_datastore [-q] configfile filename [n] ["EXACT"|"NUMBER"]


Read_datastore reads the stored SGML/XML data from a datastore file used in the Cheshire II system. Datastore files store pre-parsed versions of the SGML/XML documents and the original documents need not be retain for access.

There are two forms of this command: when the keyword "NUMBER" is specified" it just delivers a single line with the Maximum record number in the datastore file, otherwise the command returns one or all of the SGML/XML records in the file to stdout.

If the "EXACT" keyword is used, the SGML/XML output will be exactly as stored. If not used the SGML/XML will be output in a partial form that has been broken up with each tag that contains only text on a single line. Note that this default form may not have the full text for some very long tags.

The configfile argument should be the name of the configuration file to be processed. Note that ONLY the first file definition in the configuration file will be created, the configfile can be simple and omit index definitions, etc. if desired. The file named in the FILENAME tag of the configfile will be created if it doesn't exist, or used if it does. The parsed SGML/XML data will all be loaded into that file.

The sgmlfile argument should be the name of the (single) SGML file to be processed.

The directory argument, used with the -r option, should be the pathname of the root directory subtree to be processed.

The n argument should be a single number representing the id number of the record in the datastore file.

If the -q flag is used, information about each record processed is suppressed, otherwise the program indicates each record found and it's length. This will also suppress reports of attempts to load duplicate records (they are not loaded in any case).


Errors are reported to stderr;


None known


Configuration file documentation, index_cheshire


Ray R. Larson ()