CREATING ON MINISIS DATA BASE A RETRIEVAL SYSTEM FOR BASIC RESEARCH INFORMATION
Liu Qimao
University Library
Huazhong University of Science and Technology
Wuhan, 430074 China
Abstract: This paper describes briefly the features of MINISIS data base management systems and the HP3000/925LX hardware applied to the automation of our library, and analyzes concretely the data structure and performance of the basic research information retrieval system built on MINISIS data base. Preliminary considerations on improving the system and its application and dissemination are stated.
The HP3000 series computer is fitted with an MPE operating system,which is a time sharing system based on magnetic disk for multi-programming execution, and is responsible for operating and controlling the computer system and user programs. It suits concurrent transactions, data com-munication, program development and batched job processing. It supports simultaneous multiuser access to system resources with individual interface to each user, and supports many languages including COBOL, FORTRAN, BASIC, PASCAL and SPL.
The relational database management system MINISIS, developed by Canadian International Development and Research Center (IDRC) specifically for document and information processing, operates on the HP3000 series computer under MPE environment. The four types of data base within MINISIS are: main base RD, projection base PS of the main base for different users, combi-nation base DS for linking base with base, and the CD base for data transmission.
Ingeniously handling these data bases, various functional modules of MINISIS allows the library integrated system to be implemented with a minimum amount of programing. The library automation system in our campus adopts HP3000/925LX, which is the medium level product in HP3000 series, designed on the so-called precision architecture (HPPA) concept, with 24MB internal storage exten-sible to 48 MB, operating at 3.2 MIPS, supporting up to 40 users,utilizing 48 bits virtual address-ing, advanced instruction pipeline technique and high performance floating point co-processor, and containing a back-up battery to give automatic restart.
2. SYSTEM ANALYSIS
In order to fully utilize resources of the present facilities and to extend the application of the HP3000 system, we have launched a project of creating the data base and retrieval system of six main funding programs in China. These six are National Natural Science Foundation, Doctorate Conferring Units Fund, State Education Commission's Excellent Young Teachers Subsidy Fund, Sir Huo Yingdong Foundation, National Social Science Foundation, and National Young Men Social Science Foundation. It is built on the basis of the MINISIS data base management system with the HP3000 computer. The MINISIS system is very suitable for processing documental information and data as it supports the processing of variable field and repetitive field and allows a full description of every attribute of documents and factual data. In addition, modules ISOCONV and BATCHIN in MINISIS solve effectively the input and output problems of various types of data. A centralized system is designed for the retrieval of basic research information based on the defini-tion of relational data with a general data base of funds, which consists of six subdata bases of funds. The data fields of the general data base include funding program, title of project, category of project, unit of applicant, keyword,etc. Subdata bases are projections of the general data base con-tents corresponding to the previously mentioned six main funding programs respectively.
Moreover, the system has two subsidiary data bases, viz. the database of project classification catalog and code and the database of professional position codes referring to names. The former contains disciplinary codes and names of corresponding disciplines, while the latter contains profes-sional position codes and names of corresponding positions to facilitate retrieval and print out.
A microcomputer interface is designed in the system to transfer data in the general data base into text file format to store in diskettes through PC to serve PC users.
The system takes advantage of the menu driven program supplied by MINISIS to design user interface,which is convenient, straightforward and easy to operate. The account structure of MPE/XL operating system is utilized to provide different users with different capabilities, and design proper passwords to ensure the data safety. The MINISIS system offers powerful capability in data base maintenance, which can extend file space, delete useless data, recover free space, and guarantee the system's security and reliability.
3. FUNCTIONS OF THE SYSTEM
The major function of the system is basic research information retrieval. Three subsystems include:
• data log in and update subsystem,
• retrieval subsystem and
• print out subsystem.
3.1. Data log in and update subsystem
This subsystem has the following functions:
• Automatic data transcribing: A data transcriber software is designed to make full use of exist-ing data sources. Desired data taken from data base of funds in DBASE are stored in PC diskettes in TXT file format, then sent to HP3000 from PC through the emulate communication technique, and finally loaded after format transformation into the data base of funds in MINISIS with further processing.
• Manual data log in: Grant projects without data source are processed to have needed format and loaded in manually through terminals with menu prompting function offered by MINISIS and the screen input function designed by means of VPLUS software. Thus the data completeness and updating are ensured.
• Data base updating: Data base records are updated by means of VPLUS screen operation to ensure the data accuracy.
3.2. Retrieval subsystem:
The system provides funding applicants with multi-path and multi-level originality check service. At present, single item and combined retrieval can be done from key words, funding programs, disciplinary codes or title phrases.
• Retrieving from key words: Retrieving related items of various funding programs from key words can further limit the retrieving scope on funding programs, disciplinary codes, and the beginning and end dates to reduce the retrieving operation and raise the hit ratio.
• Retrieving from the funding programs: Data base of the corresponding foundation can be scanned respectively as needed.
• Retrieving from disciplinary codes: Related projects in corresponding data base of funds can be retrieved from disciplinary codes.
• Retrieving from phrases in project title: Related projects in corresponding data base of funds can be retrieved from the project title phrases in aid of contextual correlation of title components.
3.3. Print out subsystem:
It has two print out formats,viz. Simplified format and complete format. The simplified format contains the funding program, project name, applicant's title and unit, beginning and end dates, key words, and disciplinary codes. The complete format has a synopsis of the project in addition to that in simplified format. Sorted printout is possible in terms of funding programs, disciplinary codes, applicant's units, or applicant's name respectively.
4. APPLICATIONS
The system has collected relevant information of about 20 thousand and more items of granted projects by the Doctorate Conferring Units Fund, State Education Commission's Excellent Young Teachers Subsidy Fund, Sir Huo Yingdong Foundation and National Natural Science Foundation. The system provides funding applicants with multi-path and multi-level originality check service. By means of this system we have found through originality check that 9 of the 34 projects that our university was applying for Doctorate Conferring Units Fund are similar or correlated with projects already granted. Similarly, the system showed that 22 of the 201 projects in this campus applying for grants from the National Natural Science Foundation have correlated even entirely same projects approved earlier. At the same time, the system has given consultative service to 27 faculty members applying grants from National Natural Science Foundation in 1992, and has printed lists of 454 items about the correlated projects already existed, for applicants as a basis in re-selecting research projects.
We are now making efforts to continue the building of these data bases,
to enhance the intelligent retrieval function, to realize both local and
remote retrieval, and to consider the business of charged data transfer
and software implantation if possible under the permission of concerned
authorities to exert greater social benefit.
REFERENCES
Liu, Qimao & Fang, Hong. (1991). "Preliminary Application of MINISIS Data Base System in Our Library,"Chinese MINISIS Users Communication, 4 (4).
IDRC. MINISIS G Version, Application Programmer's Guide. July 29, 1989.
Liu, Qimao & Cai, Hongmei. (1992). "Library Automation and HP3000 System," Computer Application Study (in Chinese), 9 (2)..
Liu, Qimao; Cai, Hongmei & Fang, Hong. (1992). "Application Study of HP3000 System," Proceedings of the 4th Conference of HP Section Members of Chinese Association of Computer Users (in Chinese), 1992.