Information Retrieval List Digest 181 (September 27, 1993) URL = http://hegel.lib.ncsu.edu/stacks/serials/irld/irld-181 IRLIST Digest ISSN 1064-6965 September 27, 1993 Volume X, Number 37 Issue 181 ********************************************************** I. NOTICES A. Meeting Announcements/Calls for Papers 1. MIST: Training Course on Information Management B. Publications 1. Wordnet 1.4 II. QUERIES B. Requests for Information 1. FAQ for IR Research IV. PROJECT WORK A. Initiatives and Proposals 1. NSF Digital Libraries Proposal ********************************************************** I. NOTICES I.A.1. Fr: Patrick Vanouplines Re: Training Course on Information Management: MIST2 MANAGEMENT OF INFORMATION IN SCIENCE AND TECHNOLOGY with an emphasis on information related to water and the environment and approved by the Flemish Interuniversity Council (VLIR); sponsored by the Belgian Administration for Development Cooperation (BADC or ABOS) and by the UNESCO International Hydrological Programme (IHP). COSTS: - Free of charge(!) for 12 participants from developing countries, to be selected by the BADC and the organizers. They will also receive a grant to cover the costs of accommodation and an airplane return ticket. - Others pay a fee to participate during the whole period (70,000 Belgian Francs = about 2,000 US$) including all social activities, or to particular items selected from the programme. (1,000 Belgian Francs per module of a half day = about 35 US$) - Participants pay themselves for travel, meals, and housing. FOR FULL INFORMATION, CONTACT: Tel. ++32-2-641 24 29 Fax ++32-2-641 2609 (or 2282) Telex 61051 vubco-b, E-mail (Internet): PVOUPLIN@VUB.AC.BE or PNIEUWEN@VUB.AC.BE Mail: Paul NIEUWENHUYSEN or Patrick VANOUPLINES MIST 2 University Library Free University Brussels Pleinlaan 2, B-1050 Brussels BELGIUM PRELIMINARY PROGRAMME: * Orientation tour of the University Library * Orientation tour of the Laboratory of Hydrology * Introduction to microcomputer systems: hardware * Introduction to microcomputer systems: software * Microcomputer operating systems * The flow of scientific information * Bibliographic descriptions; ISBD * National libraries and national bibliographies * Subject classification schemes and thesaurus systems * Telematics, data communication, computer networks * Electronic mail * Online information retrieval * Bibliographic databases related to water and the environment * Search strategies * Software packages for local storage and retrieval of bibliographic information * Introduction to CDS/ISIS software package for information storage and retrieval * The application of CDS/ISIS: searching * The application of CDS/ISIS: editing data in a database * The application of CDS/ISIS: output of selected data to file or printer * The application of CDS/ISIS: developing a database structure * The application of CDS/ISIS: indexing data for fast retrieval * Downloading of information and record format conversion: principles * Downloading of information and record format conversion: application of Fangorn with CDS/ISIS * Statistics for information science * Queuing theory * Citation analysis * The bibliometric laws * Scientometrics * Library automation * Online Public Access Catalogues (OPACs) * Interlibrary lending and cooperation * Document collection development * Introduction to the IDAMS microcomputer software package for data analysis * Text editing / Word processing * Information technology for developing countries * The information society * Copyright * Transborder data flow * CD-ROM = Compact Disk * Read Only Memory * Presentation of data, using a microcomputer * Marketing of information and documentation * CD-interactive * Audio visual media * Computer-based multimedia * Relational data base management systems * Public relations * Archives * Image databases * Geographic Information Systems (GIS) * Artificial intelligence for information systems POSSIBLE STUDY VISITS: * Library of the University of Antwerp (UIA) + Inter-university Postgraduate School for Information and Library Science at the University of Antwerp (Belgium) * The Royal (National) Library (Brussels, Belgium) * Documentation centre of the national association for the prevention of accidents (in Brussels, Belgium) * Information service of the Geology Department of the Royal Museum on Africa (in Tervuren near Brussels, Belgium) * Institute for Marine Scientific Research IZWO (at the sea coast near Ostend, Belgium) * International Reference Centre (IRC) for Water Supply and Sanitation (The Hague * The Netherlands) * Documentation Department of the KIT (in Amsterdam, The Netherlands) * National Online Information Meeting 1994 (in Rotterdam, The Netherlands) ********** I.B.1. Fr: Shari L. Landes Re: Wordnet 1.4 Now Available WordNet is an on-line lexical reference systems whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, and adjectives are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets. WordNet Version 1.4 is now available. The WordNet database is close to 13.5 megabytes, exclusive of the search code. The entire package is approximately 17.5 megabytes. The WordNet search code is distributed in binary form only, and is presently available for Sun-4, NeXT, DECstation, RS-6000, Macintosh and PC architectures. An X Windows interface is available for Sun-4, DECstation, RS-6000 and NeXT (please note that this is NOT a NeXTStep application - you must have X Windows for the NeXT in order to use the X Windows interface). A Microsoft Windows interface is available for the PC. A command line interface is also provided for all architectures except the Macintosh. If you are currently using an earlier version of WordNet you are strongly encouraged to upgrade to version 1.4. Small bugs and inconsistencies in both the database and search software have been corrected, and the database coverage has been expanded. Attributes have been added with this release. New with release 1.4 is a semantic concordance: a textual corpus linked to a lexicon with semantic tags. The concordance consists of 103 files from the Brown Corpus annotated with pointers to word senses in the WordNet 1.4 database. An X Windows application, Escort, is provided for searching the concordance files for occurrences and co-occurrences of semantic tags. Escort has been ported to the Sun-4, NeXT and DECstation platforms. You must install WordNet 1.4 before installing and using the semantic concordance package. The semantic concordance package is approximately 20 megabytes. SUMMARY OF CHANGES: Updates to database - additional coverage, cleanup Addition of attributes Port to RS-6000 New semantic concordance package We prefer that you ftp the WordNet system via anonymous ftp from clarity.princeton.edu. The packages are located in the subdirectory 'pub'. IF YOU FTP WordNet, PLEASE SEND MAIL TO wordnet@princeton.edu SO WE CAN UPDATE OUR RECORDS AND KEEP TRACK OF OUR USERS FOR FUTURE MAILINGS AND RELEASES. EVEN IF YOU ARE A CURRENT USER WHO IS UPDATING, IT IS USEFUL TO US TO KNOW THAT YOU HAVE UPGRADED TO 1.4. ***** REMEMBER TO FTP IN "binary" MODE!!! ***** To ftp the UNIX version of WordNet 1.4, ftp the following file: wn1.4unix.tar.ZWordNet Version 1.4 for UNIX systems in compressed tar format. This includes the WordNet database, binary installation of search code for Sun-4, DECstation, RS-6000 and NeXT, and documentation. Installation instructions and a Makefile are included. Man pages are provided as unformatted troff files. To ftp the PC (DOS) version of WordNet 1.4, ftp the following files: (1) readme.pcREADME file for PC installation. (2) wn14.arcPC version in ARC format. This includes the WordNet database, binary installation of search code (command line and Microsoft Windows interfaces), and documentation. Installation instructions and installation batch file, and a batch file for running WordNet are included. Man pages are provided in a format which can be sent to the line printer or viewed on the screen. (3) arc.exearc program needed to 'unarc' the PC version. If you already have this on your PC you do not need to ftp this file. To ftp the Macintosh version of WordNet 1.4, ftp the following files: (1) readme.macREADME file for Macintosh installation. (2) MacWordNet1.4.sit.bin Macintosh version in Stuffit format. This includes the WordNet database, binary installation of search code, and documentation. Man pages are provided in Postscript format. (3) UnStuffit-Deluxe-TM.bin Unstuffit program needed to unpack the Macintosh version. If you already have UnStuffit on your Macintosh, you don't need to ftp this file. Semantic concordance: wn1.4semcor.tar.Z Semantic concordance package in compressed tar format. Includes the semantically tagged files, Escort searchingapplication for Sun-4, DECstation and NeXT, and documentation. Installation instructions and a Makefile are included. Man pages are provided as unformatted troff files. Papers and WordNet documentation only: wn1.4man.tar.Z WordNet 1.4 documentation (man pages) only as unformatted troff files. 5papers.tar.Z troff paper describing WordNet project in compressed tar format ("Five Papers on WordNet"). A Makefile for formatting and printing the papers is included. If you need a PC or Macintosh version on diskette we will provide WordNet on magnetic media. Upgrades are also available as are printed copies of "Five Papers on WordNet." Contact the following for price information: Princeton University Cognitive Science Laboratory 221 Nassau Street Princeton, NJ 08544-2093 Attn: Laura Hawkins Please address all email concerning WordNet to wordnet@princeton.edu. If you have received this message via email and do not wish to remain in the user database, please send a request to be deleted. WORDNET USERS' MAILING LIST: We have (finally) set up a WordNet users' mailing list that will be administered here at Princeton. Items addressed to the mailing list will be automatically forwarded to all users on the list. Please note that this mailing list is separate from the user database. To participate in the mailing list, you must request to be added. To post a message to the mailing list, address mail to 'wn-users@princeton.edu'. Requests to be added to or removed from the mailing list should be sent to 'wn-users-request@princeton.edu'. Although you have received this announcement, you will only be added to the mailing list if you send a request to 'wn-users-request@princeton.edu'. Please be sure to include your correct e-mail address in the body of your request. Also, to help us keep our records up to date, if you are a current WordNet user it would be helpful to us if you would include the version of WordNet you are using (the latest release is 1.4) and the platform(s) that you are running on. If you have code or various flavors of the WordNet database that you would like to share with others, at the present time we prefer that you keep the data at your site, announce it to users via the mailing list, and make it available to interested parties either via 'ftp' or e-mail. If your site does not allow anonymous ftp, then we will consider moving the data to Princeton. Requests of this sort should be addressed to 'wn-users-request@princeton.edu'. To help with the administrative end of things, items sent to 'wn-users-request@princeton.edu' should use the 'Subject' of the message to convey the intent of the request. To be added to the mailing list, please specify a subject of 'Add user'. Similarly, to be removed from the list, specify a subject of 'Remove user'. Other types of requests should attempt to make intelligent use of the message subject. ********************************************************** II. QUERIES II.B.1. Fr: Padmini Srinivasan Re: FAQ for IR Research Is there an FAQ available for IR research? My impression is that there isn't one. If I am correct in this, may I suggest that some of us volunteer to create one. I know that I would find it most useful. I believe that there are many who would agree - especially those who are beginning their investigation into IR problems. I am listing below potential topics for inclusion. This schema partially derives from an FAQ that some volunteers including myself just created for the ai-medicine discussion group. Discussion on this topic is welcome. Finally if there exists such an FAQ, I would appreciate directions to it. Regards, Padmini Srinivasan Associate Professor School of Library & Information Science and Department of Manegement Sciences Univ. of Iowa padmini-srinivasan@uiowa.edu Potential Schema for IR-FAQ (1) Test Databases available for researchers. In each case specify source, ftp site if any, contact name, etc. (1a) Databases with queries and relevance judgments Bibilographic Other Listing of some citations using the dataset (1b) Databases without queries and relevance judgments Full text Bibilographic Other (2) Thesauri and Other Vocabulary Systems. In each case specify, source, ftp site if any, contact name, etc. (2a) Subject Specific Vocabulary Tools (2b) General Tools: Dictionaries, etc. Listing of some citations using these tools. (3) Retrieval Systems (3a) Experimental Prototypes (3b) Operational (4) Listing of IR-Related Journals (5) Listing of IR-Related Conferences (6) Listing of IR-Related Internet Resources: listservs, etc. (7) Listing of Schools Offering Masters and Ph.D Degrees in IR, including fellowships, etc. ********************************************************** IV. PROJECT WORK IV.A.1. Fr: Gregory B. Newby Re: NSF Digital Libraries Proposal Title : NSF 93-141 - Research on Digital Libraries Type : Program Guideline NSF Org: CISE / IRI Date : September 16, 1993 File : nsf93141 ** This submission was edited for space accommodation. For complete information contact ** e- mail to: lrosenbe@nsf.gov(internet). By mail: Laurence C. Rosenberg Deputy Division Director Information, Robotics & Intelligent Systems Prior to October 25, 1993: After October 25, 1993: National Science Foundation National Science Foundation Washington, D.C. 20550 4201 Wilson Blvd. By telephone: Prior to October 25, 1993: After October 25, 1993: (202) 357-9592 (703) 306-1927 By Fax: Prior to October 25, 1993: After October 25, 1993: (202) 357-0320 (703) 306-0577 RESEARCH ON DIGITAL LIBRARIES: A Joint Initiative of National Science Foundation, Computer and Information Science and Engineering Directorate, Advanced Research Projects Agency, Computing Systems Technology Office and the, Software and Intelligent Systems Technology Office, National Aeronautics and Space Administration PROPOSALS MUST BE RECEIVED AT NSF NO LATER THAN FEBRUARY 4, 1994 Effective October 25, 1993, the new mailing address will be: National Science Foundation 4201 Wilson Blvd. Arlington, VA 22230 INTRODUCTION: Today, the network connects some information sources that are a mixture of publicly available (with or without charge) information and private information shared by collaborators. These information sources, when connected electronically through a network, represent important components of an emerging, universally accessable, digital library. It is the purpose of this announcement to support necessary research and development to achieve universal connection as well as the ability to digitize, store, search, process, and retrieve information, by combining the complementary strengths of the participating agencies in basic research, advanced development and applications, and academic/industry linkage. RESEARCH TO BE SUPPORTED: Applicants can propose research in the following areas: 1-capturing data (and descriptive information about such data) of all forms (text, images, sound, speech, etc.) and categorizing and organizing electronic information in a variety of formats. 2-advanced software and algorithms for browsing, searching, filtering, abstracting, summarizing and combining large volumes of data, imagery, and all kinds of information; and 3-the utilization of networked databases distributed around the nation and around the world. PROPOSAL PREPARATION: The proposals must be marked DIGITAL LIBRARIES in the top left hand box, "Program Announcement," on the cover sheet (NSF Form 1207). Proposals must be prepared according to the instructions given in Grants for Research and Education in Science and Engineering (GRESE: NSF 92-89), with some additional requirements. PROPOSAL EVALUATION: Proposals will be evaluated by panels of experts. Supplementary mail reviews will be solicited as feasible and necessary to achieve a fair and accurate review of all proposals. Some potentially successful submissions may receive site visits. Selection of awards will be made jointly by the sponsoring agencies from the group of proposals receiving the highest ratings from the panel of experts. SCHEDULE: Organizations interested in submitting proposals should send an electronic mail message with the following statement: I am interested in submitting a proposal to the Digital Libraries Initiative in Areas ___(1,2,3) Name Title Organization Mailing address email address Address the email to: gbarber@nsf.gov cc: lrosenbe@nsf.gov If email on the internet is not available to you, then send a postcard with the same information to: Up to October 25, 1993 As of October 25, 1993 Gwendolyn Barber Gwendolyn Barber Room 310 National Science Foundation National Science Foundation 4201 Wilson Boulevard Washington, D.C. 20550 Arlington, VA 22230 This information will be used to notify you of special regional briefings that will be held for this announcement. These briefings will be designed to answer your questions and to help you find potential partners for research consortia under this announcement. It is anticipated that these briefings will occur during November/December, 1993. IN ORDER TO BE CONSIDERED UNDER THIS ANNOUNCEMENT, PROPOSALS MUST BE RECEIVED BY NSF NO LATER THAN FEBRUARY 4, 1994 SUCCESSFUL PROPOSERS WILL RECEIVE NOTIFICATION IN JUNE/JULY, 1994. Facilitation Awards for Scientists and Engineers with Disabilities provides funding for special assistance or equipment to enable persons with disabilities (investigators and other staff, including student research assistants) to work on an NSF project. Contact the program coordinator in the Directorate for Education and Human Resources. The telephone number is (202)357-7562 prior to September 24, 1993. Thereafter, the number is (703) 306-1636. The Foundation has TDD (Telephonic Device for the Deaf) capability, which enables individuals with hearing impairment to communicate with the NSF Information Center about NSF programs, employment, or general information. The telephone number is (202) 357-7492 prior to October 1, 1993. Thereafter the number is (703) 306-0090. ********************************************************** IRLIST Digest is distributed from the University of California, Division of Library Automation, 300 Lakeside Drive, Oakland, CA. 94612-3550. Send subscription requests to: LISTSERV@UCCVMA.BITNET Send submissions to IRLIST to: IR-L@UCCVMA.BITNET Editorial Staff: Clifford Lynch calur@uccmvsa.ucop.edu or calur@uccmvsa.bitnet Nancy Gusack ncgur@uccmvsa.bitnet or ncgur@uccmvsa.ucop.edu Mary Engle meeur@uccmvsa.bitnet The IRLIST Archives is now set up for anonymous FTP, as well as via the LISTSERV. Using anonymous FTP via the host dla.ucop.edu, the files will be found in the directory pub/irl, stored in subdirectories by year (e.g., /pub/irl/1993). Using LISTSERV, send the message INDEX IR-L to LISTSERV@UCCVMA.BITNET. To get a specific issue listed in the Index, send the message GET IR-L LOGYYMM, where YY is the year and MM is the numeric month in which the issue was mailed, to LISTSERV@UCCVMA (Bitnet) or LISTSERV@UCCVMA.UCOP.EDU. You will receive the issues for the entire month you have requested. These files are not to be sold or used for commercial purposes. Contact Nancy Gusack or Mary Engle for more information on IRLIST. THE OPINIONS EXPRESSED IN IRLIST DO NOT REPRESENT THOSE OF THE EDITORS OR THE UNIVERSITY OF CALIFORNIA. AUTHORS ASSUME FULL RESPONSIBILITY FOR THE CONTENTS OF THEIR SUBMISSIONS TO IRLIST.