logo_color_small.jpg

LINSearch - Indexing and natural-language search for technical and scientific documents

The daily business of FIZ Technik and TIB Hannover is to provide professional users with high quality information in technical domains such as mechanical engineering or materials. For the users it is important to get high quality results according to their queries delivered by intelligent retrieval technologies.
The daily business of a technical information service (FIZ) is to provide professional users with high quality information in certain domain. The aim of LinSearch project is the development, evaluation and usage of an efficient integrated system for the indexing and searching of technical and scientific documents for FIZ Technik and TIB Hannover. Therefore different methods will be combined to exploit the benefits of the approaches.
The project
  LINSearch ? Indexing and natural-language search for technical and scientific documents




The aim of LinSearch project is the development, evaluation and usage of an integrated system for the indexing and searching of technical and scientific documents for FIZ Technik and TIB Hannover. This system should support the natural language of the users as well as multiple languages, especially German and English. To access documents the correct index terms about a document are most important. Currently, the indexing is done manually by experts, which is very time-consuming. Also the constantly increasing amount of publications leads to a reduced actuality of the provided information.

Approach
In the first phase of the project available technologies for the indexing and information extraction will be used to automatically index a subset of the documents of FIZ Technik and TIB Hannover. The results will be evaluated and compared with the manually generated index. In the next step, the different techniques will be combined in order to exploit the benefits of the different approaches.
The semi-automatic indexing will support the experts in their job by proposing appropriate index terms, which can afterwards be corrected. With the help of self-learning algorithm the index algorithm should learn over time to improve its suggestions.

The results of the indexing process also support the users of the system. The user should be assisted in the query formulation by proposing related terms. After the query execution, the result set contains not only the description of the documents, but also related index terms, which can be used to further refine the query. Finally the retrieval engine, which handles the user queries, combines linguistic methods and thesauri to improve the query results.

After three years
At the end of the project FIZ Technik and TIB Hannover will be able to index new documents much more efficient and with a higher quality. Thus strengthen their position in the global information market, as their customers will also get high quality and up to date information.

Project Partners:
  • Fachinformation Technik (FIZ Technik), Frankfurt
  • IAI - Institut der Gesellschaft zur Förderung der Angewandten Informationsforschung e.V., Saarbrücken
  • Technische Informationsbibliothek Hannover (TIB Hannover)
  • L3S Research Center, Hannover

URL:

http://www.linsearch.de/


StartDate:

01/01/2007


EndDate:

12/31/2009