Knowledge Management

Knowledge Management is a multi-disciplinary approach to making the best use of knowledge. Efficient and effective searching is a key component. Eurospider's know-how covers various aspects of knowledge management.

InnoBib.News provides the latest information on innovation at libraries and other memory institutions.

In 1985, a well-known publication reported on a retrieval experiment in which lawyers searched for legal information. They found only about 20% of the potentially relevant documents. The authors concluded that full-text search has serious limitations.

The SIGIR 2010 Industry Track organized by David Harper (Google, Switzerland) and Peter Schäuble (Eurospider, Switzerland) was a success. In the morning session, four keynote talks were given by influential technical leaders (Baidu, Google, Bing, Yandex). During the afternoon session, seven presentations showed interesting, novel, and innovative ideas from the search industry.

To review documents within an eDiscovery process, it is helpful to assess entire email conversations. Short messages without context may be difficult or even impossible to assess, e.g. when preceding questions are missing. Furthermore, it is simply more efficient to tag an entire thread rather than every single message in the thread.
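Grouping messages into threads can be sketched as follows. This is a minimal illustration, not a description of any particular eDiscovery product: it assumes each message is a dictionary with the hypothetical fields `id` and `in_reply_to` (comparable to the Message-ID and In-Reply-To email headers), and it assigns every message to the root of its reply chain.

```python
from collections import defaultdict


def group_into_threads(messages):
    """Map each thread root id to the ids of all messages in that thread.

    Assumption: every message is a dict with an "id" and an optional
    "in_reply_to" field (hypothetical names chosen for this sketch).
    """
    parent = {m["id"]: m.get("in_reply_to") for m in messages}

    def root_of(msg_id):
        # Follow reply links upwards until the parent is missing or unknown.
        seen = set()
        while parent.get(msg_id) in parent and msg_id not in seen:
            seen.add(msg_id)
            msg_id = parent[msg_id]
        return msg_id

    threads = defaultdict(list)
    for m in messages:
        threads[root_of(m["id"])].append(m["id"])
    return dict(threads)


messages = [
    {"id": "m1", "in_reply_to": None},   # opening question
    {"id": "m2", "in_reply_to": "m1"},   # short answer, unclear without m1
    {"id": "m3", "in_reply_to": "m2"},
    {"id": "m4", "in_reply_to": None},   # unrelated conversation
]
threads = group_into_threads(messages)

# Tagging one thread applies the tag to all of its messages at once.
tags = {msg_id: "relevant" for msg_id in threads["m1"]}
```

With this grouping, a reviewer makes one decision for the thread rooted at `m1` instead of three separate decisions for `m1`, `m2`, and `m3`.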

Today’s tools support and encourage the duplication of data. Let’s assume user A obtains a document from the enterprise storage and sends it as an attachment by email to user B who stores it on a laptop. This everyday scenario shows how easily files are duplicated. The document file is not only in the enterprise storage, but also in A’s sent box, in B’s inbox, and on B’s laptop, possibly twice if it is in the target folder selected by B as well as in the download folder.
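One common way to detect such copies is to compare content hashes. The sketch below is only an illustration under stated assumptions: the storage locations and file contents are invented for the example, and the files are held in memory as bytes rather than read from real storage.

```python
import hashlib
from collections import defaultdict


def find_duplicates(files):
    """Return groups of locations whose content is byte-for-byte identical.

    `files` maps a location string to the file content as bytes.
    """
    by_digest = defaultdict(list)
    for location, content in files.items():
        digest = hashlib.sha256(content).hexdigest()
        by_digest[digest].append(location)
    # Only digests seen at more than one location indicate duplicates.
    return [sorted(locations) for locations in by_digest.values() if len(locations) > 1]


# Hypothetical locations mirroring the scenario in the text.
files = {
    "enterprise_storage/report.pdf": b"quarterly report",
    "user_a/sent/report.pdf": b"quarterly report",
    "user_b/inbox/report.pdf": b"quarterly report",
    "user_b/laptop/downloads/report.pdf": b"quarterly report",
    "user_b/laptop/projects/report.pdf": b"quarterly report",
    "user_b/laptop/projects/notes.txt": b"meeting notes",
}
duplicate_groups = find_duplicates(files)
```

Exact hashing only finds identical copies; a file that was edited or re-saved in another format would need a near-duplicate technique instead.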

Information Retrieval

The objective of Information Retrieval (IR) is to search large data collections for information relevant to a user’s information requirements. The term “information retrieval” was coined by Calvin Mooers in 1950. Like “research”, the word “retrieval” does not refer to finding something again. Rather, it relates to the information retrieval paradox: “If I knew what I was searching for, I wouldn’t be searching for it.”

Information retrieval focuses on three dimensions: systems and applications, theory and models, and evaluation. Various retrieval models exist, such as the Vector Space Model (VSM), probabilistic models, and language models. For evaluation, recall and precision are often used. SMART was an early retrieval system that dealt with all three aspects. RankBrain is a more recent retrieval system based on TensorFlow.
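Recall and precision can be computed from the set of retrieved documents and the set of relevant documents. The sketch below uses invented document identifiers; the numbers are chosen for illustration only and are not taken from any published experiment.

```python
def precision_recall(retrieved, relevant):
    """Compute precision and recall for one query.

    precision = relevant retrieved documents / all retrieved documents
    recall    = relevant retrieved documents / all relevant documents
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical collection: 10 relevant documents, 4 retrieved, 2 of them relevant.
relevant = {f"doc{i}" for i in range(1, 11)}
retrieved = {"doc1", "doc2", "doc42", "doc43"}
precision, recall = precision_recall(retrieved, relevant)
```

In this example half of the retrieved documents are relevant (precision 0.5), yet only 2 of the 10 relevant documents were found (recall 0.2), which shows how a search can look accurate while missing most of the relevant material.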


The Integrated Authority File (German: Gemeinsame Normdatei or GND) is an international authority file used and maintained by the German National Library (German: Deutsche Nationalbibliothek or DNB), all German-language library associations, the Zeitschriftendatenbank (ZDB) and many other institutions. WebGND is an online application that supports navigation and search within this large database which consists of more than 11 million records covering personal names, corporate names, meeting names, geographic names, topical terms and uniform work titles.

