"Forschungsprimärdaten bilden einen wertvollen Fundus an Informationen, die mit hohem finanziellem Aufwand erhoben werden. Je nach Fachgebiet und Methode sind sie replizierbar oder basieren auf nicht wiederholbaren Beobachtungen oder Messungen. In jedem Fall sollten die erhobenen Daten nach Abschluss der Forschungen öffentlich zugänglich und frei verfügbar sein."
Aus: Empfehlungen zur gesicherten Aufbewahrung und Bereitstellung digitaler Forschungsprimärdaten, DFG: Ausschuss für Wissenschaftliche Bibliotheken und Informationssysteme, Unterausschuss für Informationsmanagement, S. 2, Januar 2009.

Welcome to the Center for Sustainability of Linguistic Data (NaLiDa)

Linguistic resources such as corpora, lexicons, grammars, experimental data or computer programs are important in linguistic research. The creation of such data is often a very complicated process, and in the long run data can be lost or can no longer be worked with. The NaLiDa project at the Department of Linguistics at the University of Tübingen deals with developing an infrastructure for long-term-sustainability of linguistic data, so that such data will be accessible for conducting research in the future.

The following issues are the main aspects within the project:

  • Collection of resources
  • Data acquisition
  • Providing access to the data
  • Legal and ethic aspects of long-term-archiving
  • Standards for linguistic resources
  • Consulting for all aspects of working with digital texts
  • Investigating the migration of research data from the domain-specific repository to a generic repository; moving data from the Linguistics Department, to the infra-structural unit of the university (within the 2nd. funding phase)


Researchers are provided with assistance for creating descriptions of their resources (metadata) and archiving their data. This way researchers can focus on the relevant data to their research.


The archiving and publication of primary research data also provides easy access to the data. The catalogs here are based on metadata which is publicly accessible.


Presenting linguistic resources to the public via an open catalog, the description of the current processes for creating and archiving data as well as the design of up-to-date information associated to archives and resource documentation represent the main functions of a language resource portal.