📊 Projekt

Laudatio Repository

Humboldt-Universität zu Berlin

Laudatio Repository

Institution: Humboldt-Universität zu Berlin Category: Project
Website: https://www.laudatio-repository.org/

Short Description

The service provides access to metadata of linguistic corpora, documents, and annotations via a REST API. Target users are researchers and academic staff at universities working with structured text data. The main benefit lies in standardized, programmatic querying and analysis of corpus content, particularly for computer-assisted language research. The API enables integration into research workflows and the use of corpus data in academic projects.

General Description

-


Thematic Classification

Subject Areas

Humanities
Computer Science
Linguistics
Theology
History

Research Fields

  • Linguistics
  • Didactics
  • Theater
  • History
  • Theology

Specializations

  • Digital Humanities
  • Corpus Linguistics
  • Historical Linguistics
  • Text Annotation
  • Metadata Management
  • Open Access
  • Research Data Infrastructure
  • Linguistic Research Theater
  • Digital Edition
  • Text and Language Analysis
  • Multilayer Analysis
  • Semantic Web
  • REST-API for Linguistic Data
  • Text Movement: Theater and Language (project reference)

Keywords

  • Laudatio repository API - REST API - Corpus data - Document metadata - Annotation metadata - ElasticSearch integration - JSON response format - Search and retrieval - Metadata indexing - Linguistic corpora

Funding

Funding Provider: -
Funding Program: -
Funding Reference: Textbewegung: Theater und Sprache
Funding Period: 2013
Project Volume: -


Team & Partners

Project Leadership

  • Prof. Dr. Maik Walter (Humboldt-Universität zu Berlin)

Involved Persons

  • Carolin Odebrecht (Infrastructure)
  • Maik Walter (Editor)
  • Stefanie Dipper (Editor)
  • Simone Schultz-Balluff (Editor)
  • Maria Anselm (Annotator)
  • Katharina Bort (Annotator)
  • Malin Frey (Annotator)
  • Sarah Klein (Annotator)
  • Julia Krasselt (Annotator)
  • Nadine Lordick (Annotator)
  • Sarah Malke (Annotator)
  • Julika Nelken (Annotator)
  • Maurice Spengler (Annotator)
  • Helena Wedig (Annotator)

Affiliated Institutions

-

External Partners

Keine externen Partner genannt


Project Contents

Goals

  • Provision of a publicly accessible repository for linguistically annotated text corpora
  • Provision of a REST-API for programming access to corpora, documents, and annotations
  • Support for research in the fields of linguistics, history, and theology through high-quality, annotated text data
  • Promotion of scientific collaboration through standardized and documented data formats
  • Ensuring long-term archiving and reproducibility of research data

Work Packages

  • WP1: Corpus and document management
  • WP2: Annotation and data structuring
  • WP3: Search and analysis functions
  • WP4: User interface and visualization
  • WP5: Documentation and open-access publication

Methods

  • Tokenization of the transcription 'text'
  • Conversion of the annotation level 'tok' from treetaggeroutput to relANNIS via SaltNPepper
  • Manual annotation of the text columns
  • Import of the columns into CorA
  • Conversion of CoraXML to Annis via Pepper
  • Manual annotation
  • Automatic annotation
  • Collation and inspection
  • Transcription
  • Import
  • Conversion

Expected Outcomes

  • Creates and publishes a digital corpus of 201 children's and household fairy tales as well as 10 children's legends by the Brothers Grimm
  • Includes the final edition of the Brothers Grimm from 1857
  • Compiles and prepares for the advanced seminar "Dramapädagogik des Märchens: Linguistik, Didaktik und Theater" at the University of Tübingen
  • Contains transcriptions, tokenization, POS tagging, lemmatization, and meta-information
  • Uses Wikisource edition guidelines for text preparation
  • Published under the Creative Commons Attribution 3.0 Unported License
  • Contains 211 documents (fairy tales and legends) with a total of 295,880 tokens
  • Made accessible via a REST API enabling search, querying, and browsing of corpus, documents, and annotations
  • Part of the Open-Source project "Textbewegung: Theater und Sprache" at Humboldt-Universität zu Berlin

Contact

Contact Person: -
Email: -
Project Website: https://www.laudatio-repository.org/


Recorded: 2026-01-14
Source: https://www.laudatio-repository.org/

Visit Website