This portal provides access to various Text & Speech processing software services developed by the Centre of Language and Speech Technology or the Humanities Lab of Radboud University Nijmegen.

These tools are accessible for free but do require authentication. They participate in the CLARIAH infrastructure which means you can authenticate using your own institutional login, provided your institution has ties with CLARIN. If your institute is not in the list you can request a CLARIN account. For general questions or comments contact Henk van den Heuvel. For technical issues, contact admin@cls.ru.nl, here you can also request an API key for automated access if you explain your use-case.

  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

Alpino Webservice 2.4

  •   Rijksuniversiteit Groningen (backend), Radboud Universiteit Nijmegen (webservice)
  •   KNAW Humanities Cluster & CLST, Radboud University
Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • dependency parsing
  • folia
  • linguistics
  • nlp
  • syntax
Created: 2015-09-08
Modified: 2023-11-01
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

Automatic Speech Recognition Service 0.3

An Automatic Speech Recognition Service for a variety of languages, powered by WhisperX [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • clam webservice rest nlp computational_linguistics rest
Created: 2024-02-16
Modified: 2024-04-12
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

Automatic Transcription of Dutch Speech Recordings 0.6.1

  •   Centre for Language and Speech Technology, Radboud University
This webservice uses automatic speech recognition to provide the transcriptions of recordings spoken in Dutch. You can upload and process only one file per project. For bulk processing and other questions, please contact Henk van den Heuvel at h.vandenheuvel@let.ru.nl. [view more]
  • Software for humanities
  • Speech Recognizing
  • dutch
  • nlp
  • speech recognition
Created: 2017-04-02
  • Active: The project has reached a stable, usable state and is being actively developed.

RU-Cesar unknown

  •   Erwin Komen
Django web application that communicates with the CorpusStudioWeb back-end 'Crpp'. Two main purposes: (1) browse texts, (2) conduct syntactic searches with definable output per hit. Searches are translated to Xquery 'under the hood' [view more]
  • syntax
  • xquery
Created: 2018
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.

Speaker Diarisation Service 0.1

A speaker diarisation service powered by PyAnnote [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • clam webservice rest nlp computational_linguistics rest
Created: 2024-04-15
Modified: 2024-04-15
  • Active: The project has reached a stable, usable state and is being actively developed.

e-WALD unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the regions 'Achterhoek' and 'Liemers' [view more]
  • dialect
  • dictionary
  • dutch
Created: 2019
  • Active: The project has reached a stable, usable state and is being actively developed.

e-WBD unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of dialects from the Dutch province 'Noord-Brabant' as well as the Belgian provinces of Antwerpen, Vlaams-Brabant and Brussels [view more]
  • dialect
  • dictionary
  • dutch
Created: 2017
  • Active: The project has reached a stable, usable state and is being actively developed.

e-WGD unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the province 'Gelderland' [view more]
  • dialect
  • dictionary
  • dutch
Created: 2019
  • Active: The project has reached a stable, usable state and is being actively developed.

e-WLD unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of the Dutch Limburgian dialects [view more]
  • dialect
  • dictionary
  • dutch
Created: 2016
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

FLAT: the FoLiA Linguistic Annotation Tool 0.11.5

  •   KNAW Humanities Cluster & CLST, Radboud University
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm. [view more]
  • Text Processing > Linguistic
  • annotation
  • computational linguistics
  • folia
  • linguistics
  • nlp
Created: 2014-01-02
Modified: 2024-07-05
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

Frog Webservice 2.7

  •   Centre for Language and Speech Technology, Radboud University and KNAW Humanities Cluster
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • clam webservice rest nlp computational_linguistics rest
Created: 2022-02-17
Modified: 2023-12-05
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

Grapheme to Phoneme converter 0.3.4

Grapheme to Phoneme (G2P) conversion. Input is a list of words (utf-8, one word per line). The G2P will output the best guess for the phonetic transcription per word. The system is trained on existing dictionaries. Please choose a language option. The system is a demo-version --- please refer to CLST for using G2P for long word lists. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • speech
  • transcription
Created: 2019-02-25
Modified: 2023-05-12
  • https://w3id.org/research-technology-readiness-level#Level8Complete
    Warning: Status is not expressed in a known vocabulary
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

Glem 1.3.1

  •   Faculty of Philosophy, Theology and Religious Studies and Centre for Language and Speech Technology, Radboud University Nijmegen
GLEM is a lemmatizer for Ancient Greek. [view more]
  • Annotating
  • Computational linguistics and philology
  • Greek and Latin philology and literature
  • ancient greek
  • greek
  • lemma
  • lemmatisation
  • natural language processing
  • nlp
Created: 2017-04-09
Modified: 2023-10-05
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

Piereling 0.4

  •   Centre for Language and Speech Technology, Radboud University
  •   KNAW Humanities Cluster & CLST, Radboud University
Piereling can convert a wide variety of document formats to FoLiA XML, and from FoLiA XML to various formats. Data conversions such as these provide the groundwork for Natural Language Processing pipelines. It relies on numerous specialised conversion tools in combination with notable third-party tools such as pandoc. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • webservice nlp computational_linguistics rest folia conversion
Created: 2019-10-18
Modified: 2023-11-01
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

Ucto Webservice 2.5.2

  •   Centre for Language and Speech Technology, Radboud University and KNAW Humanities Cluster
  •   KNAW Humanities Cluster & CLST, Radboud University
Ucto is a unicode-compliant tokeniser. It takes input in the form of one or more untokenised texts, and subsequently tokenises them. Several languages are supported, but the software is extensible to other languages. [view more]
  • Annotating
  • Linguistics
  • Tagging
  • Textual and content analysis
  • clam webservice rest nlp computational_linguistics rest
Created: 2022-04-08
Modified: 2024-03-14