This portal provides access to various Text & Speech processing software services developed by the Centre of Language and Speech Technology or the Humanities Lab of Radboud University Nijmegen.

These tools are accessible for free but do require authentication. They participate in the CLARIAH infrastructure which means you can authenticate using your own institutional login, provided your institution has ties with CLARIN. If your institute is not in the list you can request a CLARIN account. For general questions or comments contact Henk van den Heuvel. For technical issues, contact admin@cls.ru.nl, here you can also request an API key for automated access if you explain your use-case.

  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

Alpino-Webservice 2.4

  •   KNAW Humanities Cluster & CLST, Radboud University
Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. This is the webservice for it. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • dependency parsing
  • folia
  • linguistics
  • nlp
  • syntax
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2015-09-08
Modified: 2023-11-01
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 9 - Proven: Technology complete and proven in practice by real users.

Automatic Speech Recognition for Dutch 0.6.2

This is a web-based automatic speech recogniser for Dutch, capable of transcribing dutch speech recordings using multiple models. [view more]
  • Software for humanities
  • Speech Recognizing
  • dutch
  • nlp
  • speech recognition
  • Linux
Created: 2017-04-02
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Corpus Editor for Syntactically Annotated Resources (Cesar) unknown

  •   Erwin Komen
Django web application that communicates with the CorpusStudioWeb back-end 'Crpp'. Two main purposes: (1) browse texts, (2) conduct syntactic searches with definable output per hit. Searches are translated to Xquery 'under the hood' [view more]
  • syntax
  • xquery
  • Posix
Created: 2018
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Electronisch woordenboek van de Achterhoekse en Liemerse dialecten unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the regions 'Achterhoek' and 'Liemers' [view more]
  • dialect
  • dictionary
  • dutch
  • Posix
Created: 2019
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Electronisch woordenboek van de Gelderse dialecten unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of dialects from the Dutch province 'Noord-Brabant' as well as the Belgian provinces of Antwerpen, Vlaams-Brabant and Brussels [view more]
  • dialect
  • dictionary
  • dutch
  • Posix
Created: 2017
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Electronisch woordenboek van de Gelderse dialecten unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the province 'Gelderland' [view more]
  • dialect
  • dictionary
  • dutch
  • Posix
Created: 2019
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Electronisch woordenboek van de Limburgse dialecten unknown

  •   Erwin Komen
Django web application that facilities viewing and searching a dictionary of the Dutch Limburgian dialects [view more]
  • dialect
  • dictionary
  • dutch
  • Posix
Created: 2016
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 9 - Proven: Technology complete and proven in practice by real users.

English Automatic Speech Recognition 0.2.3

This webservice uses the English automatic speech recognition system developed at the University of Twente. [view more]
  • Software for humanities
  • Speech Recognizing
  • english
  • nlp
  • speech recognition
  • Linux
Created: 2018-05-17
  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

FoLiA-Linguistic-Annotation-Tool 0.11.2

  •   KNAW Humanities Cluster & CLST, Radboud University
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm. [view more]
  • Text Processing > Linguistic
  • annotation
  • computational linguistics
  • folia
  • linguistics
  • nlp
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2014-01-02
Modified: 2023-09-14
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.

Forced Alignment 2 0.3.1

This webservice provides an output file with word alignments given an NL speech recording and a transcription. [view more]
  • alignment
  • speech recognition
  • Linux
Created: 2020-03
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

Frog-Webservice 2.6

Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. This is the webservice for it, for both humans and machines. [view more]
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
  • clam webservice rest nlp computational_linguistics rest
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2022-02-17
Modified: 2023-10-31
  • Web Application
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

g2pservice 0.3.4

Grapheme to Phoneme converter. Input is a list of words (utf8). Choose one of the language options. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • speech
  • transcription
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2019-02-25
Modified: 2023-05-12
  • Command-line Application
  • Web Application
  • https://w3id.org/research-technology-readiness-level#Level8Complete
    Warning: Status is not expressed in a known vocabulary
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

Glem 1.3.1

  •   Corien Bary
  •   Peter Berck
  •   Iris Hendrickx
  •   Wessel Stoop
GLEM is a lemmatizer for Ancient Greek. [view more]
  • Annotating
  • Computational linguistics and philology
  • Greek and Latin philology and literature
  • ancient greek
  • greek
  • lemma
  • lemmatisation
  • natural language processing
  • nlp
  • Posix
Created: 2017-04-09
Modified: 2023-10-05
  • Web Application
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

piereling 0.4

  •   KNAW Humanities Cluster & CLST, Radboud University
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • webservice nlp computational_linguistics rest folia conversion
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2019-10-18
Modified: 2023-11-01
  • Web Application
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

Ucto-Webservice 2.5

  •   KNAW Humanities Cluster & CLST, Radboud University
Ucto is a rule-based tokeniser for multiple languages. This is the webservice for it, for both humans and machines. [view more]
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
  • clam webservice rest nlp computational_linguistics rest
  • Bsd
  • Linux
  • Macos
  • Python
Created: 2022-04-08
Modified: 2023-11-01
  • inactive
    Warning: Status is not expressed in a known vocabulary

You are what you tweet unknown

Op basis van je Twittergedrag valt behoorlijk veel van je af te leiden. In deze demo willen we je laten zien wat onze technologie over jou denkt te weten [view more]
  • demo
  • dutch
  • nlp
  • twitter
  • Posix
Created: 2015-07-26