This portal provides access to various Text & Speech processing software services developed by the Centre of Language and Speech Technology or the Humanities Lab of Radboud University Nijmegen.

These tools are accessible for free but do require authentication. They participate in the CLARIAH infrastructure which means you can authenticate using your own institutional login, provided your institution has ties with CLARIN. If your institute is not in the list you can request a CLARIN account. For general questions or comments contact Henk van den Heuvel. For technical issues, contact admin@cls.ru.nl, here you can also request an API key for automated access if you explain your use-case.

Name Version Interface type Description Links Status Maintainer Authors Producer/Provider
Alpino-Webservice 2.4 2023-11-01 11:56:10 +0100
  • Web Application
Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. This is the webservice for it. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document. [view more]
Category:
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords:
  • dependency parsing
  • folia
  • linguistics
  • nlp
  • syntax
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ★ ★ ☆ ☆
  •   KNAW Humanities Cluster & CLST, Radboud University
Alpino Webservice 2.4
  • Web Application
Alpino is a dependency parser for Dutch, developed in the context of the PIONIER Project Algorithms for Linguistic Processing, developed by Gertjan van Noord at the University of Groningen. You can upload either tokenised or untokenised files (which will be automatically tokenised for you using ucto), the output will consist of a zip file containing XML files, one for each sentence in the input document.
asrservice 0.3 2024-04-12 10:39:45 +0200
  • Web Application
An Automatic Speech Recognition Service for a variety of languages, powered by WhisperX [view more]
Category:
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords:
  • clam webservice rest nlp computational_linguistics rest
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.

★ ★ ★ ☆ ☆
Automatic Speech Recognition Service 0.3
  • Web Application
asrservice
  • Unknown
Automatic Speech Recognition for Dutch 0.6.2
  • Web Application
This is a web-based automatic speech recogniser for Dutch, capable of transcribing dutch speech recordings using multiple models. [view more]
Category:
  • Software for humanities
  • Speech Recognizing
Keywords:
  • dutch
  • nlp
  • speech recognition
  • 9 - Proven: Technology complete and proven in practice by real users.
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ★ ★ ☆ ☆
Automatic Transcription of Dutch Speech Recordings 0.6.1
  • Web Application
This webservice uses automatic speech recognition to provide the transcriptions of recordings spoken in Dutch. You can upload and process only one file per project. For bulk processing and other questions, please contact Henk van den Heuvel at h.vandenheuvel@let.ru.nl.
Automatic Speech Recognition for Dutch
  • Unknown
Corpus Editor for Syntactically Annotated Resources (Cesar) unknown
  • Web Application
Django web application that communicates with the CorpusStudioWeb back-end 'Crpp'. Two main purposes: (1) browse texts, (2) conduct syntactic searches with definable output per hit. Searches are translated to Xquery 'under the hood' [view more]
Keywords:
  • syntax
  • xquery
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ☆ ☆ ☆ ☆
  •   Erwin Komen
  •   Erwin Komen
RU-Cesar
  • Web Application
diarisationservice 0.1 2024-04-15 14:24:17 +0200
  • Web Application
Speaker diarisation service, powered by PyAnnote [view more]
Category:
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords:
  • clam webservice rest nlp computational_linguistics rest
  • WIP: Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.
  • Experimental: The technology is implemented and ready for experimental settings (beta), but requires further work and validation.

★ ★ ★ ☆ ☆
Speaker Diarisation Service 0.1
  • Web Application
A speaker diarisation service powered by PyAnnote
diarisationservice
  • Unknown
Electronisch woordenboek van de Achterhoekse en Liemerse dialecten unknown
  • Web Application
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the regions 'Achterhoek' and 'Liemers' [view more]
Keywords:
  • dialect
  • dictionary
  • dutch
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ☆ ☆ ☆ ☆
  •   Erwin Komen
  •   Erwin Komen
e-WALD
  • Web Application
Electronisch woordenboek van de Gelderse dialecten unknown
  • Web Application
Django web application that facilities viewing and searching a dictionary of Dutch dialects from the province 'Gelderland' [view more]
Keywords:
  • dialect
  • dictionary
  • dutch
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ☆ ☆ ☆ ☆
  •   Erwin Komen
  •   Erwin Komen
e-WGD
  • Web Application
Electronisch woordenboek van de Gelderse dialecten unknown
  • Web Application
Django web application that facilities viewing and searching a dictionary of dialects from the Dutch province 'Noord-Brabant' as well as the Belgian provinces of Antwerpen, Vlaams-Brabant and Brussels [view more]
Keywords:
  • dialect
  • dictionary
  • dutch
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ☆ ☆ ☆ ☆
  •   Erwin Komen
  •   Erwin Komen
e-WBD
  • Web Application
Electronisch woordenboek van de Limburgse dialecten unknown
  • Web Application
Django web application that facilities viewing and searching a dictionary of the Dutch Limburgian dialects [view more]
Keywords:
  • dialect
  • dictionary
  • dutch
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ☆ ☆ ☆ ☆
  •   Erwin Komen
  •   Erwin Komen
e-WLD
  • Web Application
FoLiA-Linguistic-Annotation-Tool 0.11.5 2024-07-05 13:27:34 +0200
  • Web Application
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm. [view more]
Category:
  • Text Processing > Linguistic
Keywords:
  • annotation
  • computational linguistics
  • folia
  • linguistics
  • nlp
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ★ ★ ☆ ☆
  •   KNAW Humanities Cluster & CLST, Radboud University
FLAT: the FoLiA Linguistic Annotation Tool
  • Web Application
FoLiA-Linguistic-Annotation-Tool
  • Unknown
Forced Alignment 2 0.3.1
  • Web Application
This webservice provides an output file with word alignments given an NL speech recording and a transcription. [view more]
Keywords:
  • alignment
  • speech recognition
  • Active: The project has reached a stable, usable state and is being actively developed.

★ ★ ★ ☆ ☆
ForcedAlignment2 0.3.1
  • Web Application
Forced Alignment of text and audio files
Forced Alignment 2
  • Unknown
Frog-Webservice 2.7 2023-12-05 16:06:08 +0100
  • Web Application
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. This is the webservice for it, for both humans and machines. [view more]
Category:
  • Annotating
  • Contextualizing
  • Linguistics
  • Named Entity Recognition
  • POS-Tagging
  • Segmenting
  • Tagging
  • Textual and content analysis
  • Tree-Tagging
Keywords:
  • clam webservice rest nlp computational_linguistics rest
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

★ ★ ★ ☆ ☆
Frog Webservice 2.7
  • Web Application
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch.
g2pservice 0.3.4 2023-05-12 13:09:12 +0200
  • Web Application
Grapheme to Phoneme converter. Input is a list of words (utf8). Choose one of the language options. [view more]
Category:
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords:
  • speech
  • transcription
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

★ ★ ★ ☆ ☆
Grapheme to Phoneme converter 0.3.4
  • Web Application
Grapheme to Phoneme (G2P) conversion. Input is a list of words (utf-8, one word per line). The G2P will output the best guess for the phonetic transcription per word. The system is trained on existing dictionaries. Please choose a language option. The system is a demo-version --- please refer to CLST for using G2P for long word lists.
g2pservice
  • Unknown
Glem 1.3.1 2023-10-05 14:28:06 +0200
  • Command-line Application
  • Web Application
GLEM is a lemmatizer for Ancient Greek. [view more]
Category:
  • Annotating
  • Computational linguistics and philology
  • Greek and Latin philology and literature
Keywords:
  • ancient greek
  • greek
  • lemma
  • lemmatisation
  • natural language processing
  • nlp
  • https://w3id.org/research-technology-readiness-level#Level8Complete
    Warning: Status is not expressed in a known vocabulary
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

★ ★ ★ ☆ ☆
  •   Corien Bary
  •   Peter Berck
  •   Iris Hendrickx
  •   Wessel Stoop
Glem 1.3.1
  • Web Application
glem
  • Command-line Application
Command-line interface to GLEM
piereling 0.4 2023-11-01 11:43:34 +0100
  • Web Application
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines. [view more]
Category:
  • Internet > WWW/HTTP > WSGI > Application
  • Text Processing > Linguistic
Keywords:
  • webservice nlp computational_linguistics rest folia conversion
  • Inactive: The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

★ ★ ★ ☆ ☆
  •   KNAW Humanities Cluster & CLST, Radboud University
Piereling 0.4
  • Web Application
Piereling can convert a wide variety of document formats to FoLiA XML, and from FoLiA XML to various formats. Data conversions such as these provide the groundwork for Natural Language Processing pipelines. It relies on numerous specialised conversion tools in combination with notable third-party tools such as pandoc.
piereling
  • Unknown
Ucto-Webservice 2.5.2 2024-03-14 21:54:52 +0100
  • Web Application
Ucto is a rule-based tokeniser for multiple languages. This is the webservice for it, for both humans and machines. [view more]
Category:
  • Annotating
  • Linguistics
  • Tagging
  • Textual and content analysis
Keywords:
  • clam webservice rest nlp computational_linguistics rest
  • Active: The project has reached a stable, usable state and is being actively developed.
  • 8 - Complete: Technology complete and qualified, released for all end-users in scholarly environments.

★ ★ ★ ☆ ☆
  •   KNAW Humanities Cluster & CLST, Radboud University
Ucto Webservice 2.5.2
  • Web Application
Ucto is a unicode-compliant tokeniser. It takes input in the form of one or more untokenised texts, and subsequently tokenises them. Several languages are supported, but the software is extensible to other languages.
You are what you tweet unknown
  • Unknown
Op basis van je Twittergedrag valt behoorlijk veel van je af te leiden. In deze demo willen we je laten zien wat onze technologie over jou denkt te weten [view more]
Keywords:
  • demo
  • dutch
  • nlp
  • twitter
  • inactive
    Warning: Status is not expressed in a known vocabulary

★ ☆ ☆ ☆ ☆
You are what you tweet
  • Unknown