This portal provides access to various Text & Speech processing software services developed by the Centre for Language and Speech Technology (CLST) or the Humanities Lab of Radboud University Nijmegen.
These tools are free to use but require authentication. They participate in the CLARIAH infrastructure, which means you can authenticate using your own institutional login, provided your institution has ties with CLARIN. If your institute is not in the list, you can request a CLARIN account. For general questions or comments, contact Henk van den Heuvel. For technical issues, contact admin@cls.ru.nl; at the same address you can also request an API key for automated access if you explain your use case.
Alpino Webservice 2.4
- Rijksuniversiteit Groningen (backend), Radboud Universiteit Nijmegen (webservice)
- KNAW Humanities Cluster & CLST, Radboud University
Alpino is a dependency parser for Dutch, developed by Gertjan van Noord at the University of Groningen in the context of the PIONIER project Algorithms for Linguistic Processing. You can upload either tokenised or untokenised files (untokenised files are tokenised automatically using ucto). The output is a zip file containing XML files, one for each sentence in the input document.
- Internet > WWW/HTTP > WSGI > Application
- Text Processing > Linguistic
- dependency parsing
- folia
- linguistics
- nlp
- syntax
Created: 2015-09-08
Modified: 2023-11-01
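As a rough illustration of handling the Alpino output described above (a zip file with one XML file per sentence), the following Python sketch reads every XML member of such an archive. It assumes the archive has already been downloaded. The function names `read_parses` and `make_demo_zip`, the member names `1.xml` and `2.xml`, and the `<parse>`/`<sentence>` elements are placeholders made up for this example; they do not describe the actual Alpino output schema.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def read_parses(zip_bytes):
    """Return {member name: parsed XML root} for every .xml file in the archive."""
    parses = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in sorted(archive.namelist()):
            if name.lower().endswith(".xml"):
                parses[name] = ET.fromstring(archive.read(name))
    return parses


def make_demo_zip():
    """Build a tiny stand-in archive; file names and XML content are made up."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("1.xml", "<parse><sentence>Dit is een zin .</sentence></parse>")
        archive.writestr("2.xml", "<parse><sentence>Dit ook .</sentence></parse>")
    return buffer.getvalue()


if __name__ == "__main__":
    # Replace make_demo_zip() with the bytes of a real downloaded archive.
    for name, root in read_parses(make_demo_zip()).items():
        print(name, root.findtext("sentence"))
```

For a real archive, read the downloaded file with `open(path, "rb").read()` and inspect the element names in one of the XML files before relying on any particular structure.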
Automatic Speech Recognition Service 0.3
An Automatic Speech Recognition Service for a variety of languages, powered by WhisperX.
- Internet > WWW/HTTP > WSGI > Application
- Text Processing > Linguistic
- clam webservice rest nlp computational_linguistics rest
Created: 2024-02-16
Modified: 2024-04-12
Automatic Transcription of Dutch Speech Recordings 0.6.1
- Centre for Language and Speech Technology, Radboud University
This webservice uses automatic speech recognition to transcribe recordings of spoken Dutch. You can upload and process only one file per project. For bulk processing and other questions, please contact Henk van den Heuvel at h.vandenheuvel@let.ru.nl.
- Software for humanities
- Speech Recognizing
- dutch
- nlp
- speech recognition
Created: 2017-04-02
Speaker Diarisation Service 0.1
A speaker diarisation service powered by PyAnnote.
- Internet > WWW/HTTP > WSGI > Application
- Text Processing > Linguistic
- clam webservice rest nlp computational_linguistics rest
Created: 2024-04-15
Modified: 2024-04-15
FLAT: the FoLiA Linguistic Annotation Tool 0.11.5
- KNAW Humanities Cluster & CLST, Radboud University
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. FLAT allows users to view annotated FoLiA documents and enrich them with new annotations. A wide variety of linguistic annotation types is supported through the FoLiA paradigm.
- Text Processing > Linguistic
- annotation
- computational linguistics
- folia
- linguistics
- nlp
Created: 2014-01-02
Modified: 2024-07-05
ForcedAlignment2 0.3.1
Forced alignment of text and audio files.
- alignment
- speech recognition
Created: 2020-03
Frog Webservice 2.7
- Centre for Language and Speech Technology, Radboud University and KNAW Humanities Cluster
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch.
- Annotating
- Contextualizing
- Linguistics
- Named Entity Recognition
- POS-Tagging
- Segmenting
- Tagging
- Textual and content analysis
- Tree-Tagging
- clam webservice rest nlp computational_linguistics rest
Created: 2022-02-17
Modified: 2023-12-05
Grapheme to Phoneme converter 0.3.4
Grapheme to Phoneme (G2P) conversion. The input is a list of words (UTF-8, one word per line). For each word, the G2P outputs its best guess for the phonetic transcription. The system is trained on existing dictionaries. Please choose a language option. The system is a demo version; please contact CLST if you want to use G2P for long word lists.
- Internet > WWW/HTTP > WSGI > Application
- Text Processing > Linguistic
- speech
- transcription
Created: 2019-02-25
Modified: 2023-05-12
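The G2P input format described above (UTF-8, one word per line) can be prepared with a few lines of Python. This is a minimal sketch: the helper name `write_word_list`, the file name `wordlist.txt`, and the sample words are chosen for illustration only, and skipping blank entries is an assumption rather than a documented requirement of the service.

```python
from pathlib import Path


def write_word_list(words, path):
    """Write the words to `path` as UTF-8, one word per line, skipping blanks."""
    cleaned = [word.strip() for word in words if word.strip()]
    # Write bytes explicitly so line endings stay "\n" on every platform.
    Path(path).write_bytes(("\n".join(cleaned) + "\n").encode("utf-8"))
    return cleaned


if __name__ == "__main__":
    # Sample input with stray whitespace and an empty entry.
    words = ["fiets", "  schaap ", "", "café"]
    print(write_word_list(words, "wordlist.txt"))
```

The resulting file can then be uploaded as the input word list.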
Glem 1.3.1
- Faculty of Philosophy, Theology and Religious Studies and Centre for Language and Speech Technology, Radboud University Nijmegen
GLEM is a lemmatizer for Ancient Greek.
- Annotating
- Computational linguistics and philology
- Greek and Latin philology and literature
- ancient greek
- greek
- lemma
- lemmatisation
- natural language processing
- nlp
Created: 2017-04-09
Modified: 2023-10-05
Piereling 0.4
- Centre for Language and Speech Technology, Radboud University
- KNAW Humanities Cluster & CLST, Radboud University
Piereling can convert a wide variety of document formats to FoLiA XML, and from FoLiA XML to various formats. Data conversions such as these provide the groundwork for Natural Language Processing pipelines. Piereling relies on numerous specialised conversion tools in combination with notable third-party tools such as pandoc.
- Internet > WWW/HTTP > WSGI > Application
- Text Processing > Linguistic
- webservice nlp computational_linguistics rest folia conversion
Created: 2019-10-18
Modified: 2023-11-01
Ucto Webservice 2.5.2
- Centre for Language and Speech Technology, Radboud University and KNAW Humanities Cluster
- KNAW Humanities Cluster & CLST, Radboud University
Ucto is a Unicode-compliant tokeniser. It takes one or more untokenised texts as input and tokenises them. Several languages are supported, and the software can be extended to other languages.
- Annotating
- Linguistics
- Tagging
- Textual and content analysis
- clam webservice rest nlp computational_linguistics rest
Created: 2022-04-08
Modified: 2024-03-14