This portal provides access to various text and speech processing software services developed by the Centre for Language and Speech Technology or the Humanities Lab of Radboud University Nijmegen.
These tools are free to use but require authentication. They participate in the CLARIAH infrastructure, which means you can authenticate with your own institutional login, provided your institution has ties with CLARIN. If your institute is not in the list, you can request a CLARIN account. For general questions or comments, contact Henk van den Heuvel. For technical issues, contact admin@cls.ru.nl; at that address you can also request an API key for automated access if you explain your use case.
Name | Version | Interface type | Description | Links | Status | Maintainer | Authors | Producer/Provider |
---|---|---|---|---|---|---|---|---|
Alpino-Webservice | 2.4 2023-11-01 11:56:10 +0100 |
|
Alpino is a dependency parser for Dutch, developed by Gertjan van Noord at the University of Groningen in the context of the PIONIER project Algorithms for Linguistic Processing. This is the webservice for it. You can upload either tokenised or untokenised files (untokenised files are tokenised automatically with ucto). The output is a zip file containing XML files, one for each sentence in the input document.
|
|
|
|||
Alpino Webservice | 2.4 |
|
Alpino is a dependency parser for Dutch, developed by Gertjan van Noord at the University of Groningen in the context of the PIONIER project Algorithms for Linguistic Processing. You can upload either tokenised or untokenised files (untokenised files are tokenised automatically with ucto). The output is a zip file containing XML files, one for each sentence in the input document. |
|
|
|||
asrservice | 0.3 2024-04-12 10:39:45 +0200 |
|
An Automatic Speech Recognition Service for a variety of languages, powered by WhisperX
|
|
||||
Automatic Speech Recognition Service | 0.3 |
|
|
|
||||
asrservice |
|
|
|
|
|
|||
Automatic Speech Recognition for Dutch | 0.6.2 |
|
This is a web-based automatic speech recogniser for Dutch, capable of transcribing Dutch speech recordings using multiple models.
|
|
||||
Automatic Transcription of Dutch Speech Recordings | 0.6.1 |
|
This webservice uses automatic speech recognition to provide the transcriptions of recordings spoken in Dutch. You can upload and process only one file per project. For bulk processing and other questions, please contact Henk van den Heuvel at h.vandenheuvel@let.ru.nl. |
|
|
|||
Automatic Speech Recognition for Dutch |
|
|
|
|
|
|||
Corpus Editor for Syntactically Annotated Resources (Cesar) | unknown |
|
Django web application that communicates with the CorpusStudioWeb back-end 'Crpp'. It has two main purposes: (1) browsing texts and (2) conducting syntactic searches with definable output per hit. Searches are translated to XQuery 'under the hood'.
|
|
|
|
||
RU-Cesar |
|
|
|
|
||||
diarisationservice | 0.1 2024-04-15 14:24:17 +0200 |
|
Speaker diarisation service, powered by PyAnnote
|
|
||||
Speaker Diarisation Service | 0.1 |
|
A speaker diarisation service powered by PyAnnote |
|
|
|||
diarisationservice |
|
|
|
|
|
|||
Electronisch woordenboek van de Achterhoekse en Liemerse dialecten | unknown |
|
Django web application that facilitates viewing and searching a dictionary of Dutch dialects from the regions 'Achterhoek' and 'Liemers'.
|
|
|
|
||
e-WALD |
|
|
|
|
||||
Electronisch woordenboek van de Gelderse dialecten | unknown |
|
Django web application that facilitates viewing and searching a dictionary of Dutch dialects from the province 'Gelderland'.
|
|
|
|
||
e-WGD |
|
|
|
|
||||
Electronisch woordenboek van de Brabantse dialecten | unknown |
|
Django web application that facilitates viewing and searching a dictionary of dialects from the Dutch province 'Noord-Brabant' as well as the Belgian provinces of Antwerpen, Vlaams-Brabant and Brussels.
|
|
|
|
||
e-WBD |
|
|
|
|
||||
Electronisch woordenboek van de Limburgse dialecten | unknown |
|
Django web application that facilitates viewing and searching a dictionary of the Dutch Limburgian dialects.
|
|
|
|
||
e-WLD |
|
|
|
|
||||
FoLiA-Linguistic-Annotation-Tool | 0.11.5 2024-07-05 13:27:34 +0200 |
|
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. FLAT allows users to view annotated FoLiA documents and enrich them with new annotations; a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
|
|
|
|||
FLAT: the FoLiA Linguistic Annotation Tool |
|
|
|
|||||
FoLiA-Linguistic-Annotation-Tool |
|
|
|
|
|
|||
Forced Alignment 2 | 0.3.1 |
|
Given a Dutch (NL) speech recording and its transcription, this webservice produces an output file with word alignments.
|
|
||||
ForcedAlignment2 | 0.3.1 |
|
Forced Alignment of text and audio files |
|
|
|||
Forced Alignment 2 |
|
|
|
|
|
|||
Frog-Webservice | 2.7 2023-12-05 16:06:08 +0100 |
|
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. This is the webservice for it, for both humans and machines.
|
|
||||
Frog Webservice | 2.7 |
|
Frog is a suite containing a tokeniser, Part-of-Speech tagger, lemmatiser, morphological analyser, shallow parser, and dependency parser for Dutch. |
|
|
|||
g2pservice | 0.3.4 2023-05-12 13:09:12 +0200 |
|
Grapheme to Phoneme converter. Input is a list of words (UTF-8). Choose one of the language options.
|
|
||||
Grapheme to Phoneme converter | 0.3.4 |
|
Grapheme to Phoneme (G2P) conversion. Input is a list of words (UTF-8, one word per line). For each word, the G2P outputs its best guess for the phonetic transcription. The system is trained on existing dictionaries. Please choose a language option. This system is a demo version; please contact CLST about using G2P for long word lists. |
|
|
|||
g2pservice |
|
|
|
|
|
|||
Glem | 1.3.1 2023-10-05 14:28:06 +0200 |
|
GLEM is a lemmatizer for Ancient Greek.
|
|
|
|||
Glem | 1.3.1 |
|
|
|
||||
glem |
|
Command-line interface to GLEM |
|
|
|
|
||
piereling | 0.4 2023-11-01 11:43:34 +0100 |
|
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines.
|
|
|
|||
Piereling | 0.4 |
|
Piereling can convert a wide variety of document formats to FoLiA XML, and from FoLiA XML to various formats. Data conversions such as these provide the groundwork for Natural Language Processing pipelines. It relies on numerous specialised conversion tools in combination with notable third-party tools such as pandoc. |
|
|
|||
piereling |
|
|
|
|
|
|||
Ucto-Webservice | 2.5.2 2024-03-14 21:54:52 +0100 |
|
Ucto is a rule-based tokeniser for multiple languages. This is the webservice for it, for both humans and machines.
|
|
|
|||
Ucto Webservice | 2.5.2 |
|
Ucto is a Unicode-compliant tokeniser. It takes one or more untokenised texts as input and tokenises them. Several languages are supported, and the software is extensible to other languages. |
|
|
|||
You are what you tweet | unknown |
|
Quite a lot can be inferred about you from your Twitter behaviour. In this demo we want to show you what our technology thinks it knows about you.
|
|
||||
You are what you tweet |
|
|
|
|
|
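
Two of the descriptions above state concrete file conventions: the Grapheme to Phoneme converter expects a UTF-8 word list with one word per line, and the Alpino webservice returns a zip file containing one XML file per sentence. The sketch below shows one way these files might be prepared and read locally in Python. The function names `build_g2p_wordlist` and `read_parse_archive` are hypothetical and not part of any service listed here. Uploading to and downloading from the webservices is deliberately left out, since this page does not describe their API, and the layout of the XML files is not assumed beyond their being well-formed XML.

```python
import io
import xml.etree.ElementTree as ET
import zipfile


def build_g2p_wordlist(words):
    """Encode words as UTF-8 text with one word per line.

    Surrounding whitespace is stripped and empty entries are dropped.
    (Hypothetical helper; only the 'UTF-8, one word per line' convention
    comes from the G2P description.)
    """
    cleaned = [w.strip() for w in words if w.strip()]
    return "".join(w + "\n" for w in cleaned).encode("utf-8")


def read_parse_archive(zip_bytes):
    """Parse every .xml member of an in-memory zip archive.

    Returns a dict mapping member name to the parsed root element,
    in sorted name order. (Hypothetical helper; it assumes only that the
    archive holds well-formed XML files, one per sentence.)
    """
    parsed = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in sorted(archive.namelist()):
            if name.lower().endswith(".xml"):
                parsed[name] = ET.fromstring(archive.read(name))
    return parsed
```

A typical use would be to write the bytes from `build_g2p_wordlist` to a file before uploading it by hand, and to pass the bytes of a downloaded result archive to `read_parse_archive` to inspect each sentence's parse.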