eng_asr

English Automatic Speech Recognition System

This web service uses automatic speech recognition to transcribe recordings of spoken English. You can upload and process only one file per project. For bulk processing and other questions, please contact Henk van den Heuvel at h.vandenheuvel@let.ru.nl.
Type
  • Web Application
Version
0.2.1
Note: Version does not match latest source release (None), service may be out of date
Service Provider
      Centre for Language and Speech Technology, Radboud University
Input data
Name    Description   Type          Encoding Format
*.ogg   Ogg file      AudioObject   audio/vorbis
*.wav   Wav file      AudioObject   audio/vnd.wave
*.mp3   MP3 file      AudioObject   audio/mpeg
Output data
Name        Description                                                                   Type              Encoding Format
*.ctm       Automatic transcription of the input recording with timestamps (CTM)          DigitalDocument   text/plain
*.txt       Automatic transcription of the input recording                                DigitalDocument   text/plain
*.xml       Automatic transcription of the input recording (full data) (AudioDoc XML)     DigitalDocument   text/xml
error.log   Log file with (standard) error output                                         DigitalDocument   text/plain
*.tg        Automatic transcription of the input recording (full data) (Praat Textgrid)   DigitalDocument   text/plain
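
The *.ctm output listed above carries one recognized word per line with its timing. As a minimal sketch of reading such a file, the Python below assumes the conventional CTM column order (recording id, channel, start time in seconds, duration in seconds, word, optional confidence). Whether this service's output follows exactly that layout is an assumption, and the names `CtmWord` and `parse_ctm` as well as the sample lines are hypothetical.

```python
from typing import List, NamedTuple, Optional


class CtmWord(NamedTuple):
    recording: str
    channel: str
    start: float                 # seconds from the start of the recording
    duration: float              # seconds
    word: str
    confidence: Optional[float]  # None when the column is absent


def parse_ctm(text: str) -> List[CtmWord]:
    """Parse CTM text into word records, skipping blank lines and ';;' comments."""
    words = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(";;"):
            continue
        fields = line.split()
        if len(fields) < 5:
            raise ValueError(f"malformed CTM line: {line!r}")
        conf = float(fields[5]) if len(fields) > 5 else None
        words.append(CtmWord(fields[0], fields[1], float(fields[2]),
                             float(fields[3]), fields[4], conf))
    return words


# Made-up sample data, assuming the conventional column order.
sample = """\
rec1 1 0.00 0.35 hello 0.98
rec1 1 0.35 0.40 world
"""
for w in parse_ctm(sample):
    print(f"{w.start:.2f}-{w.start + w.duration:.2f} {w.word}")
```

Keeping the confidence optional lets the same parser handle files with and without that sixth column.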

Logs & Reviews

Name
Automatic software metadata validation report for unnamed software (unknown version)
Author
  • codemetapy validator using software.ttl
Date
2023-05-30 04:00:36
Review
Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems

Validation of unnamed software (unknown version) failed (score 0/5) due to one or more requirement violations:

1. Violation: Software source code *MUST* have a name. (This is missing in the metadata)
2. Violation: Software source code *MUST* have one (short) description. (This is missing in the metadata)
3. Violation: The authors of the software source code *MUST* be expressed. (This is missing in the metadata)
4. Violation: The maintainer of the software source code *MUST* be expressed. (This is missing in the metadata)
5. Violation: Software source code *MUST* state its license (This is missing in the metadata)
6. Violation: Software source code *MUST* state its version (This is missing in the metadata)
7. Warning: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)
8. Info: Software source code *MAY* express the programming language(s) used (This is missing in the metadata)
9. Warning: The producer of the software source code *SHOULD* be expressed (This is missing in the metadata)
10. Warning: All contributors *SHOULD* be expressed (This is missing in the metadata)
11. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)
12. Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)
13. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)
14. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)
15. Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)
Rating
☆ ☆ ☆ ☆ ☆
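
The six violations in the report above all concern properties that are missing from the metadata. A quick local pre-check for them could look like the sketch below. It is not the codemetapy validator, which checks against software.ttl. It only tests whether six top-level keys are present and non-empty. The key names follow the CodeMeta vocabulary, the helper `missing_required` is hypothetical, and the example values are placeholders rather than the repository's actual metadata.

```python
# Properties the report marks as *MUST* (violations 1 to 6).
REQUIRED = ("name", "description", "author", "maintainer", "license", "version")


def missing_required(metadata: dict) -> list:
    """Return the required property names that are absent or empty."""
    return [key for key in REQUIRED if not metadata.get(key)]


# Hypothetical, partially filled metadata with placeholder values.
example = {
    "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
    "@type": "SoftwareSourceCode",
    "name": "eng_asr",
    "version": "0.2.2",
}
print(missing_required(example))
```

An empty list means all six properties are present. It does not mean the metadata would pass the full validation.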
There were 1 error(s) harvesting this metadata, please inspect the log.
(log file starts at Tue May 30 04:00:34 UTC 2023)

[harvester info] --> Processing eng_asr (https://github.com/opensource-spraakherkenning-nl/eng_ASR) [Tue May 30 04:00:34 UTC 2023]

[harvester info] Git updating cached clone of https://github.com/opensource-spraakherkenning-nl/eng_ASR...

[harvester info] Found release v0.2.2

[harvester info] Using 'v0.2.2'

[harvester info] Git reference: v0.2.2

[harvester info] Scanning directory /tmp/codemeta-harvester.cache/eng_asr for harvestable resources...

[harvester info] found codemeta.json for eng_asr (md5sum fa7f00bd7ad6d1e42a35fd282a193d36); **NOTE: this is considered authoritative and most other detection methods will be skipped now!**

-- begin log --

HEAD is now at e35e2db metadata update

-- end log --

[harvester error] codemeta.json for eng_asr is not well-formed

[harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)...

[harvester info] Inferred repostatus https://www.repostatus.org/#active

[harvester info] Looking for repostatus information in README.md in master branch...

-- begin log --

-- end log --

[harvester info] Found README.md

[harvester info] Reconciliating: codemetapy  --baseuri https://webservices.cls.ru.nl/portal --baseuri https://webservices.cls.ru.nl/portal --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "eng_asr" --codeRepository "https://github.com/opensource-spraakherkenning-nl/eng_ASR" --validate /etc/software.ttl --released --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/eng_asr.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.eng_asr.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.eng_asr.codemeta.json 

-- begin log --

Passed 2 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-repostatus.eng_asr.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.eng_asr.codemeta.json', 'json')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/eng-asr

Processing source #1 of 2

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.eng_asr.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/eng-asr

[CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/eng-asr)] processed 1 new triples, total is now 2

Processing source #2 of 2

Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.eng_asr.codemeta.json

    NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically...

    Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/eng-asr

[CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/eng-asr)] processed 1 new triples, total is now 3

Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/eng-asr -> https://webservices.cls.ru.nl/portal/eng_asr/snapshot

[CODEMETA VALIDATION (eng_asr)] author not set

[CODEMETA VALIDATION (eng_asr)] license not set

[CODEMETA VALIDATION (eng_asr)] done

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #1: Violation: Software source code *MUST* have a name. (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #2: Violation: Software source code *MUST* have one (short) description. (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #3: Violation: The authors of the software source code *MUST* be expressed. (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #4: Violation: The maintainer of the software source code *MUST* be expressed. (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #5: Violation: Software source code *MUST* state its license (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #6: Violation: Software source code *MUST* state its version (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #7: Warning: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #8: Info: Software source code *MAY* express the programming language(s) used (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #9: Warning: The producer of the software source code *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #10: Warning: All contributors *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #11: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #12: Warning: Documentation *SHOULD* be expressed (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #13: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #14: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata)

VALIDATION https://webservices.cls.ru.nl/portal/eng_asr/snapshot #15: Info: The technology readiness level *SHOULD* be expressed (This is missing in the metadata)

-- end log --

[harvester info] Output written to /tmp/out/eng_asr.codemeta.json

[harvester info] Harvesting remote service URL https://webservices.cls.ru.nl/eng_ASR for eng_asr: codemetapy  --baseuri https://webservices.cls.ru.nl/portal --baseuri https://webservices.cls.ru.nl/portal --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl -O "/tmp/codemeta-harvester.cache//tmp/eng_asr.codemeta.json" "/tmp/out/eng_asr.codemeta.json" "https://webservices.cls.ru.nl/eng_ASR"

-- begin log --

Passed 2 files/sources but specified 0 input types! Automatically guessing types...

Detected input types: [('/tmp/out/eng_asr.codemeta.json', 'json'), ('https://webservices.cls.ru.nl/eng_ASR', 'web')]

Adding to contextgraph: /tmp/turtle

Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/eng-asr

Processing source #1 of 2

Parsing json-ld file from /tmp/out/eng_asr.codemeta.json

    Found main resource with URI https://webservices.cls.ru.nl/portal/eng_asr/snapshot

    Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/eng-asr

[CODEMETA COMPOSITION (eng_asr)] processed 20 new triples, total is now 20

Processing source #2 of 2

Fallback: Obtaining metadata from remote URL https://webservices.cls.ru.nl/eng_ASR

    Service replied with content-type application/ld+json

    Parsing json...

    Found main resource with URI https://webservices.cls.ru.nl/eng_ASR

    Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/webapplication/N5e5bfffcebf4fb6e570fb296bcacd765

Adding service (targetProduct) https://webservices.cls.ru.nl/eng_ASR

[CODEMETA COMPOSITION (eng_asr)] processed 62 new triples, total is now 83

Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/eng-asr -> https://webservices.cls.ru.nl/portal/eng_asr/snapshot

[CODEMETA VALIDATION (eng_asr)] author not set

[CODEMETA VALIDATION (eng_asr)] license not set

[CODEMETA VALIDATION (eng_asr)] done

-- end log --

[harvester info] <-- Finished processing eng_asr (https://github.com/opensource-spraakherkenning-nl/eng_ASR) [Tue May 30 04:00:40 UTC 2023]
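
The log above reports that codemeta.json is not well-formed but does not say why. Assuming this means the file fails to parse as JSON, a check like the following sketch can locate the problem before the next harvest. The helper name `check_well_formed` and the broken sample (a trailing comma) are hypothetical and do not reflect the actual contents of the repository's file.

```python
import json
from typing import Optional


def check_well_formed(text: str) -> Optional[str]:
    """Return None if text parses as JSON, else a short description of the error."""
    try:
        json.loads(text)
    except json.JSONDecodeError as err:
        return f"line {err.lineno}, column {err.colno}: {err.msg}"
    return None


# Hypothetical broken input: a trailing comma, which JSON does not allow.
broken = '{\n  "name": "eng_asr",\n}'
print(check_well_formed(broken))
print(check_well_formed('{"name": "eng_asr"}'))
```

In practice the text would come from reading the codemeta.json file in a checkout of the repository.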

        

Metadata Properties

Interface types
  • Web Application
Source code repository
 https://github.com/opensource-spraakherkenning-nl/eng_ASR
Development Status
  • Active: The project has reached a stable, usable state and is being actively developed.
Documentation
Metadata validation
☆ ☆ ☆ ☆ ☆