@prefix codemeta: . @prefix dcterms: . @prefix iodata: . @prefix ns1: . @prefix ns2: . @prefix owl: . @prefix rdf: . @prefix rdfs: . @prefix sdo: . @prefix skos: . @prefix stype: . @prefix xsd: . a rdfs:Class . a skos:ConceptScheme ; dcterms:creator "Jason Antman" ; dcterms:description "A standard to easily communicate to humans and machines the development/support and usability status of software repositories/projects." ; dcterms:title "repostatus.org" . sdo:WebApplication rdfs:label "Web Application" ; rdfs:comment "A software application served as a service over the web with an interface for human end-users" . sdo:name "GNU General Public License v3" . sdo:name "GNU General Public License v3.0 or later" . a skos:Concept, ; skos:altLabel "beta" ; skos:definition "The technology is implemented and ready for experimental settings (beta), but requires further work and validation." ; skos:inScheme "trl:Scheme" ; skos:notation "5-7" ; skos:prefLabel "Experimental" . a sdo:Audience ; sdo:audienceType "Developers" . a sdo:Audience ; sdo:audienceType "Science/Research" . a sdo:SoftwareApplication ; sdo:identifier "CLAM" ; sdo:name "CLAM" ; sdo:runtimePlatform "Python 3" ; sdo:version ">= 3.2" . a sdo:SoftwareApplication ; sdo:identifier "pyannote.audio" ; sdo:name "pyannote.audio" ; sdo:runtimePlatform "Python 3" . a sdo:SoftwareSourceCode ; sdo:applicationCategory "Internet > WWW/HTTP > WSGI > Application", "Text Processing > Linguistic" ; sdo:audience , ; sdo:author ; sdo:codeRepository ; sdo:contributor ; sdo:dateCreated "2024-04-15T12:39:11Z+0200"^^sdo:Date ; sdo:dateModified "2024-04-15T14:24:17Z+0200"^^sdo:Date ; sdo:description "Speaker diarisation service, powered by PyAnnote" ; sdo:identifier "diarisationservice" ; sdo:keywords "clam webservice rest nlp computational_linguistics rest" ; sdo:license , ; sdo:maintainer ; sdo:name "diarisationservice" ; sdo:operatingSystem "BSD", "Linux", "macOS" ; sdo:producer ; sdo:programmingLanguage "Python" ; sdo:review ; sdo:runtimePlatform "Python 3", "Python 3.10", "Python 3.11", "Python 3.6", "Python 3.7", "Python 3.8", "Python 3.9" ; sdo:softwareRequirements , ; sdo:targetProduct , ; sdo:url ; sdo:version "0.1" ; owl:sameAs , , ; codemeta:buildInstructions ; codemeta:developmentStatus , ; codemeta:issueTracker ; codemeta:readme ; ns1:errors 0 ; ns1:log """(log file starts at Wed Jan 15 04:01:22 UTC 2025) [harvester info] --> Processing diarisationservice (https://github.com/opensource-spraakherkenning-nl/diarisationservice) [Wed Jan 15 04:01:22 UTC 2025] [harvester info] Git updating cached clone of https://github.com/opensource-spraakherkenning-nl/diarisationservice... [harvester info] No releases found, falling back to default git branch! [harvester info] Using 'master' [harvester info] Git reference: master [harvester info] Scanning directory /tmp/codemeta-harvester.cache/diarisationservice for harvestable resources... [harvester info] found python setup for diarisationservice, converting to codemeta -- begin log -- No input files specified, but found python project (setup.py) in current dir, using that... Generating egg_info running egg_info writing diarisationservice.egg-info/PKG-INFO writing dependency_links to diarisationservice.egg-info/dependency_links.txt writing requirements to diarisationservice.egg-info/requires.txt writing top-level names to diarisationservice.egg-info/top_level.txt reading manifest file 'diarisationservice.egg-info/SOURCES.txt' reading manifest template 'MANIFEST.in' adding license file 'LICENSE' writing manifest file 'diarisationservice.egg-info/SOURCES.txt' Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/diarisationservice Processing source #1 of 1 Obtaining python package metadata for: diarisationservice Loading metadata from diarisationservice via importlib.metadata WARNING: No translation for distutils or pyproject.toml key Metadata-Version WARNING: No translation for distutils or pyproject.toml key License-File WARNING: No translation for distutils or pyproject.toml key Description Found dependency CLAM >= 3.2 Found dependency pyannote.audio [CODEMETA COMPOSITION (diarisationservice)] processed 46 new triples, total is now 47 Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/diarisationservice -> https://webservices.cls.ru.nl/portal/diarisationservice/0.1 [CODEMETA VALIDATION (diarisationservice)] done -- end log -- [harvester info] Looking for license.... [harvester info] Found license AGPL-3.0-only -- begin log -- Trying README.md ... Trying LICENSE ... -- end log -- [harvester info] Getting contributors from git... -- begin log -- Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/diarisationservice-contributors Processing source #1 of 1 Extracting contributors from /tmp/codemeta-harvester.cache//tmp/diarisationservice.CONTRIBUTORS [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice-contributors)] processed 7 new triples, total is now 8 Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/diarisationservice-contributors -> https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot)] codeRepository not set [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot)] author not set [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot)] license not set [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot)] done -- end log -- [harvester info] Extracting last and first commit date from git log.... [harvester info] Date created: 2024-04-15T12:39:11Z+0200, date modified: 2024-04-15T14:24:17Z+0200 [harvester info] Querying Github/GitLab API (https://github.com/opensource-spraakherkenning-nl/diarisationservice) -- begin log -- Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/diarisationservice Processing source #1 of 1 Querying GitAPI parser for https://github.com/opensource-spraakherkenning-nl/diarisationservice Parsing Github API response [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 12 new triples, total is now 13 Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/diarisationservice -> https://webservices.cls.ru.nl/portal/diarisationservice/snapshot [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice/snapshot)] author not set [CODEMETA VALIDATION (https://webservices.cls.ru.nl/portal/diarisationservice/snapshot)] done Querying https://api.github.com/repos/opensource-spraakherkenning-nl/diarisationservice Remaining github API requests: 4995 ### Next rate limit reset at: 2025-01-15 05:00:34 (has_token=True) Querying https://api.github.com/users/opensource-spraakherkenning-nl Remaining github API requests: 4994 ### Next rate limit reset at: 2025-01-15 05:00:34 (has_token=True) -- end log -- [harvester info] Found buildInstructions in INSTALL [harvester info] Looking for TRL information in README.md... -- begin log -- -- end log -- [harvester info] Looking for repostatus information in README.md... -- begin log -- -- end log -- [harvester info] Looking for continuous integration information in README.md... -- begin log -- -- end log -- [harvester info] Looking for documentation links in README.md... -- begin log -- -- end log -- [harvester info] Inferring repostatus information from git activity (used only as a fallback if not explicitly provided)... [harvester info] Inferred repostatus https://www.repostatus.org/#inactive [harvester info] Found README.md [harvester info] Reconciliating: codemetapy --baseuri https://webservices.cls.ru.nl/portal --baseuri https://webservices.cls.ru.nl/portal --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl --identifier "diarisationservice" --codeRepository "https://github.com/opensource-spraakherkenning-nl/diarisationservice" --validate /etc/software.ttl --enrich --textv "Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems" -O /tmp/out/diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/99-repostatus.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/42-buildinstructions.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/41-readme.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/40-gitapi.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/39-gitdate.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/32-contributors.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/29-license.diarisationservice.codemeta.json /tmp/codemeta-harvester.cache//tmp/20-python.diarisationservice.codemeta.json -- begin log -- Passed 8 files/sources but specified 0 input types! Automatically guessing types... Detected input types: [('/tmp/codemeta-harvester.cache//tmp/99-repostatus.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/42-buildinstructions.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/41-readme.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/40-gitapi.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/39-gitdate.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/32-contributors.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/29-license.diarisationservice.codemeta.json', 'json'), ('/tmp/codemeta-harvester.cache//tmp/20-python.diarisationservice.codemeta.json', 'json')] Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/diarisationservice Processing source #1 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/99-repostatus.diarisationservice.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 1 new triples, total is now 2 Processing source #2 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/42-buildinstructions.diarisationservice.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 1 new triples, total is now 3 Processing source #3 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/41-readme.diarisationservice.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 1 new triples, total is now 4 Processing source #4 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/40-gitapi.diarisationservice.codemeta.json Found main resource with URI https://webservices.cls.ru.nl/portal/diarisationservice/snapshot Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 14 new triples, total is now 17 Processing source #5 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/39-gitdate.diarisationservice.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] overriding old http://schema.org/dateCreated (2024-04-15T12:12:21Z -> 2024-04-15T12:39:11Z+0200) [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] overriding old http://schema.org/dateModified (2024-04-15T12:22:53Z -> 2024-04-15T14:24:17Z+0200) [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 2 new triples, total is now 17 Processing source #6 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/32-contributors.diarisationservice.codemeta.json Found main resource with URI https://webservices.cls.ru.nl/portal/diarisationservice.contributors/snapshot Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 8 new triples, total is now 24 Processing source #7 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/29-license.diarisationservice.codemeta.json NOTE: Not a valid JSON-LD document, @context missing! Attempting to inject automatically... Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] overriding old http://schema.org/license (http://spdx.org/licenses/AGPL-3.0-only -> AGPL-3.0-only) [CODEMETA CORRECTION (https://webservices.cls.ru.nl/portal/diarisationservice)] automatically converting license to spdx URI [CODEMETA COMPOSITION (https://webservices.cls.ru.nl/portal/diarisationservice)] processed 1 new triples, total is now 24 Processing source #8 of 8 Parsing json-ld file from /tmp/codemeta-harvester.cache//tmp/20-python.diarisationservice.codemeta.json Found main resource with URI https://webservices.cls.ru.nl/portal/diarisationservice/0.1 Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (diarisationservice)] overriding old http://schema.org/description (Speaker diarisiation service powered by PyAnnote (diarisiation) and CLAM (webservice) -> Speaker diarisation service, powered by PyAnnote) [CODEMETA COMPOSITION (diarisationservice)] overriding old https://codemeta.github.io/terms/developmentStatus (https://www.repostatus.org/#inactive -> https://www.repostatus.org/#wip) [CODEMETA COMPOSITION (diarisationservice)] overriding old http://schema.org/license (http://spdx.org/licenses/AGPL-3.0-only -> http://spdx.org/licenses/GPL-3.0-only) [CODEMETA COMPOSITION (diarisationservice)] processed 64 new triples, total is now 80 Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/diarisationservice -> https://webservices.cls.ru.nl/portal/diarisationservice/0.1 [CODEMETA VALIDATION (diarisationservice)] done [CODEMETA ENRICHMENT (diarisationservice)] Guessing interface type https://w3id.org/software-types#WebApplication based on clues [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] automatically adding programmingLanguage Python derived from runtimePlatform Python [CODEMETA ENRICHMENT (diarisationservice)] considering first author as maintainer VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #1: Warning: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #2: Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #3: Warning: Documentation *SHOULD* be expressed (This is missing in the metadata) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #4: Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #5: Info: The funder *SHOULD* be acknowledged (This is missing in the metadata) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #6: Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata) VALIDATION https://webservices.cls.ru.nl/portal/diarisationservice/0.1 #7: Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata) -- end log -- [harvester info] Output written to /tmp/out/diarisationservice.codemeta.json [harvester info] Harvesting remote service URL https://webservices2.cls.ru.nl/diarisationservice/ for diarisationservice: codemetapy --baseuri https://webservices.cls.ru.nl/portal --baseuri https://webservices.cls.ru.nl/portal --includecontext --addcontext https://w3id.org/nwo-research-fields --addcontext https://w3id.org/research-technology-readiness-levels --addcontextgraph https://vocabs.dariah.eu/rest/v1/tadirah/data?format=text/turtle --trl -O "/tmp/codemeta-harvester.cache//tmp/diarisationservice.codemeta.json" "/tmp/out/diarisationservice.codemeta.json" "https://webservices2.cls.ru.nl/diarisationservice/" -- begin log -- Passed 2 files/sources but specified 0 input types! Automatically guessing types... Detected input types: [('/tmp/out/diarisationservice.codemeta.json', 'json'), ('https://webservices2.cls.ru.nl/diarisationservice/', 'web')] Adding to contextgraph: /tmp/turtle Initial URI automatically generated, may be overriden later: https://webservices.cls.ru.nl/portal/diarisationservice Processing source #1 of 2 Parsing json-ld file from /tmp/out/diarisationservice.codemeta.json Found main resource with URI https://webservices.cls.ru.nl/portal/diarisationservice/0.1 Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/diarisationservice [CODEMETA COMPOSITION (diarisationservice)] processed 94 new triples, total is now 94 Processing source #2 of 2 Fallback: Obtaining metadata from remote URL https://webservices2.cls.ru.nl/diarisationservice/ Service replied with content-type application/ld+json Parsing json... Found main resource with URI https://webservices2.cls.ru.nl/diarisationservice Injected (possibly temporary) URI https://webservices.cls.ru.nl/portal/webapplication/Na3c35f69b3709808f67e19c9082bc10b Adding service (targetProduct) https://webservices2.cls.ru.nl/diarisationservice/ [CODEMETA COMPOSITION (diarisationservice)] processed 29 new triples, total is now 124 Remapping URI to (possibly) new identifier and version component: https://webservices.cls.ru.nl/portal/diarisationservice -> https://webservices.cls.ru.nl/portal/diarisationservice/0.1 [CODEMETA VALIDATION (diarisationservice)] done -- end log -- [harvester info] <-- Finished processing diarisationservice (https://github.com/opensource-spraakherkenning-nl/diarisationservice) [Wed Jan 15 04:01:41 UTC 2025] """ . a sdo:Organization ; sdo:name "KNAW Humanities Cluster & CLST, Radboud University" . a sdo:Organization ; sdo:email "bestuur@openspraaktechnologie.org" ; sdo:name "Open Spraaktechnologie" ; sdo:url . rdf:first "codemetapy validator using software.ttl" ; rdf:rest () . a sdo:DigitalDocument ; sdo:description "Speaker diarisation" ; sdo:encodingFormat "text/plain" ; sdo:identifier "RTTMFormat" ; sdo:name "*.rttm" . a sdo:DigitalDocument ; sdo:description "Log file with (standard) error output" ; sdo:encodingFormat "text/plain" ; sdo:identifier "PlainTextFormat" ; sdo:name "error.log" . a sdo:Person ; sdo:email "proycon@anaproy.nl" ; sdo:name "Maarten van Gompel" . a sdo:AudioObject ; sdo:description "Wav audio file" ; sdo:encodingFormat "audio/vnd.wave" ; sdo:identifier "WaveAudioFormat" ; sdo:name "*.wav" . rdf:first ; rdf:rest () . a stype:WebApplication ; sdo:name "diarisationservice" . a sdo:Review ; sdo:author ; sdo:datePublished "2025-01-15 04:01:36"^^sdo:Date ; sdo:name "Automatic software metadata validation report for diarisationservice 0.1" ; sdo:reviewBody """Please consult the CLARIAH Software Metadata Requirements at https://github.com/CLARIAH/clariah-plus/blob/main/requirements/software-metadata-requirements.md for an in-depth explanation of any found problems Validation of diarisationservice 0.1 was successful (score=3/5), but there are some warnings which should be addressed: 1. Warning: Software source code *SHOULD* link to a continuous integration service that builds the software and runs the software's tests (This is missing in the metadata) 2. Info: An interface type *SHOULD* be expressed: Software source code should define one or more target products that are the resulting software applications offering specific interfaces (The metadata does express this currently, but something is wrong in the way it is expressed. Is the type/class valid?) 3. Warning: Documentation *SHOULD* be expressed (This is missing in the metadata) 4. Info: Reference publications *SHOULD* be expressed, if any (This is missing in the metadata) 5. Info: The funder *SHOULD* be acknowledged (This is missing in the metadata) 6. Info: A research domain *SHOULD* be expressed as a category using the NWO Research Fields vocabulary, if applicable (This is missing in the metadata) 7. Info: A research activity *SHOULD* be expressed as a category using the TaDiRaH vocabulary (This is missing in the metadata)""" ; sdo:reviewRating 3 . a sdo:WebApplication ; sdo:author ; sdo:description "A speaker diarisation service powered by PyAnnote" ; sdo:name "Speaker Diarisation Service" ; sdo:url ; sdo:version "0.1" ; iodata:consumesData ; iodata:producesData , . a skos:Concept ; ns2:image "https://www.repostatus.org/badges/latest/wip.svg" ; skos:altLabel "Work in Progress" ; skos:definition "Initial development is in progress, but there has not yet been a stable, usable release suitable for the public." ; skos:inScheme "https://www.repostatus.org" ; skos:prefLabel "WIP" . skos:Concept a owl:Class . a sdo:Person ; sdo:affiliation ; sdo:email "proycon@anaproy.nl" ; sdo:familyName "van Gompel", "van gompel" ; sdo:givenName "Maarten" ; sdo:url . rdf:first ; rdf:rest () .