bibtype C - Conference Paper (international conference)
ARLID 0561726
utime 20230316105605.3
mtime 20221002235959.9
title (primary) (eng) Automatic Verb Classifier for Abui (AVC-abz)
specification
page_count 9 s.
media_type P
serial
ARLID cav_un_epca*0561725
ISBN 978-2-493814-07-4
title Proceedings of the Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia within the 13th Language Resources and Evaluation Conference
page_num 42-50
publisher
place Paris
name European Language Resources Association
year 2022
keyword automatic verb classifier
keyword endangered languages
keyword head-marking languages
keyword Papuan
author (primary)
ARLID cav_un_auth*0414315
name1 Kratochvíl
name2 F.
country CZ
author
ARLID cav_un_auth*0431676
name1 Saad
name2 G.
country CZ
author
ARLID cav_un_auth*0101228
name1 Vomlel
name2 Jiří
institution UTIA-B
full_dept (cz) Matematická teorie rozhodování
full_dept Department of Decision Making Theory
department (cz) MTR
department MTR
full_dept Department of Decision Making Theory
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
author
ARLID cav_un_auth*0216188
name1 Kratochvíl
name2 Václav
institution UTIA-B
full_dept (cz) Matematická teorie rozhodování
full_dept Department of Decision Making Theory
department (cz) MTR
department MTR
full_dept Department of Decision Making Theory
country CZ
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
source
url http://library.utia.cas.cz/separaty/2022/MTR/vomlel-0561726.pdf
cas_special
project
project_id GA20-18407S
agency GA ČR
ARLID cav_un_auth*0397557
abstract (eng) We present an automatic verb classifier system that identifies inflectional classes in Abui (AVC-abz), a Papuan language of the Timor-Alor-Pantar family. The system combines manually annotated language data (the learning set) with the output of a morphological precision grammar (corpus data). The morphological precision grammar is trained on a fully glossed smaller corpus and applied to a larger corpus. Using the k-means algorithm, the system clusters inflectional classes discovered in the learning set. In the second step, Naive Bayes algorithm assigns the verbs found in the corpus data to the best-fitting cluster. AVC-abz serves to advance and refine the grammatical analysis of Abui as well as to monitor corpus coverage and its gradual improvement.
action
ARLID cav_un_auth*0437176
name Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia
dates 20220620
mrcbC20-s 20220625
place Marseille
country FR
RIV AI
FORD0 60000
FORD1 60200
FORD2 60203
reportyear 2023
num_of_auth 4
presentation_type PR
inst_support RVO:67985556
permalink https://hdl.handle.net/11104/0335178
cooperation
ARLID cav_un_auth*0304106
name Univerzita Palackého v Olomouci, Filozofická fakulta
institution FF UP
country CZ
confidential S
arlyear 2022
mrcbU14 SCOPUS
mrcbU24 PUBMED
mrcbU34 WOS
mrcbU63 cav_un_epca*0561725 Proceedings of the Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia within the 13th Language Resources and Evaluation Conference European Language Resources Association 2022 Paris 42 50 978-2-493814-07-4