Savitar: an intelligent sign language translation approach for deafness and dysphonia in the COVID-19 era
Purpose In the COVID-19 era, sign language (SL) translation has gained attention in online learning, which evaluates the physical gestures of each student and bridges the communication gap between dysphonia and hearing people. The purpose of this paper is to devote the alignment between SL sequence and nature language sequence with high translation performance. Design/methodology/approach SL can be characterized as joint/bone location information in two-dimensional space over time, forming skeleton sequences. To encode joint, bone and their motion information, we propose a multistream hierarchy network (MHN) along with a vocab prediction network (VPN) and a joint network (JN) with the recurrent neural network transducer. The JN is used to concatenate the sequences encoded by the MHN and VPN and learn their sequence alignments. Findings We verify the effectiveness of the proposed approach and provide experimental results on three large-scale datasets, which show that translation accuracy is 94.96, 54.52, and 92.88 per cent, and the inference time is 18 and 1.7 times faster than listen-attend-spell network (LAS) and visual hierarchy to lexical sequence network (H2SNet) , respectively. Originality/value In this paper, we propose a novel framework that can fuse multimodal input (i.e. joint, bone and their motion stream) and align input streams with nature language. Moreover, the provided framework is improved by the different properties of MHN, VPN and JN. Experimental results on the three datasets demonstrate that our approaches outperform the state-of-the-art methods in terms of translation accuracy and speed.
Year of publication: |
2023
|
---|---|
Authors: | Liang, Wuyan ; Xu, Xiaolong |
Published in: |
Data Technologies and Applications. - Emerald Publishing Limited, ISSN 2514-9318, ZDB-ID 2935212-5. - Vol. 58.2023, 2, p. 153-175
|
Publisher: |
Emerald Publishing Limited |
Subject: | Multistream hierarchy network | Online learning | Sign language translation | RNN-T sequence-to-sequence |
Saved in:
Saved in favorites
Similar items by subject
-
Kazemilari, Mansooreh, (2024)
-
Romania: Ready for online learning?
Dinculescu, Corina Georgeta, (2021)
-
The Uneven Effect of COVID School Closures: Parents in Teleworkable vs. Non-teleworkable Occupations
Aparicio Fenoll, Ainoa, (2022)
- More ...
Similar items by person