Zuo et al., 2024 - Google Patents

A simple baseline for spoken language to sign language translation with 3D avatars

Document ID: 5763003059699879259
Authors: Zuo R; Wei F; Chen Z; Mak B; Yang J; Tong X
Publication year: 2024
Publication venue: European Conference on Computer Vision

Snippet

The objective of this paper is to develop a functional system for translating spoken languages into sign languages, referred to as Spoken2Sign translation. The Spoken2Sign task is orthogonal and complementary to traditional sign language to spoken language …

Classifications

- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/20—Handling natural language data
          - G06F17/28—Processing or translating of natural language
            - G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
            - G06F17/2809—Data driven translation
              - G06F17/2827—Example based machine translation; Alignment
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
          - G06F17/30861—Retrieval from the Internet, e.g. browsers
      - G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
        - G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/62—Methods or arrangements for recognition using electronic means
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T13/00—Animation
        - G06T13/20—3D [Three Dimensional] animation
          - G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
      - G06T7/00—Image analysis
    - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N99/00—Subject matter not provided for in other groups of this subclass
        - G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run

Similar Documents

Publication	Publication Date	Title
Rastgoo et al.	2021	Sign language production: A review
Stoll et al.	2020	Text2Sign: towards sign language production using neural machine translation and generative adversarial networks
Petrovich et al.	2022	Temos: Generating diverse human motions from textual descriptions
Lin et al.	2023	Motion-x: A large-scale 3d expressive whole-body human motion dataset
Liang et al.	2024	Intergen: Diffusion-based multi-human motion generation under complex interactions
Stoll et al.	2018	Sign language production using neural machine translation and generative adversarial networks
Hu et al.	2021	Hand-model-aware sign language recognition
Parelli et al.	2020	Exploiting 3d hand pose estimation in deep learning-based sign language recognition from rgb videos
Natarajan et al.	2022	Dynamic GAN for high-quality sign language video generation from skeletal poses using generative adversarial networks
Wang et al.	2018	Look deeper see richer: Depth-aware image paragraph captioning
Aujeszky et al.	2016	A gesture recognition architecture for Arabic sign language communication system
Baltatzis et al.	2024	Neural sign actors: a diffusion model for 3d sign language production from text
Patel et al.	2021	Recent advances in video question answering: A review of datasets and methods
Brock et al.	2020	Learning three-dimensional skeleton data from sign language video
Xie et al.	2021	Sequential gesture learning for continuous labanotation generation based on the fusion of graph neural networks
Xu et al.	2021	Text-guided human image manipulation via image-text shared space
Eunice et al.	2023	Sign2Pose: A pose-based approach for gloss prediction using a transformer model
Stoll et al.	2022	There and back again: 3d sign language generation from text using back-translation
Hong et al.	2023	Dagan++: Depth-aware generative adversarial network for talking head video generation
Zuo et al.	2024	A simple baseline for spoken language to sign language translation with 3d avatars
Yu et al.	2024	Signavatars: A large-scale 3d sign language holistic motion dataset and benchmark
Wahane et al.	2022	Real-time sign language recognition using deep learning techniques
Nocentini et al.	2024	Scantalk: 3d talking heads from unregistered scans
Zhang et al.	2021	Adversarial synthesis of human pose from text
De Martino et al.	2023	Neural machine translation from text to sign language