Zuo et al., 2024 - Google Patents
A simple baseline for spoken language to sign language translation with 3d avatarsZuo et al., 2024
View PDF- Document ID
- 5763003059699879259
- Author
- Zuo R
- Wei F
- Chen Z
- Mak B
- Yang J
- Tong X
- Publication year
- Publication venue
- European Conference on Computer Vision
External Links
Snippet
The objective of this paper is to develop a functional system for translating spoken languages into sign languages, referred to as Spoken2Sign translation. The Spoken2Sign task is orthogonal and complementary to traditional sign language to spoken language …
- 238000013519 translation 0 title abstract description 80
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Rastgoo et al. | Sign language production: A review | |
Stoll et al. | Text2Sign: towards sign language production using neural machine translation and generative adversarial networks | |
Petrovich et al. | Temos: Generating diverse human motions from textual descriptions | |
Lin et al. | Motion-x: A large-scale 3d expressive whole-body human motion dataset | |
Liang et al. | Intergen: Diffusion-based multi-human motion generation under complex interactions | |
Stoll et al. | Sign language production using neural machine translation and generative adversarial networks | |
Hu et al. | Hand-model-aware sign language recognition | |
Parelli et al. | Exploiting 3d hand pose estimation in deep learning-based sign language recognition from rgb videos | |
Natarajan et al. | Dynamic GAN for high-quality sign language video generation from skeletal poses using generative adversarial networks | |
Wang et al. | Look deeper see richer: Depth-aware image paragraph captioning | |
Aujeszky et al. | A gesture recogintion architecture for Arabic sign language communication system | |
Baltatzis et al. | Neural sign actors: a diffusion model for 3d sign language production from text | |
Patel et al. | Recent advances in video question answering: A review of datasets and methods | |
Brock et al. | Learning three-dimensional skeleton data from sign language video | |
Xie et al. | Sequential gesture learning for continuous labanotation generation based on the fusion of graph neural networks | |
Xu et al. | Text-guided human image manipulation via image-text shared space | |
Eunice et al. | Sign2Pose: A pose-based approach for gloss prediction using a transformer model | |
Stoll et al. | There and back again: 3d sign language generation from text using back-translation | |
Hong et al. | Dagan++: Depth-aware generative adversarial network for talking head video generation | |
Zuo et al. | A simple baseline for spoken language to sign language translation with 3d avatars | |
Yu et al. | Signavatars: A large-scale 3d sign language holistic motion dataset and benchmark | |
Wahane et al. | Real-time sign language recognition using deep learning techniques | |
Nocentini et al. | Scantalk: 3d talking heads from unregistered scans | |
Zhang et al. | Adversarial synthesis of human pose from text | |
De Martino et al. | Neural machine translation from text to sign language |