Title:
INTERACTION OBJECT DRIVING AND PHONEME PROCESSING METHODS AND APPARATUS, DEVICE AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2022/252890
Kind Code:
A1
Abstract:
Disclosed are interactive object driving and phoneme processing methods, an apparatus, a device, and a storage medium. The interactive object driving method comprises: acquiring a sound feature of sound driving data of an interactive object; performing feature extraction on the sound feature using a sound feature extraction network to obtain a phoneme posterior probability of each voice frame in the sound driving data, the sound feature extraction network being obtained by means of training of a phoneme table containing multiple languages; according to the phoneme posterior probability of each voice frame, obtaining a posture parameter value of the interactive object; and controlling the posture of the interactive object according to the posture parameter value.
More Like This:
Inventors:
WU WENYAN (CN)
WU QIANYI (CN)
GAO NA (CN)
QIAN CHEN (CN)
WU QIANYI (CN)
GAO NA (CN)
QIAN CHEN (CN)
Application Number:
PCT/CN2022/089870
Publication Date:
December 08, 2022
Filing Date:
April 28, 2022
Export Citation:
Assignee:
SHANGHAI SENSETIME INTELLIGENT TECH CO LTD (CN)
International Classes:
G10L15/02; G10L13/02; G10L13/08; G10L13/10; G10L15/06; G10L15/22; G10L21/10; G10L25/24
Foreign References:
CN113314104A | 2021-08-27 | |||
CN110503942A | 2019-11-26 | |||
CN112017648A | 2020-12-01 | |||
CN111933110A | 2020-11-13 | |||
CN111459450A | 2020-07-28 | |||
CN110880315A | 2020-03-13 | |||
CN112669841A | 2021-04-16 |
Attorney, Agent or Firm:
BEIJING BESTIPR INTELLECTUAL PROPERTY LAW CORPORATION (CN)
Download PDF: