Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MODEL TRAINING METHOD, VOICE DETECTION AND LOCALIZATION METHOD, APPARATUS, DEVICE, AND MEDIUM
Document Type and Number:
WIPO Patent Application WO/2023/273469
Kind Code:
A1
Abstract:
A model training method, a voice detection and localization method, an apparatus, a device, and a medium. The method comprises: receiving speech content by means of a microphone array, and obtaining multi-channel audio, the speech content being voice speech content or noise speech content; obtaining a target vector and an audio feature parameter of the multi-channel audio, the target vector comprising N labels, the N labels being in one-to-one correspondence with N spatial regions, and each label indicating the probability that the corresponding spatial region contains a voice; taking the audio feature parameter of the multi-channel audio to serve as training sample input, taking the target vector of the multi-channel audio to serve as training sample target output, utilizing a training sample to train a deep neural network, and obtaining a target model. The accuracy of the target model is thereby increased, and the accuracy of voice detection and localization is consequently increased.

More Like This:
Inventors:
HAN KEWEI (CN)
Application Number:
PCT/CN2022/084599
Publication Date:
January 05, 2023
Filing Date:
March 31, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CLOUDMINDS ROBOTICS CO LTD (CN)
International Classes:
G10L25/78; G01S3/802
Domestic Patent References:
WO2021013346A12021-01-28
Foreign References:
CN111142066A2020-05-12
CN112799016A2021-05-14
CN105068048A2015-11-18
CN110794368A2020-02-14
US20170040030A12017-02-09
Attorney, Agent or Firm:
TEKYRS INTELLECTUAL PROPERTY INC. (CN)
Download PDF: