[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN109273002A - Vehicle configuration method, system, vehicle device and vehicle - Google Patents

Vehicle configuration method, system, vehicle device and vehicle Download PDF

Info

Publication number
CN109273002A
CN109273002A CN201811260569.6A CN201811260569A CN109273002A CN 109273002 A CN109273002 A CN 109273002A CN 201811260569 A CN201811260569 A CN 201811260569A CN 109273002 A CN109273002 A CN 109273002A
Authority
CN
China
Prior art keywords
user
voiceprint
vehicle
voice messaging
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811260569.6A
Other languages
Chinese (zh)
Other versions
CN109273002B (en
Inventor
徐曼
孙珏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NIO Holding Co Ltd
Original Assignee
NIO Nextev Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NIO Nextev Ltd filed Critical NIO Nextev Ltd
Priority to CN201811260569.6A priority Critical patent/CN109273002B/en
Publication of CN109273002A publication Critical patent/CN109273002A/en
Application granted granted Critical
Publication of CN109273002B publication Critical patent/CN109273002B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

The invention belongs to technical field of voice recognition, a kind of vehicle configuration method, system, vehicle device and vehicle are specifically provided, it is intended to solve the problems, such as to need manual setting individual cultivation again when the prior art replaces driver and passenger.For this purpose, the present invention provides a kind of vehicle configuration method, including obtaining the voice messaging of user and identifying the corresponding user's voiceprint of voice messaging;According to preset multiple voiceprints User ID corresponding with the one-to-one relationship of multiple User ID acquisition user's voiceprint;Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.Based on above-mentioned steps, the method provided according to the present invention, vehicle can execute automatically according to the individual cultivation before user to be operated with the associated vehicle configuration of the User ID, improve the car experience of user, corresponding vehicle configuration operation can also be executed according to the ID of different user, improve the intelligence of vehicle.

Description

Vehicle configuration method, system, vehicle device and vehicle
Technical field
The invention belongs to technical field of voice recognition, and in particular to a kind of vehicle configuration method, system, vehicle device and vehicle ?.
Background technique
With the development of vehicle technology, vehicle has become essential a part in people's life, interior scene At an important place in for people's lives.Requirement with people to vehicle is higher and higher, and people are more than and want to Safe driving, under the premise of guaranteeing safe driving, it is desirable also to possess good driving experience by people.
When driving vehicle, different driver and passenger can carry out vehicle according to personalized preference and level of comfort personalized Configuration, such as adjustment height of seat, door mirror angle and seat back angle, but when replacement driver and passenger, new driving Personnel need manual setting individual cultivation again, cumbersome reduction user experience.
With the development of voice technology, user drive when can to vehicle carry out interactive voice to realize relevant operation, Therefore, how vehicle to be carried out individual cultivation to improve user's car experience being that those skilled in the art need at present by voice It solves the problems, such as.
Summary of the invention
In order to solve the above problem in the prior art, in order to be needed again when solving prior art replacement driver and passenger The problem of manual setting vehicle personalization configures, the first aspect of the present invention provides a kind of vehicle configuration method, comprising:
It obtains the voice messaging of user and identifies the corresponding user's voiceprint of the voice messaging;
User's voiceprint is obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID Corresponding User ID;
Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.
Optionally, in provided vehicle configuration method, " identifying the corresponding user's voiceprint of the voice messaging " The step of include:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and according to the mean value meter Calculate fixed reference feature;
The phonetic feature of the voice messaging is extracted, and the gauss of distribution function is carried out according to the phonetic feature Interpolation obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of the voice messaging:
W=(s-m)/T
Wherein, w indicates the acoustic feature, and s indicates the target signature, and m indicates the fixed reference feature, described in T expression Global variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is constructed based on preset gauss hybrid models and preset voice training collection Model.
Optionally, it in provided vehicle configuration method, " is obtaining each high in the universal background model constructed in advance Before the step of mean value of this distribution function, and according to the mean value computation fixed reference feature ", the method also includes:
The phonetic feature that the voice training concentrates voice messaging is extracted, and utilizes speech detection algorithms by the voice The quiet data of feature is rejected, and universal background model training characteristics are obtained;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training general back Scape model.
Optionally, in provided vehicle configuration method, the voice training collection includes multiple preset sub- training sets;
The quantity of the universal background model is multiple and each universal background model and each son training Collection corresponds.
Optionally, in provided vehicle configuration method, " according to preset multiple voiceprints and multiple User ID One-to-one relationship obtains the corresponding User ID of user's voiceprint " the step of include:
Calculate separately the similarity between the multiple voiceprint and user's voiceprint;
Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to the similarity threshold if it exists, the then voiceprint for obtaining the similarity are corresponding User ID;
Multiple similarities more than or equal to the similarity threshold if it exists then obtain maximum phase in the multiple similarity Like the corresponding User ID of voiceprint of degree.
Optionally, in provided vehicle configuration method, the vehicle configuration operation includes body configurations operation, amusement Configuration operation and driving configuration operation.
Optionally, in provided vehicle configuration method, the body configurations operation includes adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, the entertainment configuration operation includes logging in multimedia corresponding with the User ID Account, the driving configuration operation include obtaining the User ID often to go to drive place.
The second aspect of the present invention provides a kind of vehicle configuration system, comprising:
User's voiceprint acquiring unit is configured to obtain the voice messaging of user and identifies that the voice messaging is corresponding User's voiceprint;
User ID acquiring unit is configured to the one-to-one relationship according to preset multiple voiceprints and multiple User ID Obtain the corresponding User ID of user's voiceprint;
Vehicle configuration operation execution unit is configured to execute User ID association corresponding with user's voiceprint in advance Vehicle configuration operation.
Optionally, in provided vehicle configuration system, user's voiceprint acquiring unit is further configured to Execute following operation:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and according to the mean value meter Calculate fixed reference feature;
The phonetic feature of the voice messaging is extracted, and the gauss of distribution function is carried out according to the phonetic feature Interpolation obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of the voice messaging:
W=(s-m)/T
Wherein, w indicates the acoustic feature, and s indicates the target signature, and m indicates the fixed reference feature, described in T expression Global variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is constructed based on preset gauss hybrid models and preset voice training collection Model.
Optionally, in provided vehicle configuration system, the vehicle configuration system further includes model training unit, institute Model training unit is stated to be configured to execute following operation:
The phonetic feature that the voice training concentrates voice messaging is extracted, by speech detection algorithms by the phonetic feature Quiet data reject, obtain universal background model training characteristics;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training common background mould Type.
Optionally, in provided vehicle configuration system, the voice training collection includes multiple preset sub- training sets;
The quantity of the universal background model is multiple and each universal background model and each son training Collection corresponds.
Optionally, in provided vehicle configuration system, the User ID acquiring unit be further configured to execute with Lower operation:
Calculate separately the similarity between the multiple voiceprint and user's voiceprint;
Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to the similarity threshold if it exists, the then voiceprint for obtaining the similarity are corresponding User ID;
Multiple similarities more than or equal to the similarity threshold if it exists then obtain maximum phase in the multiple similarity Like the corresponding User ID of voiceprint of degree.
Optionally, in provided vehicle configuration system, the vehicle configuration operation includes body configurations operation, amusement Configuration operation and driving configuration operation.
Optionally, in provided vehicle configuration system, the body configurations operation includes adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, the entertainment configuration operation includes logging in and the associated multimedia of the User ID Account, the driving configuration operation, which includes that the acquisition User ID is corresponding, often goes to drive place.
Third aspect present invention provides a kind of vehicle device, including vehicle configuration system described in any of the above embodiments.
The fourth aspect of the present invention provides a kind of vehicle, including vehicle device described above.
The fifth aspect of the present invention provides a kind of storage device, wherein be stored with a plurality of program, described program be suitable for by Processor is loaded to execute vehicle configuration method described in any of the above embodiments.
The sixth aspect of the present invention provides a kind of control device, including processor and storage equipment;The storage equipment, Suitable for storing a plurality of program;Described program is suitable for being loaded by the processor to execute vehicle configuration side described in any of the above embodiments Method.
Compared with the immediate prior art, above-mentioned technical proposal is at least had the following beneficial effects:
1, vehicle configuration method provided by the invention, can be by obtaining the voice messaging of user and identifying voice messaging pair The user's voiceprint answered obtains user's vocal print according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of information, executes User ID corresponding with voiceprint associated vehicle configuration operation in advance, user with vehicle Carry out interactive voice during, vehicle can execute automatically associated with the User ID according to the individual cultivation before user Vehicle configuration operation, improves the car experience of user.
2, vehicle configuration method provided by the invention can identify the corresponding voiceprint of voice messaging, according to multiple sound The one-to-one relationship of line information and multiple User ID obtains the corresponding User ID of voiceprint, uses when there is multiple vehicles to use When family, corresponding vehicle configuration operation can be executed according to the ID of different user, improves the intelligence of vehicle.
3, vehicle configuration method provided by the invention, can be according to the multiple common background moulds of the classification of voice training collection training Type obtains the voiceprint of voice messaging based on trained universal background model, and the frame alignment of voice messaging can be improved Accuracy, and then available and user's strong correlation voiceprint, improve the accuracy rate of identification.
Scheme 1, a kind of vehicle configuration method, characterized by comprising:
It obtains the voice messaging of user and identifies the corresponding user's voiceprint of the voice messaging;
It is corresponding with the one-to-one relationship of multiple User ID acquisition user's voiceprint according to preset multiple voiceprints User ID;
Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.
Scheme 2, vehicle configuration method according to scheme 1, which is characterized in that " identify the corresponding user of the voice messaging The step of voiceprint " includes:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to the mean value computation Examine feature;
The phonetic feature of the voice messaging is extracted, and according to the phonetic feature gauss of distribution function is carried out slotting Value, obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Described in method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of voice messaging:
W=(s-m)/T
Wherein, w indicates that the acoustic feature, s indicate the target signature, and m indicates that the fixed reference feature, T indicate the overall situation Variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.
Scheme 3, the vehicle configuration method according to scheme 2, which is characterized in that " obtaining the common background mould constructed in advance Before the step of mean value of each gauss of distribution function in type, and according to the mean value computation fixed reference feature ", the method is also Include:
The phonetic feature that the voice training concentrates voice messaging is extracted, and utilizes speech detection algorithms by the phonetic feature Quiet data reject, obtain universal background model training characteristics;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training common background mould Type.
Scheme 4, vehicle configuration method according to scheme 3, which is characterized in that the voice training collection includes multiple preset Sub- training set;
The quantity of the universal background model is multiple and each universal background model and each sub- training set one One is corresponding.
Scheme 5, vehicle configuration method according to scheme 1, which is characterized in that " according to preset multiple voiceprints and more The one-to-one relationship of a User ID obtains the corresponding User ID of user's voiceprint " the step of include:
Calculate separately the similarity between the multiple voiceprint and user's voiceprint;
Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to the similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID;
Multiple similarities more than or equal to the similarity threshold if it exists, then obtain maximum similarity in the multiple similarity The corresponding User ID of voiceprint.
Scheme 6, vehicle configuration method according to scheme 1, which is characterized in that the vehicle configuration operation includes body configurations Operation, entertainment configuration operation and driving configuration operation.
Scheme 7, vehicle configuration method according to scheme 6, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in be associated with the User ID Multimedia account, driving configuration operation includes obtaining that the User ID is corresponding often to go to drive place.
Scheme 8, a kind of vehicle configuration system, characterized by comprising:
User's voiceprint acquiring unit is configured to obtain the voice messaging of user and identifies the corresponding user of the voice messaging Voiceprint;
User ID acquiring unit is configured to be obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of user's voiceprint;
Vehicle configuration operation execution unit is configured to execute the associated vehicle of User ID corresponding with user's voiceprint in advance Configuration operation.
Scheme 9, the vehicle configuration system according to scheme 8, which is characterized in that user's voiceprint acquiring unit is into one Step is configured to execute following operation:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to the mean value computation Examine feature;
The phonetic feature of the voice messaging is extracted, and according to the phonetic feature gauss of distribution function is carried out slotting Value, obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Described in method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of voice messaging:
W=(s-m)/T
Wherein, w indicates that the acoustic feature, s indicate the target signature, and m indicates that the fixed reference feature, T indicate the overall situation Variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.
Scheme 10, vehicle configuration system according to scheme 9, which is characterized in that the vehicle configuration system further includes model Training unit, the model training unit are configured to execute following operation:
The phonetic feature that the voice training concentrates voice messaging is extracted, by speech detection algorithms by the quiet of the phonetic feature Sound data are rejected, and universal background model training characteristics are obtained;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training universal background model.
Scheme 11, vehicle configuration system according to scheme 10, which is characterized in that the voice training collection includes multiple default Sub- training set;
The quantity of the universal background model is multiple and each universal background model and each sub- training set one One is corresponding.
Scheme 12, the vehicle configuration system according to scheme 8, which is characterized in that the User ID acquiring unit is further matched It is set to the following operation of execution:
Calculate separately the similarity between the multiple voiceprint and user's voiceprint;
Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to the similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID;
Multiple similarities more than or equal to the similarity threshold if it exists, then obtain maximum similarity in the multiple similarity The corresponding User ID of voiceprint.
Scheme 13, the vehicle configuration system according to scheme 8, which is characterized in that the vehicle configuration operation is matched including car body It sets operation, entertainment configuration operation and drives configuration operation.
Scheme 14, vehicle configuration system according to scheme 13, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in close with the User ID The multimedia account of connection, the driving configuration operation, which includes that the acquisition User ID is corresponding, often goes to drive place.
Scheme 15, a kind of vehicle device, which is characterized in that including vehicle configuration system described in any one of scheme 8-14.
Scheme 16, a kind of vehicle, which is characterized in that including vehicle device described in scheme 15.
Scheme 17, a kind of storage device, wherein being stored with a plurality of program, which is characterized in that described program is suitable for being added by processor It carries with vehicle configuration method described in any one of the 1-7 that carries into execution a plan.
Scheme 18, a kind of control device, including processor and storage equipment;The storage equipment is suitable for storing a plurality of program;Its It is characterized in that, described program is suitable for being loaded as the processor with vehicle configuration method described in any one of the 1-7 that carries into execution a plan.
Detailed description of the invention
Fig. 1 is the key step schematic diagram of the vehicle configuration method of an embodiment of the present invention;
Fig. 2 is the key step schematic diagram of the corresponding voiceprint of identification voice messaging of an embodiment of the present invention;
Fig. 3 is the primary structure schematic diagram of the vehicle configuration system of an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art Every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.
The preferred embodiment of the present invention described with reference to the accompanying drawings.It will be apparent to a skilled person that this A little embodiments are used only for explaining technical principle of the invention, it is not intended that limit the scope of the invention.
Refering to attached drawing 1, Fig. 1 illustratively gives the key step of vehicle configuration method in the present embodiment.Such as Fig. 1 institute Show, vehicle configuration method includes the following steps: in the present embodiment
Step S101: the voice messaging of user is obtained.
User is during carrying out interactive voice with vehicle, the available user of vehicle voice messaging currently entered, Wherein it is possible to obtain the voice messaging of user by the voice acquisition device of vehicle, the voice messaging of user may include user With the interactive voice of vehicle and the registration voice of user.What the interactive voice of user and vehicle can send for user to vehicle Voice, such as " playing music " are controlled, the registration voice of user can be user and carry out specified number according to the prompt of onboard system The voice of specified duration typing is measured, onboard system can store the registration voice of multiple common users.
Step S102: the corresponding user's voiceprint of identification voice messaging.
The voice messaging of user may include the content and user's voiceprint of voice, can be with by user's voiceprint Differentiate the identity of user.
The identity that user is differentiated by user's voiceprint may include registration phase and cognitive phase.In registration phase, Several voice messagings can be collected or recorded for each user, and vehicle device is extracted from voice messaging embodies user personality spy The parameters,acoustic of sign, the parameters,acoustic for embodying the user personality feature can be user's voiceprint, further according to certain model Method for building up forms the template for being directed to the user, and is stored in user template library.In cognitive phase, system can with registration rank The identical method of section extracts the voiceprint of user speech information to be identified, and the body of user is differentiated according to certain method of discrimination Part.
Refering to attached drawing 2, Fig. 2, which illustratively gives, identifies the corresponding user's voiceprint of voice messaging in the present embodiment Key step.As shown in Fig. 2, identifying that the corresponding user's voiceprint of voice messaging may include following step in the present embodiment:
Step S1021: the mean value of each gauss of distribution function in the universal background model that acquisition constructs in advance, and according to Mean value computation fixed reference feature.
Universal background model (UBM, Universal Background Model) can pass through the language of a large amount of speaker Sound training forms, and has very high degree of mixing and reference background meaning, and careful statistical nature point can be carried out to voice messaging Cloth description.Wherein, universal background model can be based on preset gauss hybrid models (GMM, Gaussian Mixed Model) and preset voice training collection building model.
Gauss hybrid models are the feature distribution situations that speaker is described with the linear combination of multiple Gaussian Profiles, when identification The corresponding speaker of speaker model of tested speech feature can will most be generated as recognition result, same speaker is when different Even the phase says that also it occur frequently that biggish variation, but the personal characteristics of the speaker has from the angle of statistics in short There is consistency, the gauss hybrid models Speaker Identification effect unrelated for text is preferable.Gauss hybrid models can be high by M The summation of this probability density function linear weighted function is constituted, and when the M of gauss hybrid models is sufficiently high, can subtly approach speaker The spatial distribution of characteristic vector.Preset voice training collection can be the voice collection being made of the voice messaging of a large amount of speaker It closes.
Specifically, in available universal background model each gauss of distribution function mean value, according to mean value computation voice The fixed reference feature of information, fixed reference feature can be the Gaussian mean super vector of universal background model, the super vector with specifically The channel for talking about people and speaker's voice is unrelated, has good reference background meaning.
Before the step S1021 the step of, the method for the present embodiment further includes trained universal background model, specifically, instruction The method for practicing universal background model includes the following steps:
Step S10211: the phonetic feature that voice training concentrates voice messaging is extracted, and will using speech detection algorithms The quiet data of phonetic feature is rejected, and universal background model training characteristics are obtained.
The training data of universal background model, therefore can not in training data from a large amount of different backgrounds and noise circumstance Avoid there are quiet datas, and it is the characteristic of voice that Application on Voiceprint Recognition, which utilizes, unrelated with quiet data, subsequent in order to reduce Calculation amount can use the quiet data that speech detection algorithms detect voice messaging, and it rejected from voice messaging, obtain To universal background model training characteristics.Wherein, speech detection algorithms can be VAD (Voice Activity Detection, language Sound activation detection) algorithm etc., universal background model training characteristics can be MFCC (Mel Frequency Cepstral Coefficent, mel-frequency cepstrum coefficient) feature etc., MFCC feature is the perception in order to simulate human ear to different frequency voice Ability and the nonlinear characteristic parameters similar with human hearing characteristic proposed.It should be noted that protection scope of the present invention It is not limited to above-mentioned specific embodiment, other than vad algorithm and MFCC feature, in the premise without departing from the principle of the present invention Under, speech detection algorithms can also can reject the algorithm of quiet data for other, and the training characteristics of universal background model can also Think that other can train the feature of universal background model.
Step S10212: according to universal background model training characteristics and expectation-maximization algorithm training common background is utilized Model.
After obtaining universal background model training characteristics, expectation-maximization algorithm training universal background model can use.Tool Body, training universal background model can use unsupervised e-learning, and the acoustic feature by counting each speaker is had Some probability density functions construct the model of speaker, and voice training collection may include multiple preset sub- training sets, training When, a universal background model can be trained with all data sets, but be the need to ensure that the equilibrium of training data, can also train Multiple universal background models, then integrate them, for example, voice training collection can be divided into two son instructions according to gender Practice collection, recycles the two sub- training sets that two universal background models are respectively trained, lack of balance data control can be effectively utilized Make the universal background model generated.The quantity of universal background model can be multiple and each universal background model and every height Training set can correspond.
Step S1022: extracting the phonetic feature of voice messaging, and is carried out to gauss of distribution function according to phonetic feature slotting Value, obtains the corresponding gauss of distribution function of phonetic feature.
The phonetic feature for extracting voice messaging, according to phonetic feature and using maximum a posteriori probability adaptive algorithm to general The Gaussian function of background model carries out linear interpolation, obtains the gauss hybrid models of user corresponding with the voice messaging, in turn Obtain the corresponding gauss of distribution function of phonetic feature.On the basis of obtaining universal background model, in conjunction with height related to user This mixed model can remove the influence in phonetic feature with background data similar portion, retain larger with background data difference Part, increase to the robustness of noise, improve recognition effect.It is each general in the case where there is multiple universal background models The Gaussian Mixture degree of background model can correspond, and facilitate the complete modification of model.
Step S1023: according to the mean value computation target signature of the corresponding gauss of distribution function of phonetic feature.
After obtaining the corresponding gauss of distribution function of phonetic feature, the mean value of gauss of distribution function is calculated as target spy Sign.In practical applications, if universal background model has C Gaussian component, the dimension of feature is F, then the dimension of target signature is C*F.Target signature contains the most individual information of user, can as the user characteristics with strong distinction, but Be target signature dimension it is higher, a large amount of redundancy and scramble data related with channel are contained, in order to reduce the later period Target signature can be decomposed using global space and carry out dimensionality reduction, obtain more compact feature by calculation amount.
Step S1024: global variable space matrix is obtained according to expectation-maximization algorithm and voice messaging.
In practical applications, it can use the Baum-Welch statistic that universal background model extracts every section of voice of user, Pass through expectation-maximization algorithm (EM, Expectation Maximization Algorithm) iteration based on maximum-likelihood criterion Solve global variable space matrix.Maximum likelihood requires the distribution of model description that can approach training dataset to the maximum extent Distribution, therefore, training set is bigger and can more react the true distribution of user characteristics, and the performance of user's identifying system is also better. The E step of expectation-maximization algorithm can be according to the mean value and variance of Baum-Welch normalized set acoustic feature, it is expected that maximum The mean value and maximum variance for the acoustic feature that the M step for changing algorithm can be such that E step calculates, are walked by iterative solution E step and M, can To obtain global variable space matrix.
Global variable space matrix can reduce the dimension of mean value super vector, reduce unwanted redundancy, reservation and user The relevant low-dimensional vector of identity indicates.In embodiments of the present invention, preparatory trained DNN (Deep Neural can be passed through Networks, deep neural network) model calculating Baum-Welch statistic, the accuracy of the frame alignment of voice messaging is improved, And then the available voiceprint with user's strong correlation, improve the accuracy rate of identification.
Step S1025: according to fixed reference feature, target signature and global variable space matrix and according to shown in formula (1) Method obtain voice messaging acoustic feature:
W=(s-m)/T (1)
Wherein, w indicates that acoustic feature, s indicate target signature, and m indicates that fixed reference feature, T indicate global variable space matrix.
After obtaining fixed reference feature, target signature and global variable space matrix, voice messaging can be further obtained Acoustic feature, acoustic feature had both included the information of user, further included the channel information of voice, but the presence meeting of channel information Interference is generated to user identity identification.
Step S1026: channel compensation is carried out to acoustic feature, obtains the corresponding voiceprint of voice messaging.
It, can be using general since the acoustic feature of voice messaging includes that channel information can generate interference to user identity identification Rate linear discriminant analysis (PLDA, Probabilistic Linear Discriminant Analysis) method is to acoustic feature Carry out channel compensation.Specifically, the voiceprint of user speech information, each user can be obtained according to expectation-maximization algorithm Every words can generate a voiceprint, the voiceprint of the same user is averaging, it is unique that the user can be obtained Voiceprint.
Step S103: user's vocal print is obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of information.
When user specifies duration typing voice according to the prompt progress specified quantity of onboard system, onboard system can be deposited It stores up the registration voice of multiple common users and obtains the corresponding voiceprint of user's registration voice according to step S101-S102, And the one-to-one relationship of multiple voiceprints Yu multiple User ID is established, and the corresponding relationship is stored in onboard system. After identifying user speech information corresponding user's voiceprint, from the one of preset multiple voiceprints and multiple User ID User ID corresponding with user's voiceprint is obtained in one corresponding relationship, is associated with so as to subsequent according to User ID execution Vehicle configuration operation.
Specifically, the similarity between multiple voiceprints and user's voiceprint can be calculated separately, respectively judgement meter Whether obtained multiple similarities are more than or equal to preset similarity threshold:
A similarity more than or equal to similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID;
Multiple similarities more than or equal to similarity threshold if it exists, then obtain the sound of maximum similarity in multiple similarities The corresponding User ID of line information.
In practical applications, it is understood that there may be the user for being not belonging to registration sound-groove model library interacts with vehicle, in order to mention The accuracy of height identification, can calculate the similarity between multiple voiceprints and user's voiceprint, and judgement calculates respectively Whether obtained multiple similarities are more than or equal to preset similarity threshold, if multiple similarities are respectively less than similarity threshold, Illustrate the voiceprint of the user not in registration sound-groove model library, it is multiple similar more than or equal to similarity threshold if it exists Degree, then using the corresponding User ID of the maximum voiceprint of similarity as the User ID of the user.In embodiments of the present invention, phase It can be Likelihood Score like degree.
Step S104: the associated vehicle configuration operation of User ID corresponding with voice messaging in advance is executed.
In practical applications, in order to improve the driving experience of user, onboard system can obtain the traveling number of user automatically According to driving data so that the later period provides intelligent service for user.Wherein, running data may include the driving that user often goes Place and point-of-interest etc., driving data may include that user adjusts height of seat and backrest angle etc..When onboard system obtains After taking the corresponding User ID of family voiceprint, the associated vehicle configuration behaviour of User ID corresponding with voice messaging in advance is executed Make, wherein vehicle configuration operation may include body configurations operation, entertainment configuration operation and driving configuration operation.Car body is matched Setting operation may include adjusting vehicle rearview mirror angle, seat back angle and height of seat, can also include the sky of vehicle Adjust set temperature etc..After user gets on the bus, vehicle can be adjusted to above-mentioned hardware facility to meet users ' individualized requirement automatically, In practical application scene, by taking height of seat as an example, if the height gap of a upper user and active user are larger, at upper one After user has used vehicle, active user gets on the bus after determining identity, and onboard system can be automatically according to the height of active user Height of seat is adjusted to suitable position by it.Entertainment configuration operation may include logging in and the associated multimedia account of User ID Deng, it specifically, can be with automated log on and the associated multimedia account of the User ID after determining user identity, acquisition user's sense The multimedia messages of interest, such as the music list of user's collection can obtain user when user wants to play music in time Interested music.Driving configuration operation, which may include that acquisition User ID is corresponding, often goes to driving place and point-of-interest etc., After determining user identity, available User ID is corresponding often to go to drive place, such as CompanyAddress, the home address of user, Further, it is also possible to obtain the point-of-interest of user, the power-up website gone as usual is rationally advised when vehicle needs to be powered on for user Draw route.
Vehicle configuration method provided by the invention, the voice messaging of available user simultaneously identify the corresponding use of voice messaging Family voiceprint obtains user's voiceprint pair according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The User ID answered, executes User ID corresponding with user's voiceprint associated vehicle configuration operation in advance, user with vehicle During carrying out interactive voice, vehicle can execute automatically and the associated vehicle of the User ID according to the individual cultivation before user Configuration operation, improve the car experience of user.
Although each step is described in the way of above-mentioned precedence in above-described embodiment, this field Technical staff is appreciated that the effect in order to realize the present embodiment, executes between different steps not necessarily in such order, It (parallel) execution simultaneously or can be executed with reverse order, these simple variations all protection scope of the present invention it It is interior.
Refering to attached drawing 3, Fig. 3 illustratively gives the primary structure of vehicle configuration system in the present embodiment.Such as Fig. 3 institute Show, vehicle configuration system includes that user's voiceprint acquiring unit 1, User ID acquiring unit 2 and vehicle configuration operation execute Unit 3.
User's voiceprint acquiring unit 1 is configured to obtain the voice messaging of user and identifies the corresponding use of voice messaging Family voiceprint;
User ID acquiring unit 2 is configured to be closed according to preset multiple voiceprints and the one-to-one correspondence of multiple User ID System obtains the corresponding User ID of user's voiceprint;
It is associated to be configured to execute User ID corresponding with user's voiceprint in advance for vehicle configuration operation execution unit 3 Vehicle configuration operation.
Optionally, in provided vehicle configuration system, user's voiceprint acquiring unit 1 is further configured to execute It operates below:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to mean value computation Examine feature;
The phonetic feature of voice messaging is extracted, and interpolation is carried out to gauss of distribution function according to phonetic feature, obtains language The corresponding gauss of distribution function of sound feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and voice messaging;
It is obtained according to fixed reference feature, target signature and global variable space matrix and according to method shown in formula (1) The acoustic feature of voice messaging:
Channel compensation is carried out to acoustic feature, obtains the corresponding voiceprint of voice messaging;
Wherein, universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.
Optionally, in provided vehicle configuration system, vehicle configuration system further includes model training unit, model instruction Practice unit to be configured to execute following operation:
Firstly, the phonetic feature that voice training concentrates voice messaging is extracted, by speech detection algorithms by phonetic feature Quiet data is rejected, and universal background model training characteristics are obtained.
Then, universal background model is trained according to universal background model training characteristics and using expectation-maximization algorithm.
Optionally, in provided vehicle configuration system, voice training collection includes multiple preset sub- training sets, accordingly ,
The quantity of universal background model is that multiple and each universal background model and every sub- training set correspond.
Optionally, in provided vehicle configuration system, User ID acquiring unit 2 can be further configured to execute with Lower operation:
Firstly, calculating separately the similarity between multiple voiceprints and user's voiceprint.
Then, judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID;
Multiple similarities more than or equal to similarity threshold if it exists, then obtain the sound of maximum similarity in multiple similarities The corresponding User ID of line information.
Optionally, in provided vehicle configuration system, vehicle configuration operation may include body configurations operation, amusement Configuration operation and driving configuration operation.
Optionally, in provided vehicle configuration system, body configurations operation may include adjusting vehicle rearview pitch-angle Degree, seat back angle and height of seat, entertainment configuration operation may include log in the associated multimedia account of User ID, Driving configuration operation, which may include that acquisition User ID is corresponding, often goes to driving place.
Further, it is based on the above system embodiment, it may include above-mentioned vehicle that the present invention also provides a kind of vehicle devices Configuration system.In the present embodiment, vehicle device can be that by the control system of specific function, in some cases, may Directly refer to the controller of the control system.It should be noted that protection scope of the present invention is not limited to this specific implementation Mode.Under the premise of without departing from the principle of the present invention, it should be appreciated by the person skilled in the art that vehicle device can be vehicle-mounted joy Happy information system can also be vehicle-mounted central control system and carried-on-vehicle computer system etc., the technology after these changes or replacement Scheme will fall within the scope of protection of the present invention.
Still further, being based on above-mentioned vehicle device embodiment, it may include above-mentioned reality that the present invention also provides a kind of vehicles Apply vehicle device described in example.
Still further, being based on above method embodiment, the present invention also provides a kind of storage devices, wherein being stored with more Program, the program may be adapted to be loaded as processor to execute vehicle configuration method described in above method embodiment.
Still further, being based on above method embodiment, it may include place that the present invention also provides a kind of control devices Manage device, storage equipment;Storage equipment may be adapted to store a plurality of program, which may be adapted to be loaded by processor on to execute State vehicle configuration method described in embodiment of the method.
Person of ordinary skill in the field can be understood that, for convenience of description and succinctly, system of the present invention The specific work process and related description of system, vehicle device and vehicle embodiments, can be with reference to the correspondence in preceding method embodiment Process, and with above method beneficial effect having the same, details are not described herein.
Those skilled in the art should be able to recognize that, side described in conjunction with the examples disclosed in the embodiments of the present disclosure Method step, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate electronic hardware and The interchangeability of software generally describes each exemplary composition and step according to function in the above description.These Function is executed actually with electronic hardware or software mode, specific application and design constraint depending on technical solution. Those skilled in the art can use different methods to achieve the described function each specific application, but this reality Now it should not be considered as beyond the scope of the present invention.
So far, it has been combined preferred embodiment shown in the drawings and describes technical solution of the present invention, still, this field Technical staff is it is easily understood that protection scope of the present invention is expressly not limited to these specific embodiments.Without departing from this Under the premise of the principle of invention, those skilled in the art can make equivalent change or replacement to the relevant technologies feature, these Technical solution after change or replacement will fall within the scope of protection of the present invention.

Claims (10)

1. a kind of vehicle configuration method, characterized by comprising:
It obtains the voice messaging of user and identifies the corresponding user's voiceprint of the voice messaging;
It is corresponding with the one-to-one relationship of multiple User ID acquisition user's voiceprint according to preset multiple voiceprints User ID;
Execute the associated vehicle configuration operation of User ID corresponding with user's voiceprint in advance.
2. vehicle configuration method according to claim 1, which is characterized in that " identify the corresponding user of the voice messaging The step of voiceprint " includes:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to the mean value computation Examine feature;
The phonetic feature of the voice messaging is extracted, and according to the phonetic feature gauss of distribution function is carried out slotting Value, obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Described in method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of voice messaging:
W=(s-m)/T
Wherein, w indicates that the acoustic feature, s indicate the target signature, and m indicates that the fixed reference feature, T indicate the overall situation Variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.
3. vehicle configuration method according to claim 2, which is characterized in that " obtaining the common background mould constructed in advance Before the step of mean value of each gauss of distribution function in type, and according to the mean value computation fixed reference feature ", the method is also Include:
The phonetic feature that the voice training concentrates voice messaging is extracted, and utilizes speech detection algorithms by the phonetic feature Quiet data reject, obtain universal background model training characteristics;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training common background mould Type.
4. vehicle configuration method according to claim 3, which is characterized in that the voice training collection includes multiple preset Sub- training set;
The quantity of the universal background model is multiple and each universal background model and each sub- training set one One is corresponding.
5. vehicle configuration method according to claim 1, which is characterized in that " according to preset multiple voiceprints and more The one-to-one relationship of a User ID obtains the corresponding User ID of user's voiceprint " the step of include:
Calculate separately the similarity between the multiple voiceprint and user's voiceprint;
Judge whether the multiple similarities being calculated are more than or equal to preset similarity threshold respectively:
A similarity more than or equal to the similarity threshold if it exists, then obtain the corresponding use of voiceprint of the similarity Family ID;
Multiple similarities more than or equal to the similarity threshold if it exists, then obtain maximum similarity in the multiple similarity The corresponding User ID of voiceprint.
6. vehicle configuration method according to claim 1, which is characterized in that the vehicle configuration operation includes body configurations Operation, entertainment configuration operation and driving configuration operation.
7. vehicle configuration method according to claim 6, which is characterized in that the body configurations operation includes adjusting vehicle Door mirror angle, seat back angle and height of seat, the entertainment configuration operation include logging in be associated with the User ID Multimedia account, driving configuration operation includes obtaining that the User ID is corresponding often to go to drive place.
8. a kind of vehicle configuration system, characterized by comprising:
User's voiceprint acquiring unit is configured to obtain the voice messaging of user and identifies the corresponding user of the voice messaging Voiceprint;
User ID acquiring unit is configured to be obtained according to the one-to-one relationship of preset multiple voiceprints and multiple User ID The corresponding User ID of user's voiceprint;
Vehicle configuration operation execution unit is configured to execute the associated vehicle of User ID corresponding with user's voiceprint in advance Configuration operation.
9. vehicle configuration system according to claim 8, which is characterized in that user's voiceprint acquiring unit is into one Step is configured to execute following operation:
The mean value of each gauss of distribution function in the universal background model constructed in advance is obtained, and is joined according to the mean value computation Examine feature;
The phonetic feature of the voice messaging is extracted, and according to the phonetic feature gauss of distribution function is carried out slotting Value, obtains the corresponding gauss of distribution function of the phonetic feature;
According to the mean value computation target signature of the corresponding gauss of distribution function of the phonetic feature;
Global variable space matrix is obtained according to expectation-maximization algorithm and the voice messaging;
Described in method shown according to the fixed reference feature, target signature and global variable space matrix and according to the following formula obtains The acoustic feature of voice messaging:
W=(s-m)/T
Wherein, w indicates that the acoustic feature, s indicate the target signature, and m indicates that the fixed reference feature, T indicate the overall situation Variable space matrix;
Channel compensation is carried out to the acoustic feature, obtains the corresponding voiceprint of the voice messaging;
Wherein, the universal background model is the mould constructed based on preset gauss hybrid models and preset voice training collection Type.
10. vehicle configuration system according to claim 9, which is characterized in that the vehicle configuration system further includes model Training unit, the model training unit are configured to execute following operation:
The phonetic feature that the voice training concentrates voice messaging is extracted, by speech detection algorithms by the quiet of the phonetic feature Sound data are rejected, and universal background model training characteristics are obtained;
According to the universal background model training characteristics and utilize the expectation-maximization algorithm training universal background model.
CN201811260569.6A 2018-10-26 2018-10-26 Vehicle configuration method and system, vehicle machine and vehicle Active CN109273002B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811260569.6A CN109273002B (en) 2018-10-26 2018-10-26 Vehicle configuration method and system, vehicle machine and vehicle

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811260569.6A CN109273002B (en) 2018-10-26 2018-10-26 Vehicle configuration method and system, vehicle machine and vehicle

Publications (2)

Publication Number Publication Date
CN109273002A true CN109273002A (en) 2019-01-25
CN109273002B CN109273002B (en) 2022-06-10

Family

ID=65193843

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811260569.6A Active CN109273002B (en) 2018-10-26 2018-10-26 Vehicle configuration method and system, vehicle machine and vehicle

Country Status (1)

Country Link
CN (1) CN109273002B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110932862A (en) * 2019-10-22 2020-03-27 安信通科技(澳门)有限公司 Vehicle-mounted equipment login management method and device in vehicle
CN112185344A (en) * 2020-09-27 2021-01-05 北京捷通华声科技股份有限公司 Voice interaction method and device, computer readable storage medium and processor
CN112298104A (en) * 2019-07-31 2021-02-02 比亚迪股份有限公司 Vehicle control method and device, storage medium, electronic equipment and vehicle
CN113888777A (en) * 2021-09-08 2022-01-04 南京金盾公共安全技术研究院有限公司 Voiceprint unlocking method and device based on cloud machine learning

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103730114A (en) * 2013-12-31 2014-04-16 上海交通大学无锡研究院 Mobile equipment voiceprint recognition method based on joint factor analysis model
CN103809854A (en) * 2014-02-20 2014-05-21 联想(北京)有限公司 Data processing method and electronic equipment
CN203753082U (en) * 2014-02-11 2014-08-06 北汽福田汽车股份有限公司 Identity recognition based vehicle setting system and vehicle
CN105895096A (en) * 2016-03-30 2016-08-24 乐视控股(北京)有限公司 Identity identification and voice interaction operating method and device
US20160358610A1 (en) * 2013-01-28 2016-12-08 Tencent Technology (Shenzhen) Company Limited Method and device for voiceprint recognition
CN106683673A (en) * 2016-12-30 2017-05-17 智车优行科技(北京)有限公司 Method, device and system for adjusting driving modes and vehicle
CN107886943A (en) * 2017-11-21 2018-04-06 广州势必可赢网络科技有限公司 Voiceprint recognition method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160358610A1 (en) * 2013-01-28 2016-12-08 Tencent Technology (Shenzhen) Company Limited Method and device for voiceprint recognition
CN103730114A (en) * 2013-12-31 2014-04-16 上海交通大学无锡研究院 Mobile equipment voiceprint recognition method based on joint factor analysis model
CN203753082U (en) * 2014-02-11 2014-08-06 北汽福田汽车股份有限公司 Identity recognition based vehicle setting system and vehicle
CN103809854A (en) * 2014-02-20 2014-05-21 联想(北京)有限公司 Data processing method and electronic equipment
CN105895096A (en) * 2016-03-30 2016-08-24 乐视控股(北京)有限公司 Identity identification and voice interaction operating method and device
CN106683673A (en) * 2016-12-30 2017-05-17 智车优行科技(北京)有限公司 Method, device and system for adjusting driving modes and vehicle
CN107886943A (en) * 2017-11-21 2018-04-06 广州势必可赢网络科技有限公司 Voiceprint recognition method and device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112298104A (en) * 2019-07-31 2021-02-02 比亚迪股份有限公司 Vehicle control method and device, storage medium, electronic equipment and vehicle
CN110932862A (en) * 2019-10-22 2020-03-27 安信通科技(澳门)有限公司 Vehicle-mounted equipment login management method and device in vehicle
CN112185344A (en) * 2020-09-27 2021-01-05 北京捷通华声科技股份有限公司 Voice interaction method and device, computer readable storage medium and processor
CN113888777A (en) * 2021-09-08 2022-01-04 南京金盾公共安全技术研究院有限公司 Voiceprint unlocking method and device based on cloud machine learning
CN113888777B (en) * 2021-09-08 2023-08-18 南京金盾公共安全技术研究院有限公司 Voiceprint unlocking method and device based on cloud machine learning

Also Published As

Publication number Publication date
CN109273002B (en) 2022-06-10

Similar Documents

Publication Publication Date Title
KR102562227B1 (en) Dialogue system, Vehicle and method for controlling the vehicle
US20210249035A1 (en) Emotion detection using speaker baseline
CN109273002A (en) Vehicle configuration method, system, vehicle device and vehicle
CN111415677B (en) Method, apparatus, device and medium for generating video
JP3667332B2 (en) Standard model creation apparatus and standard model creation method
EP3090429B1 (en) Modifying operations based on acoustic ambience classification
Bisio et al. Smart and robust speaker recognition for context-aware in-vehicle applications
CN105390136B (en) Vehicle arrangement control device and method for user's adaptive type service
CN110178178A (en) Microphone selection and multiple talkers segmentation with environment automatic speech recognition (ASR)
CN106663446A (en) User environment aware acoustic noise reduction
CN108875682A (en) Information-pushing method and device
JP2017009826A (en) Group state determination device and group state determination method
US11393473B1 (en) Device arbitration using audio characteristics
Kashevnik et al. Multimodal corpus design for audio-visual speech recognition in vehicle cabin
Gajšek et al. Speaker state recognition using an HMM-based feature extraction method
JP2017009825A (en) Conversation state analyzing device and conversation state analyzing method
US20220360641A1 (en) Dynamic time-based playback of content in a vehicle
JP2019158975A (en) Utterance system
Chakraborty et al. Literature Survey
US11195534B1 (en) Permissioning for natural language processing systems
Loh et al. Speech recognition interactive system for vehicle
JP2005227794A (en) Device and method for creating standard model
CN111861666A (en) Vehicle information interaction method and device
WO2023159536A1 (en) Human-computer interaction method and apparatus, and terminal device
CN112863476A (en) Method and device for constructing personalized speech synthesis model, method and device for speech synthesis and testing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20200901

Address after: Susong Road West and Shenzhen Road North, Hefei Economic and Technological Development Zone, Anhui Province

Applicant after: Weilai (Anhui) Holding Co., Ltd

Address before: 30 Floor of Yihe Building, No. 1 Kangle Plaza, Central, Hong Kong, China

Applicant before: NIO NEXTEV Ltd.

GR01 Patent grant
GR01 Patent grant