Learning Speaker Representations with Mutual Information
2 views · 10/05/19 · in Other
In this brief video, I summarize Local Info Max (LIM), a technique that learns speaker representations using mutual information. LIM is based on a neural encoder that converts raw speech samples into a high-level speaker representation. Training is self-supervised and does not use explicit speaker labels.
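To make the idea concrete, here is a minimal sketch of one training signal of this kind, not the method exactly as presented in the video. It assumes that two chunks drawn from the same utterance share a speaker (positive pair), that a chunk from a different utterance serves as a negative, and that mutual information is encouraged through a discriminator trained with binary cross-entropy. The linear encoder, the logistic discriminator, and all names (`sample_chunk`, `encode`, `discriminate`, `bce_pair_loss`) are hypothetical stand-ins, and only the forward pass is shown.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_chunk(wave, length, rng):
    """Pick a random fixed-length chunk of raw samples from one utterance."""
    start = rng.integers(0, len(wave) - length + 1)
    return wave[start:start + length]

def encode(chunk, W):
    """Toy stand-in for the neural encoder: one linear layer plus tanh."""
    return np.tanh(W @ chunk)

def discriminate(z_a, z_b, v):
    """Toy discriminator: logistic score on the concatenated pair."""
    logit = v @ np.concatenate([z_a, z_b])
    return 1.0 / (1.0 + np.exp(-logit))

def bce_pair_loss(utt_anchor, utt_other, W, v, chunk_len, rng):
    """Binary cross-entropy over one positive and one negative pair."""
    anchor = encode(sample_chunk(utt_anchor, chunk_len, rng), W)
    # Assumption: chunks from the same utterance come from the same speaker.
    positive = encode(sample_chunk(utt_anchor, chunk_len, rng), W)
    # Assumption: a chunk from another utterance acts as the negative sample.
    negative = encode(sample_chunk(utt_other, chunk_len, rng), W)
    p_pos = discriminate(anchor, positive, v)
    p_neg = discriminate(anchor, negative, v)
    eps = 1e-12  # avoids log(0)
    return -(np.log(p_pos + eps) + np.log(1.0 - p_neg + eps))

# Random noise stands in for two raw waveforms (no real audio, no labels).
chunk_len, emb_dim = 3200, 16
W = rng.normal(scale=0.01, size=(emb_dim, chunk_len))
v = rng.normal(scale=0.1, size=2 * emb_dim)
utt_a = rng.normal(size=16000)
utt_b = rng.normal(size=16000)
loss = bce_pair_loss(utt_a, utt_b, W, v, chunk_len, rng)
```

In an actual system the encoder and discriminator would be neural networks trained jointly by gradient descent on this loss; no speaker labels appear anywhere, only the knowledge of which utterance each chunk came from.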