Tiếp theo

Learning Speaker Representations with Mutual Information

2 Lượt xem· 10/05/19
Олег С.
Олег С.
2 Người đăng ký
2
Trong Khác

In this brief video, I summarize a technique called Local Info Max (LIM) that learns speaker identities using mutual information. LIM is based on a neural encoder that converts raw samples into a high-level speaker representation. Training is conducted in a self-supervised way without explicitly using speaker labels.

Cho xem nhiều hơn

 0 Bình luận sort   Sắp xếp theo


Tiếp theo