AG百家乐大转轮-AG百家乐导航_怎么看百家乐走势_全讯网官网 (中国)·官方网站

Research News

SYSU Makes Research Progress on Speakers’ Voiceprint Recognition in Big Data Era

Source: SYSU-CMU Joint Institute of Engineering
Written by: SYSU-CMU Joint Institute of Engineering
Edited by: Wang Dongmei

Recently, Dr. Ming Li of SYSU-CMU Joint Institute of Engineering (hereinafter referred to as JIE) and his team proposed an unsupervised learning framework for speaker verification, which is of great significance to refine the clustering labels in the big data era.

As one of the main sources for people to acquire information, voice is the most convenient, effective and natural communication tool and information carrier for people to communicate. With the comprehensive informatization of the society, especially the rapid development of communications, multimedia and Internet technologies, intelligent voice technology is becoming increasingly important. Therefore, one of the current research hotspots is to find methods that can verify speakers’ identity through voice signal more accurately.

The research group led by Dr. Ming Li presented an unsupervised learning framework for speaker verification where they seek to address the speaker verification problem without any given data labels. To automatically retrieve the speaker labels of unlabeled training data, the project team proposed to use Affinity Propagation (AP) - a clustering method that takes pairwise data similarity as an input - to generate temporary class labels. The obtained labels then can be used to train a so called “Probabilistic LDA” model in order to generate similarity score for pairwise speech samples. In addition, Ming’s group further fed such similarity score to the input of AP clustering, establishing an iterative framework that updates the PLDA model repeatedly. With the final PLDA model after several iterations, the system can accordingly verify whether the two speakers belong to the same identity. The project team also evaluated the performance of different PLDA scoring methods for the multiple-enrollment task. Experiments show that the proposed iterative and unsupervised PLDA model learning approach outperformed the cosine similarity baseline by more than 20%.

On the 9th International Symposium on Chinese Spoken Language Processing (ISCSLP 2014) and the 15th annual conference of the International Speech Communication Association (INTERSPEECH 2014) held in Singapore, Dr. Ming Li presented three papers about speakers’ voiceprint recognition. Among which, the paper titled “An Iterative Framework for Unsupervised Learning in the PLDA based Speaker Verification” co-authored by Wenbo Liu, Zhiding Yu and Dr. Ming Li won the award of Best Student Paper. Wenbo Liu is a first-year dual-degree Ph.D. student affiliated with the SYSU-CMU Joint Institute of Engineering and the Department of ECE, Carnegie Mellon University, advised by Dr. Ming Li. Zhiding Yu is a third-year Ph.D. candidate at the Department of ECE, Carnegie Mellon University.

Ph.D. programs in JIE are committed to cultivating research talents who explore in depth the theory, methodology, techniques and instruments in the field of electrical and computer engineering, so as to enrich and improve the knowledge system in electrical and computer engineering. Students participating in the Ph.D. JIE double-degree program will study at Carnegie Mellon’s Pittsburgh campus for two years and will receive two degrees upon graduation — one from Sun Yat-sen University and one from Carnegie Mellon University.


大发888游戏平台dafa888 gw| 太阳城百家乐官网祖玛| 百家乐注码论坛| 百家乐官网秘籍下注法| 在线百家乐安卓| 威尼斯人娱乐城客户端| bet365足球| 百家乐官网游戏真人游戏| 至尊百家乐官网奇热| 真博百家乐官网的玩法技巧和规则| 百家乐娱乐真人娱乐| 威尼斯人娱乐下载平台| 网络棋牌游戏| A8百家乐官网娱乐网| 宾利百家乐现金网| 龙博娱乐城| 百家乐官网网络赌场| 百家乐英皇娱乐城| tt娱乐城开户| 新竹县| 91百家乐官网的玩法技巧和规则 | 香河县| 百家乐假在哪里| 大发888 游戏下载| 凯旋门百家乐官网游戏| 百家乐怎么玩高手| 云鼎百家乐官网现金网| 百家乐在发牌技巧| 百家乐的玩法技巧和规则| 百家乐官网扑克牌耙| 尊龙百家乐官网娱乐城| 大发888下载网站| 百家乐官网最常见的路子| 百家乐几点不用补牌| 六合彩开奖查询| 利澳百家乐娱乐城| 真人百家乐官网代理合作| 百家乐7杀6| 网络赌博游戏| 老虎机游戏下载| 百家乐官网统计软件|