Conference Paper: Entropy coding for training deep belief networks with imbalanced and unlabeled data
Title | Entropy coding for training deep belief networks with imbalanced and unlabeled data |
---|---|
Authors | Berry, J; Fasel, I; Fadiga, L; Archangeli, D |
Keywords | Physics; Sound |
Issue Date | 2012 |
Publisher | Acoustical Society of America. The Journal's web site is located at http://asa.aip.org/jasa.html |
Citation | The ACOUSTICS 2012 Hong Kong Conference & Exhibition, Hong Kong, 13-18 May 2012. In Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1 |
Abstract | Training deep belief networks (DBNs) is normally done with large data sets. In this work, the goal is to predict traces of the surface of the tongue in ultrasound images of the mouth during speech. Performance on this task can be dramatically enhanced by pre-training a DBN jointly on human-supplied traces and ultrasound images, then training a modified version of the network to predict traces from ultrasound only. However, hand-tracing the entire dataset of ultrasound images is extremely labor-intensive. Moreover, the dataset is highly imbalanced since many images are extremely similar. This work presents a bootstrapping method which takes advantage of this imbalance, iteratively selecting a small subset of images to be hand-traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach, a three-fold reduction in human time required to trace an entire dataset with human-level accuracy was achieved. |
Description | Session 1aSCb - Speech Communication: Speech Processing Potpourri (Poster Session): no. 1aSCb1 |
Persistent Identifier | http://hdl.handle.net/10722/211020 |
ISSN | 0001-4966 (2023 Impact Factor: 2.1; 2023 SCImago Journal Rankings: 0.687) |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Berry, J | - |
dc.contributor.author | Fasel, I | - |
dc.contributor.author | Fadiga, L | - |
dc.contributor.author | Archangeli, D | - |
dc.date.accessioned | 2015-06-30T07:55:39Z | - |
dc.date.available | 2015-06-30T07:55:39Z | - |
dc.date.issued | 2012 | - |
dc.identifier.citation | The ACOUSTICS 2012 Hong Kong Conference & Exhibition, Hong Kong, 13-18 May 2012. In Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1 | - |
dc.identifier.issn | 0001-4966 | - |
dc.identifier.uri | http://hdl.handle.net/10722/211020 | - |
dc.description | Session 1aSCb - Speech Communication: Speech Processing Potpourri (Poster Session): no. 1aSCb1 | - |
dc.description.abstract | Training deep belief networks (DBNs) is normally done with large data sets. In this work, the goal is to predict traces of the surface of the tongue in ultrasound images of the mouth during speech. Performance on this task can be dramatically enhanced by pre-training a DBN jointly on human-supplied traces and ultrasound images, then training a modified version of the network to predict traces from ultrasound only. However, hand-tracing the entire dataset of ultrasound images is extremely labor-intensive. Moreover, the dataset is highly imbalanced since many images are extremely similar. This work presents a bootstrapping method which takes advantage of this imbalance, iteratively selecting a small subset of images to be hand-traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach, a three-fold reduction in human time required to trace an entire dataset with human-level accuracy was achieved. | - |
dc.language | eng | - |
dc.publisher | Acoustical Society of America. The Journal's web site is located at http://asa.aip.org/jasa.html | - |
dc.relation.ispartof | Journal of the Acoustical Society of America | - |
dc.rights | Copyright 2012 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The following article appeared in Journal of the Acoustical Society of America, 2012, v. 131 n. 4, p. 3235, abstract no. 1aSCb1 and may be found at https://doi.org/10.1121/1.4708066 | - |
dc.subject | Physics | - |
dc.subject | Sound | - |
dc.title | Entropy coding for training deep belief networks with imbalanced and unlabeled data | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Archangeli, D: darchang@hku.hk | - |
dc.identifier.authority | Archangeli, D=rp01748 | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1121/1.4708066 | - |
dc.identifier.volume | 131 | - |
dc.identifier.issue | 4 | - |
dc.identifier.spage | 3235, abstract no. 1aSCb1 | - |
dc.identifier.epage | 3235, abstract no. 1aSCb1 | - |
dc.publisher.place | United States | - |
dc.identifier.issnl | 0001-4966 | - |
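
The abstract mentions an entropy-based diversity measure for choosing the initial subset of images to hand-trace, but this record does not say how that measure is defined. The sketch below is therefore only an illustration of the general idea, not the authors' method. It assumes each image has already been reduced to a coarse discrete code (for example a cluster label), and it greedily picks images so that the Shannon entropy of the selected codes is as high as possible. The names `shannon_entropy`, `select_diverse`, and the `codes` input are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(labels):
    """Shannon entropy (in bits) of the empirical distribution of labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def select_diverse(codes, k):
    """Greedily pick up to k indices whose codes maximize subset entropy.

    codes: one hashable coarse code per image. This representation is an
    assumption made for illustration; the source does not specify it.
    """
    selected = []
    remaining = list(range(len(codes)))
    for _ in range(min(k, len(codes))):
        # Choose the candidate that most increases the entropy of the
        # codes selected so far (ties go to the earliest index).
        best = max(
            remaining,
            key=lambda i: shannon_entropy(
                [codes[j] for j in selected] + [codes[i]]
            ),
        )
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy imbalanced data: eight near-identical images and two rare ones.
codes = ["a"] * 8 + ["b", "c"]
picked = select_diverse(codes, 3)
print([codes[i] for i in picked])
```

On imbalanced data like the toy example, a uniform random draw of three images would most often return mostly copies of the common code, whereas the greedy entropy criterion pulls in the rare ones first. That is the intuition behind exploiting the imbalance to reduce hand-tracing effort, under the assumptions stated above.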