File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: Training Deep Nets with Imbalanced and Unlabeled Data
Title | Training Deep Nets with Imbalanced and Unlabeled Data |
---|---|
Authors | |
Issue Date | 2012 |
Publisher | International Speech Communication Association ( ISCA). |
Citation | The 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, OR., 9-13 September 2012. In Conference Proceedings, 2012, v. 2, p. 1754-1757, abstract no. Wed.P7a.08 How to Cite? |
Abstract | Training deep belief networks (DBNs) is normally done with large data sets. In this work, our goal is to predict {it traces} of the surface of the tongue in ultrasound images of the mouth during speech. Hand-tracing is labor-intensive; the dataset is highly imbalanced since many images are extremely similar. We propose a bootstrapping method which handles this imbalance by iteratively selecting a small subset of images to be hand-traced (thereby reducing human labor time), then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection, thereby achieving a three-fold reduction in human time required for tracing with human-level accuracy. |
Description | Session WED.P7A: Audio Analysis, Estimation and Classification |
Persistent Identifier | http://hdl.handle.net/10722/202122 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Berry, J | en_US |
dc.contributor.author | Fasel, I | en_US |
dc.contributor.author | Fadiga, L | en_US |
dc.contributor.author | Archangeli, DB | en_US |
dc.date.accessioned | 2014-08-21T08:04:55Z | - |
dc.date.available | 2014-08-21T08:04:55Z | - |
dc.date.issued | 2012 | en_US |
dc.identifier.citation | The 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, OR., 9-13 September 2012. In Conference Proceedings, 2012, v. 2, p. 1754-1757, abstract no. Wed.P7a.08 | en_US |
dc.identifier.isbn | 9781622767595 | - |
dc.identifier.uri | http://hdl.handle.net/10722/202122 | - |
dc.description | Session WED.P7A: Audio Analysis, Estimation and Classification | - |
dc.description.abstract | Training deep belief networks (DBNs) is normally done with large data sets. In this work, our goal is to predict {it traces} of the surface of the tongue in ultrasound images of the mouth during speech. Hand-tracing is labor-intensive; the dataset is highly imbalanced since many images are extremely similar. We propose a bootstrapping method which handles this imbalance by iteratively selecting a small subset of images to be hand-traced (thereby reducing human labor time), then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection, thereby achieving a three-fold reduction in human time required for tracing with human-level accuracy. | en_US |
dc.language | eng | en_US |
dc.publisher | International Speech Communication Association ( ISCA). | en_US |
dc.relation.ispartof | Annual Conference of the International Speech Communication Association, INTERSPEECH 2012 | en_US |
dc.title | Training Deep Nets with Imbalanced and Unlabeled Data | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Archangeli, DB: darchang@hku.hk | en_US |
dc.identifier.authority | Archangeli, DB=rp01748 | en_US |
dc.identifier.hkuros | 232524 | en_US |
dc.identifier.volume | 2 | en_US |
dc.identifier.spage | 1754, abstract no. Wed.P7a.08 | - |
dc.identifier.epage | 1757, abstract no. Wed.P7a.08 | - |