File Download

There are no files associated with this item.

Supplementary

Conference Paper: Training Deep Nets with Imbalanced and Unlabeled Data

TitleTraining Deep Nets with Imbalanced and Unlabeled Data
Authors
Issue Date2012
PublisherInternational Speech Communication Association ( ISCA).
Citation
The 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, OR., 9-13 September 2012. In Conference Proceedings, 2012, v. 2, p. 1754-1757, abstract no. Wed.P7a.08 How to Cite?
AbstractTraining deep belief networks (DBNs) is normally done with large data sets. In this work, our goal is to predict {it traces} of the surface of the tongue in ultrasound images of the mouth during speech. Hand-tracing is labor-intensive; the dataset is highly imbalanced since many images are extremely similar. We propose a bootstrapping method which handles this imbalance by iteratively selecting a small subset of images to be hand-traced (thereby reducing human labor time), then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection, thereby achieving a three-fold reduction in human time required for tracing with human-level accuracy.
DescriptionSession WED.P7A: Audio Analysis, Estimation and Classification
Persistent Identifierhttp://hdl.handle.net/10722/202122
ISBN

 

DC FieldValueLanguage
dc.contributor.authorBerry, Jen_US
dc.contributor.authorFasel, Ien_US
dc.contributor.authorFadiga, Len_US
dc.contributor.authorArchangeli, DBen_US
dc.date.accessioned2014-08-21T08:04:55Z-
dc.date.available2014-08-21T08:04:55Z-
dc.date.issued2012en_US
dc.identifier.citationThe 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, OR., 9-13 September 2012. In Conference Proceedings, 2012, v. 2, p. 1754-1757, abstract no. Wed.P7a.08en_US
dc.identifier.isbn9781622767595-
dc.identifier.urihttp://hdl.handle.net/10722/202122-
dc.descriptionSession WED.P7A: Audio Analysis, Estimation and Classification-
dc.description.abstractTraining deep belief networks (DBNs) is normally done with large data sets. In this work, our goal is to predict {it traces} of the surface of the tongue in ultrasound images of the mouth during speech. Hand-tracing is labor-intensive; the dataset is highly imbalanced since many images are extremely similar. We propose a bootstrapping method which handles this imbalance by iteratively selecting a small subset of images to be hand-traced (thereby reducing human labor time), then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection, thereby achieving a three-fold reduction in human time required for tracing with human-level accuracy.en_US
dc.languageengen_US
dc.publisherInternational Speech Communication Association ( ISCA).en_US
dc.relation.ispartofAnnual Conference of the International Speech Communication Association, INTERSPEECH 2012en_US
dc.titleTraining Deep Nets with Imbalanced and Unlabeled Dataen_US
dc.typeConference_Paperen_US
dc.identifier.emailArchangeli, DB: darchang@hku.hken_US
dc.identifier.authorityArchangeli, DB=rp01748en_US
dc.identifier.hkuros232524en_US
dc.identifier.volume2en_US
dc.identifier.spage1754, abstract no. Wed.P7a.08-
dc.identifier.epage1757, abstract no. Wed.P7a.08-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats