File Download

There are no files associated with this item.

Supplementary

Conference Paper: Entropy coding for training deep belief networks with imbalanced and unlabeled data

TitleEntropy coding for training deep belief networks with imbalanced and unlabeled data
Authors
Issue Date2011
Citation
The 28th International Conference on Machine Learning (ICML 2011), Bellevue, WA., 28 June-2 July 2011. How to Cite?
AbstractTraining deep belief networks (DBNs) is nor- mally done with large data sets. In this work, our goal is to predict traces of the surface of the tongue in ultrasound images of the mouth during speech. Performance on this task can be dramatically enhanced by pre- training a DBN jointly on human-supplied traces and ultrasound images, then training a modified version of the network to pre- dict traces from ultrasound only. However hand-tracing the entire dataset of ultrasound images is extremely labor intensive. More- over, the dataset is highly imbalanced since many images are extremely similar. Here we present a bootstrapping method which takes advantage of this imbalance, iteratively se- lecting a small subset of images to be hand- traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach we achieve a three-fold reduction in human time required to trace an entire dataset with human-level accuracy.
Persistent Identifierhttp://hdl.handle.net/10722/205582

 

DC FieldValueLanguage
dc.contributor.authorBerry, J-
dc.contributor.authorFasel, I-
dc.contributor.authorFadiga, L-
dc.contributor.authorArchangeli, DB-
dc.date.accessioned2014-09-20T04:13:58Z-
dc.date.available2014-09-20T04:13:58Z-
dc.date.issued2011-
dc.identifier.citationThe 28th International Conference on Machine Learning (ICML 2011), Bellevue, WA., 28 June-2 July 2011.-
dc.identifier.urihttp://hdl.handle.net/10722/205582-
dc.description.abstractTraining deep belief networks (DBNs) is nor- mally done with large data sets. In this work, our goal is to predict traces of the surface of the tongue in ultrasound images of the mouth during speech. Performance on this task can be dramatically enhanced by pre- training a DBN jointly on human-supplied traces and ultrasound images, then training a modified version of the network to pre- dict traces from ultrasound only. However hand-tracing the entire dataset of ultrasound images is extremely labor intensive. More- over, the dataset is highly imbalanced since many images are extremely similar. Here we present a bootstrapping method which takes advantage of this imbalance, iteratively se- lecting a small subset of images to be hand- traced, then (re)training the DBN, making use of an entropy-based diversity measure for the initial selection. With this approach we achieve a three-fold reduction in human time required to trace an entire dataset with human-level accuracy.-
dc.languageeng-
dc.relation.ispartofInternational Conference on Machine Learning, ICML 2011-
dc.titleEntropy coding for training deep belief networks with imbalanced and unlabeled data-
dc.typeConference_Paper-
dc.identifier.emailArchangeli, DB: darchang@hku.hk-
dc.identifier.authorityArchangeli, DB=rp01748-
dc.identifier.hkuros232525-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats