File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/BigData.2018.8622090
- Scopus: eid_2-s2.0-85062631268
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: CUImage: A Neverending Learning Platform on a Convolutional Knowledge Graph of Billion Web Images
Title | CUImage: A Neverending Learning Platform on a Convolutional Knowledge Graph of Billion Web Images |
---|---|
Authors | |
Keywords | Graph Convolutional Network Deep Learning Knowledge Graph Large-scale System and Database |
Issue Date | 2019 |
Citation | Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018, 2019, p. 1787-1796 How to Cite? |
Abstract | © 2018 IEEE. Pretraining visual features by image classification on ImageNet is an indispensable step towards many advanced perception systems in the last decade. ImageNet is the most prevalent database for supervised pretraining of image features. Unlike ImageNet assuming that the visual concepts are static and independent with each other, this work presents a neverending learning platform, termed CUImage, which learns visual representation on a knowledge graph of billions of images, whose data scale is several orders of magnitude larger than ImageNet. A novel dynamic graph convolutional network (GCN) is proposed to learn visual concepts. Once the new data are presented, the GCN is updated dynamically where new concepts can be discovered or existing concepts can be merged. This is enabled by three main components in CUImage, including Data Dispersion (DD), Data Management and Mining (DMM), and Data Evaluation (DE). These three components are built on top of a computer cluster with thousands of GPU/CPU cores and a parallel storage of petabytes. So far, CUImage has processed and managed more than 2 million visual concepts of 2 billion images. To evaluate the learned representation, we transfer the pretrained features to several challenging benchmarks such as image recognition on ImageNet and object detection in MS-COCO. We achieve state-of-the-art results, significantly surpassing the systems that used ImageNet for pretraining. The codes, data, and models will be released. |
Persistent Identifier | http://hdl.handle.net/10722/273692 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Peng, Zhanglin | - |
dc.contributor.author | Wu, Lingyun | - |
dc.contributor.author | Ren, Jiamin | - |
dc.contributor.author | Zhang, Ruimao | - |
dc.contributor.author | Luo, Ping | - |
dc.date.accessioned | 2019-08-12T09:56:23Z | - |
dc.date.available | 2019-08-12T09:56:23Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018, 2019, p. 1787-1796 | - |
dc.identifier.uri | http://hdl.handle.net/10722/273692 | - |
dc.description.abstract | © 2018 IEEE. Pretraining visual features by image classification on ImageNet is an indispensable step towards many advanced perception systems in the last decade. ImageNet is the most prevalent database for supervised pretraining of image features. Unlike ImageNet assuming that the visual concepts are static and independent with each other, this work presents a neverending learning platform, termed CUImage, which learns visual representation on a knowledge graph of billions of images, whose data scale is several orders of magnitude larger than ImageNet. A novel dynamic graph convolutional network (GCN) is proposed to learn visual concepts. Once the new data are presented, the GCN is updated dynamically where new concepts can be discovered or existing concepts can be merged. This is enabled by three main components in CUImage, including Data Dispersion (DD), Data Management and Mining (DMM), and Data Evaluation (DE). These three components are built on top of a computer cluster with thousands of GPU/CPU cores and a parallel storage of petabytes. So far, CUImage has processed and managed more than 2 million visual concepts of 2 billion images. To evaluate the learned representation, we transfer the pretrained features to several challenging benchmarks such as image recognition on ImageNet and object detection in MS-COCO. We achieve state-of-the-art results, significantly surpassing the systems that used ImageNet for pretraining. The codes, data, and models will be released. | - |
dc.language | eng | - |
dc.relation.ispartof | Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018 | - |
dc.subject | Graph Convolutional Network | - |
dc.subject | Deep Learning | - |
dc.subject | Knowledge Graph | - |
dc.subject | Large-scale System and Database | - |
dc.title | CUImage: A Neverending Learning Platform on a Convolutional Knowledge Graph of Billion Web Images | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/BigData.2018.8622090 | - |
dc.identifier.scopus | eid_2-s2.0-85062631268 | - |
dc.identifier.spage | 1787 | - |
dc.identifier.epage | 1796 | - |