File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1021/acs.jcim.0c00801
- Scopus: eid_2-s2.0-85096854028
- PMID: 33140954
- WOS: WOS:000608875100041
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: MultiCon: A Semi-Supervised Approach for Predicting Drug Function from Chemical Structure Analysis
Title | MultiCon: A Semi-Supervised Approach for Predicting Drug Function from Chemical Structure Analysis |
---|---|
Authors | |
Issue Date | 2020 |
Citation | Journal of Chemical Information and Modeling, 2020, v. 60, n. 12, p. 5995-6006 How to Cite? |
Abstract | Semi-supervised learning has proved its efficacy in utilizing extensive unlabeled data to alleviate the use of a large amount of supervised data and improve model performance. Despite its tremendous potential, semi-supervised learning has yet to be implemented in the field of drug discovery. Empirical testing of drugs and their classification is costly and time-consuming. In contrast, predicting therapeutic applications of drugs from their structural formulas using semi-supervised learning would reduce costs and time significantly. Herein, we employ a new multicontrastive-based semi-supervised learning algorithm - MultiCon - for classifying drugs into 12 categories, according to therapeutic applications, on the basis of image analyses of their structural formulas. By rational use of data balancing, online augmentations of the drug image data during training, and the combined use of multicontrastive loss with consistency regularization, MultiCon achieves better class prediction accuracies when compared with the state-of-the-art machine learning methods across a variety of existing semi-supervised learning benchmarks. In particular, it performs exceptionally well with a limited number of labeled examples. For instance, with just 5000 labeled drugs in a PubChem (D3) data set, MultiCon achieved a class prediction accuracy of 97.74%. |
Persistent Identifier | http://hdl.handle.net/10722/333481 |
ISSN | 2023 Impact Factor: 5.6 2023 SCImago Journal Rankings: 1.396 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sahoo, Pracheta | - |
dc.contributor.author | Roy, Indranil | - |
dc.contributor.author | Wang, Zhuoyi | - |
dc.contributor.author | Mi, Feng | - |
dc.contributor.author | Yu, Lin | - |
dc.contributor.author | Balasubramani, Pradeep | - |
dc.contributor.author | Khan, Latifur | - |
dc.contributor.author | Stoddart, J. Fraser | - |
dc.date.accessioned | 2023-10-06T05:19:42Z | - |
dc.date.available | 2023-10-06T05:19:42Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Journal of Chemical Information and Modeling, 2020, v. 60, n. 12, p. 5995-6006 | - |
dc.identifier.issn | 1549-9596 | - |
dc.identifier.uri | http://hdl.handle.net/10722/333481 | - |
dc.description.abstract | Semi-supervised learning has proved its efficacy in utilizing extensive unlabeled data to alleviate the use of a large amount of supervised data and improve model performance. Despite its tremendous potential, semi-supervised learning has yet to be implemented in the field of drug discovery. Empirical testing of drugs and their classification is costly and time-consuming. In contrast, predicting therapeutic applications of drugs from their structural formulas using semi-supervised learning would reduce costs and time significantly. Herein, we employ a new multicontrastive-based semi-supervised learning algorithm - MultiCon - for classifying drugs into 12 categories, according to therapeutic applications, on the basis of image analyses of their structural formulas. By rational use of data balancing, online augmentations of the drug image data during training, and the combined use of multicontrastive loss with consistency regularization, MultiCon achieves better class prediction accuracies when compared with the state-of-the-art machine learning methods across a variety of existing semi-supervised learning benchmarks. In particular, it performs exceptionally well with a limited number of labeled examples. For instance, with just 5000 labeled drugs in a PubChem (D3) data set, MultiCon achieved a class prediction accuracy of 97.74%. | - |
dc.language | eng | - |
dc.relation.ispartof | Journal of Chemical Information and Modeling | - |
dc.title | MultiCon: A Semi-Supervised Approach for Predicting Drug Function from Chemical Structure Analysis | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1021/acs.jcim.0c00801 | - |
dc.identifier.pmid | 33140954 | - |
dc.identifier.scopus | eid_2-s2.0-85096854028 | - |
dc.identifier.volume | 60 | - |
dc.identifier.issue | 12 | - |
dc.identifier.spage | 5995 | - |
dc.identifier.epage | 6006 | - |
dc.identifier.eissn | 1549-960X | - |
dc.identifier.isi | WOS:000608875100041 | - |