File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: CEMA – Cost-Efficient Machine-Assisted Document Annotations

TitleCEMA – Cost-Efficient Machine-Assisted Document Annotations
Authors
Issue Date26-Jun-2023
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI)
Abstract

We study the problem of semantically annotating textual documents that are complex in the sense that the documents are long, feature rich, and domain specific. Due to their complexity, such annotation tasks require trained human workers, which are very expensive in both time and money. We propose CEMA, a method for deploying machine learning to assist humans in complex document annotation. CEMA estimates the human cost of annotating each document and selects the set of documents to be annotated that strike the best balance between model accuracy and human cost. We conduct experiments on complex annotation tasks in which we compare CEMA against other document selection and annotation strategies. Our results show that CEMA is the most cost-efficient solution for those tasks.


Persistent Identifierhttp://hdl.handle.net/10722/333767
ISSN

 

DC FieldValueLanguage
dc.contributor.authorYuan, GW-
dc.contributor.authorKao, B-
dc.contributor.authorWu, TH-
dc.date.accessioned2023-10-06T08:38:54Z-
dc.date.available2023-10-06T08:38:54Z-
dc.date.issued2023-06-26-
dc.identifier.issn2159-5399-
dc.identifier.urihttp://hdl.handle.net/10722/333767-
dc.description.abstract<p>We study the problem of semantically annotating textual documents that are complex in the sense that the documents are long, feature rich, and domain specific. Due to their complexity, such annotation tasks require trained human workers, which are very expensive in both time and money. We propose CEMA, a method for deploying machine learning to assist humans in complex document annotation. CEMA estimates the human cost of annotating each document and selects the set of documents to be annotated that strike the best balance between model accuracy and human cost. We conduct experiments on complex annotation tasks in which we compare CEMA against other document selection and annotation strategies. Our results show that CEMA is the most cost-efficient solution for those tasks.<br></p>-
dc.languageeng-
dc.publisherAssociation for the Advancement of Artificial Intelligence (AAAI)-
dc.relation.ispartof37th AAAI Conference on Artificial Intelligence (07/02/2023-14/02/2023, Washington, DC, USA)-
dc.titleCEMA – Cost-Efficient Machine-Assisted Document Annotations-
dc.typeConference_Paper-
dc.identifier.doi10.1609/aaai.v37i9.26308-
dc.identifier.volume37-
dc.identifier.issnl2159-5399-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats