File Download
Supplementary

Conference Paper: Optimizing plurality for human intelligence tasks

TitleOptimizing plurality for human intelligence tasks
Authors
KeywordsCrowdsourcing
Data Quality
Issue Date2013
PublisherACM.
Citation
The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13 How to Cite?
AbstractIn a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.
DescriptionFull Session 38 - DB Track - Miscellaneous
Persistent Identifierhttp://hdl.handle.net/10722/189633
ISBN

 

DC FieldValueLanguage
dc.contributor.authorMo, Len_US
dc.contributor.authorCheng, Ren_US
dc.contributor.authorKao, Ben_US
dc.contributor.authorYang, XSen_US
dc.contributor.authorRen, Cen_US
dc.contributor.authorLei, Sen_US
dc.contributor.authorCheung, DWLen_US
dc.contributor.authorLo, Een_US
dc.date.accessioned2013-09-17T14:50:31Z-
dc.date.available2013-09-17T14:50:31Z-
dc.date.issued2013en_US
dc.identifier.citationThe 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13en_US
dc.identifier.isbn978-1-4503-2263-8-
dc.identifier.urihttp://hdl.handle.net/10722/189633-
dc.descriptionFull Session 38 - DB Track - Miscellaneous-
dc.description.abstractIn a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.-
dc.languageengen_US
dc.publisherACM.-
dc.relation.ispartof22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 Proceedingsen_US
dc.subjectCrowdsourcing-
dc.subjectData Quality-
dc.titleOptimizing plurality for human intelligence tasksen_US
dc.typeConference_Paperen_US
dc.identifier.emailMo, L: lymo@cs.hku.hken_US
dc.identifier.emailCheng, R: ckcheng@cs.hku.hken_US
dc.identifier.emailKao, B: kao@cs.hku.hken_US
dc.identifier.emailYang, XS: xyang2@cs.hku.hk-
dc.identifier.emailRen, C: chren@cs.hku.hk-
dc.identifier.emailLei, S: sylei@cs.hku.hk-
dc.identifier.emailCheung, DWL: dcheung@cs.hku.hk-
dc.identifier.emailLo, E: ericlo@comp.polyu.edu.hk-
dc.identifier.authorityCheng, R=rp00074en_US
dc.identifier.authorityKao, B=rp00123en_US
dc.identifier.authorityCheung, DWL=rp00101en_US
dc.description.naturelink_to_OA_fulltext-
dc.identifier.hkuros222849en_US
dc.identifier.spage1-
dc.identifier.epage13-
dc.publisher.placeUnited States-
dc.customcontrol.immutablesml 131023-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats