Optimizing plurality for human intelligence tasks

Mo, L; Cheng, R; Kao, B; Yang, XS; Ren, C; Lei, S; Cheung, DWL; Lo, E

File Download

re01.htm

Supplementary

Citations:
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Optimizing plurality for human intelligence tasks

Title	Optimizing plurality for human intelligence tasks
Authors	Mo, L Cheng, R Kao, B Yang, XS Ren, C Lei, S Cheung, DWL Lo, E
Keywords	Crowdsourcing Data Quality
Issue Date	2013
Publisher	ACM.
Citation	The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13 How to Cite?
Abstract	In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.
Description	Full Session 38 - DB Track - Miscellaneous
Persistent Identifier	http://hdl.handle.net/10722/189633
ISBN	978-1-4503-2263-8

DC Field	Value	Language
dc.contributor.author	Mo, L	en_US
dc.contributor.author	Cheng, R	en_US
dc.contributor.author	Kao, B	en_US
dc.contributor.author	Yang, XS	en_US
dc.contributor.author	Ren, C	en_US
dc.contributor.author	Lei, S	en_US
dc.contributor.author	Cheung, DWL	en_US
dc.contributor.author	Lo, E	en_US
dc.date.accessioned	2013-09-17T14:50:31Z	-
dc.date.available	2013-09-17T14:50:31Z	-
dc.date.issued	2013	en_US
dc.identifier.citation	The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13	en_US
dc.identifier.isbn	978-1-4503-2263-8	-
dc.identifier.uri	http://hdl.handle.net/10722/189633	-
dc.description	Full Session 38 - DB Track - Miscellaneous	-
dc.description.abstract	In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.	-
dc.language	eng	en_US
dc.publisher	ACM.	-
dc.relation.ispartof	22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 Proceedings	en_US
dc.subject	Crowdsourcing	-
dc.subject	Data Quality	-
dc.title	Optimizing plurality for human intelligence tasks	en_US
dc.type	Conference_Paper	en_US
dc.identifier.email	Mo, L: lymo@cs.hku.hk	en_US
dc.identifier.email	Cheng, R: ckcheng@cs.hku.hk	en_US
dc.identifier.email	Kao, B: kao@cs.hku.hk	en_US
dc.identifier.email	Yang, XS: xyang2@cs.hku.hk	-
dc.identifier.email	Ren, C: chren@cs.hku.hk	-
dc.identifier.email	Lei, S: sylei@cs.hku.hk	-
dc.identifier.email	Cheung, DWL: dcheung@cs.hku.hk	-
dc.identifier.email	Lo, E: ericlo@comp.polyu.edu.hk	-
dc.identifier.authority	Cheng, R=rp00074	en_US
dc.identifier.authority	Kao, B=rp00123	en_US
dc.identifier.authority	Cheung, DWL=rp00101	en_US
dc.description.nature	link_to_OA_fulltext	-
dc.identifier.hkuros	222849	en_US
dc.identifier.spage	1	-
dc.identifier.epage	13	-
dc.publisher.place	United States	-
dc.customcontrol.immutable	sml 131023	-

File Download

Supplementary

Conference Paper: Optimizing plurality for human intelligence tasks

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats