File Download
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: Optimizing plurality for human intelligence tasks
Title | Optimizing plurality for human intelligence tasks |
---|---|
Authors | |
Keywords | Crowdsourcing Data Quality |
Issue Date | 2013 |
Publisher | ACM. |
Citation | The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13 How to Cite? |
Abstract | In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice. |
Description | Full Session 38 - DB Track - Miscellaneous |
Persistent Identifier | http://hdl.handle.net/10722/189633 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Mo, L | en_US |
dc.contributor.author | Cheng, R | en_US |
dc.contributor.author | Kao, B | en_US |
dc.contributor.author | Yang, XS | en_US |
dc.contributor.author | Ren, C | en_US |
dc.contributor.author | Lei, S | en_US |
dc.contributor.author | Cheung, DWL | en_US |
dc.contributor.author | Lo, E | en_US |
dc.date.accessioned | 2013-09-17T14:50:31Z | - |
dc.date.available | 2013-09-17T14:50:31Z | - |
dc.date.issued | 2013 | en_US |
dc.identifier.citation | The 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, CA., 27 October-1 November 2013. In Conference Proceedings, 2013, p. 1-13 | en_US |
dc.identifier.isbn | 978-1-4503-2263-8 | - |
dc.identifier.uri | http://hdl.handle.net/10722/189633 | - |
dc.description | Full Session 38 - DB Track - Miscellaneous | - |
dc.description.abstract | In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT’s answer could be statistically improved. Given a set of HITs and a fixed “budget”, we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT’s answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice. | - |
dc.language | eng | en_US |
dc.publisher | ACM. | - |
dc.relation.ispartof | 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013 Proceedings | en_US |
dc.subject | Crowdsourcing | - |
dc.subject | Data Quality | - |
dc.title | Optimizing plurality for human intelligence tasks | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Mo, L: lymo@cs.hku.hk | en_US |
dc.identifier.email | Cheng, R: ckcheng@cs.hku.hk | en_US |
dc.identifier.email | Kao, B: kao@cs.hku.hk | en_US |
dc.identifier.email | Yang, XS: xyang2@cs.hku.hk | - |
dc.identifier.email | Ren, C: chren@cs.hku.hk | - |
dc.identifier.email | Lei, S: sylei@cs.hku.hk | - |
dc.identifier.email | Cheung, DWL: dcheung@cs.hku.hk | - |
dc.identifier.email | Lo, E: ericlo@comp.polyu.edu.hk | - |
dc.identifier.authority | Cheng, R=rp00074 | en_US |
dc.identifier.authority | Kao, B=rp00123 | en_US |
dc.identifier.authority | Cheung, DWL=rp00101 | en_US |
dc.description.nature | link_to_OA_fulltext | - |
dc.identifier.hkuros | 222849 | en_US |
dc.identifier.spage | 1 | - |
dc.identifier.epage | 13 | - |
dc.publisher.place | United States | - |
dc.customcontrol.immutable | sml 131023 | - |