Showing results 1 to 2 of 2
Title | Author(s) | Issue Date | |
---|---|---|---|
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Proceeding/Conference:Neural Information Processing Systems (NeurIPS), 2024 (10/12/2024-15/12/2024, Vancouver, Canada) | 15-Dec-2024 | ||
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Proceeding/Conference:Neural Information Processing Systems (NeurIPS), 2024 (10/12/2024-15/12/2024, Vancouver, Canada) | 15-Dec-2024 |