arsaporta/symile-m3
Dataset Card for Symile-M3 Symile-M3 is a multilingual dataset of (audio, image, text) samples. The dataset is specifically designed to test a model's ability to capture higher-order information between three distinct high-dimensional data types: by incorporating multiple languages, we construct a task where text and audio are both needed to predict the image, and where, importantly, neither text
mlforge datasets pull arsaporta/symile-m3
Dataset details
About arsaporta/symile-m3
Dataset Card for Symile-M3 Symile-M3 is a multilingual dataset of (audio, image, text) samples. The dataset is specifically designed to test a model's ability to capture higher-order information between three distinct high-dimensional data types: by incorporating multiple languages, we construct a task where text and audio are both needed to predict the image, and where, importantly, neither text nor audio alone would suffice. - Paper: https://arxiv.org/abs/2411.01053 - GitHub: https://github.com/rajesh-lab/symile - Questions & Discussion: https://www.alphaxiv.org/abs/2411.01053v1