Research Data Leeds Repository
SpatialLM-StepGame
Citation
Li, Fangjun and Hogg, David C. and Cohn, Anthony G. (2024) SpatialLM-StepGame. University of Leeds. [Dataset] https://doi.org/10.5518/1468
Dataset description
This dataset, associated with the AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the StepGame Benchmark", aims to enhance spatial reasoning evaluations in language models. It rectifies template errors in the StepGame benchmark, providing a refined version in the bAbI format, promoting more accurate evaluation of language models' capabilities in spatial reasoning tasks.
Additional information: | This dataset has been developed from StepGame (https://github.com/ZhengxiangShi/StepGame), providing a refined version | ||||||
---|---|---|---|---|---|---|---|
Keywords: | spatial reasoning, language model, multi-hop, StepGame | ||||||
Subjects: | I000 - Computer sciences > I400 - Artificial intelligence | ||||||
Divisions: | Faculty of Engineering and Physical Sciences > School of Computing | ||||||
Related resources: |
|
||||||
License: | MIT License | ||||||
Date deposited: | 04 Sep 2024 10:14 | ||||||
URI: | https://archive.researchdata.leeds.ac.uk/id/eprint/1321 | ||||||