Research Data Leeds Repository

SpatialLM-StepGame

Citation

Li, Fangjun and Hogg, David C. and Cohn, Anthony G. (2024) SpatialLM-StepGame. University of Leeds. [Dataset] https://doi.org/10.5518/1468

Dataset description

This dataset, associated with the AAAI-24 paper "Advancing Spatial Reasoning in Large Language Models: An In-depth Evaluation and Enhancement Using the StepGame Benchmark", aims to enhance spatial reasoning evaluations in language models. It rectifies template errors in the StepGame benchmark, providing a refined version in the bAbI format, promoting more accurate evaluation of language models' capabilities in spatial reasoning tasks.

Additional information: This dataset has been developed from StepGame (https://github.com/ZhengxiangShi/StepGame), providing a refined version
Keywords: spatial reasoning, language model, multi-hop, StepGame
Subjects: I000 - Computer sciences > I400 - Artificial intelligence
Divisions: Faculty of Engineering and Physical Sciences > School of Computing
Related resources:
LocationType
https://doi.org/10.1609/aaai.v38i17.29811Publication
https://eprints.whiterose.ac.uk/211546/Publication
License: MIT License
Date deposited: 04 Sep 2024 10:14
URI: https://archive.researchdata.leeds.ac.uk/id/eprint/1321

Files

Documentation

Data

Program

Research Data Leeds Repository is powered by EPrints
Copyright © University of Leeds