Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Abstract
Recent work has shown that language models (LMs) have strong multi-step (i.e., procedural) reasoning capabilities. However, it is unclear whether LMs perform these tasks by cheating with answers memorized from the pretraining corpus or via a genuine multi-step reasoning mechanism. In this paper, we address this question by exploring a mechanistic interpretation of LMs for multi-step reasoning tasks. Concretely, we hypothesize that the LM implicitly embeds a reasoning tree resembling the correct reasoning process within its computation. We test this hypothesis by introducing a new probing approach (called MechanisticProbe) that recovers the reasoning tree from the model's attention patterns. We use our probe to analyze two LMs: GPT-2 on a synthetic task (finding the k-th smallest element of a list) and LLaMA on two simple language-based reasoning tasks (ProofWriter and the AI2 Reasoning Challenge). We show that MechanisticProbe detects the information of the reasoning tree in the model's attentions for most examples, suggesting that in many cases the LM is indeed carrying out a multi-step reasoning process within its architecture.
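To make the abstract's core idea concrete, the sketch below illustrates one way attention patterns could be scored against input statements to recover which statements a model relies on at each step. This is a minimal toy illustration of the general idea only, not the paper's actual MechanisticProbe; all names (recover_reasoning_steps, statement_spans, answer_pos) and the layer/head averaging scheme are illustrative assumptions.

```python
# Hypothetical sketch: rank input statements by the attention mass the
# answer position pays to each statement's token span. NOT the paper's
# actual probe; all names and the aggregation scheme are assumptions.
import numpy as np

def recover_reasoning_steps(attn: np.ndarray,
                            statement_spans: list[tuple[int, int]],
                            answer_pos: int,
                            top_k: int = 2) -> list[int]:
    """Return indices of the top_k statements most attended to by the
    answer position, a crude proxy for the leaves of a reasoning tree.

    attn: shape (layers, heads, seq_len, seq_len); attn[l, h, i, j] is
          the attention weight from position i to position j.
    statement_spans: (start, end) token ranges, one per input statement.
    answer_pos: index of the position where the answer is produced.
    """
    # Attention from the answer position to every input position,
    # averaged over layers and heads: shape (seq_len,).
    to_answer = attn[:, :, answer_pos, :].mean(axis=(0, 1))
    # Total attention mass landing inside each statement's span.
    scores = [to_answer[start:end].sum() for start, end in statement_spans]
    return sorted(np.argsort(scores)[-top_k:].tolist())

# Toy usage with random "attention" (2 layers, 2 heads, 10 tokens).
rng = np.random.default_rng(0)
attn = rng.random((2, 2, 10, 10))
attn /= attn.sum(axis=-1, keepdims=True)   # row-normalize like softmax
spans = [(0, 3), (3, 6), (6, 9)]           # three input statements
print(recover_reasoning_steps(attn, spans, answer_pos=9))
```

A probe in this spirit would compare the recovered statement set against the gold reasoning tree for each example; how the paper actually parameterizes and trains its probe is not described in this abstract.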
Permanent link
https://doi.org/10.3929/ethz-b-000653493
Publication status
published
Book title
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Publisher
Association for Computational Linguistics
Organisational unit
09684 - Sachan, Mrinmaya / Sachan, Mrinmaya
Funding
ETH-19 21-1 - Neuro-cognitive Model Inspired from Human Language Processing (ETHZ)
ETH Bibliography
yes