Bird's Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach
dc.contributor.author
Hou, Yifan
dc.contributor.author
Sachan, Mrinmaya
dc.contributor.editor
Zong, Chengqing
dc.contributor.editor
Xia, Fei
dc.contributor.editor
Li, Wenjie
dc.contributor.editor
Navigli, Roberto
dc.date.accessioned
2022-05-31T05:34:07Z
dc.date.available
2021-12-07T09:33:39Z
dc.date.available
2022-05-31T05:34:07Z
dc.date.issued
2021-08
dc.identifier.isbn
978-1-954085-52-7
en_US
dc.identifier.other
10.18653/v1/2021.acl-long.145
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/519233
dc.identifier.doi
10.3929/ethz-b-000519233
dc.description.abstract
NLP has a rich history of representing our prior understanding of language in the form of graphs. Recent work on analyzing contextualized text representations has focused on hand-designed probe models to understand how and to what extent these representations encode a particular linguistic phenomenon. However, due to the interdependence of various phenomena and the randomness of training probe models, detecting how these representations encode the rich information in these linguistic graphs remains a challenging problem. In this paper, we propose a new information-theoretic probe, Bird's Eye, a fairly simple method for detecting if and how these representations encode the information in these linguistic graphs. Instead of using classifier performance, our probe takes an information-theoretic view of probing and estimates the mutual information between the linguistic graph embedded in a continuous space and the contextualized word representations. Furthermore, we propose an approach that uses our probe to investigate localized linguistic information in the linguistic graphs through perturbation analysis. We call this probing setup Worm's Eye. Using these probes, we analyze BERT models on their ability to encode a syntactic and a semantic graph structure, and find that these models encode both syntactic and semantic information to some degree, albeit syntactic information to a greater extent. Our implementation is available at https://github.com/yifan-h/Graph_Probe-Birds_Eye.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
Association for Computational Linguistics
en_US
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
dc.title
Bird's Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach
en_US
dc.type
Conference Paper
dc.rights.license
Creative Commons Attribution 4.0 International
ethz.book.title
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing
en_US
ethz.journal.volume
1
en_US
ethz.pages.start
1844
en_US
ethz.pages.end
1859
en_US
ethz.version.deposit
publishedVersion
en_US
ethz.event
Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
en_US
ethz.event.location
Online
en_US
ethz.event.date
August 1-6, 2021
en_US
ethz.grant
Representation Learning for Arbitrarily Long Richly Formatted Multimedia Documents
en_US
ethz.identifier.wos
ethz.publication.place
Stroudsburg, PA
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09684 - Sachan, Mrinmaya / Sachan, Mrinmaya
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::09684 - Sachan, Mrinmaya / Sachan, Mrinmaya
ethz.grant.agreementno
201009
ethz.grant.fundername
SNF
ethz.grant.funderDoi
10.13039/501100001711
ethz.grant.program
Projekte MINT
ethz.date.deposited
2021-12-07T09:34:13Z
ethz.source
WOS
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2022-05-31T05:34:16Z
ethz.rosetta.lastUpdated
2023-02-07T03:19:08Z
ethz.rosetta.versionExported
true