Show simple item record

dc.contributor.author
Chen, Jie
dc.contributor.author
Loi, Igor
dc.contributor.author
Flamand, Eric
dc.contributor.author
Tagliavini, Giuseppe
dc.contributor.author
Benini, Luca
dc.contributor.author
Rossi, Davide
dc.date.accessioned
2023-04-14T09:45:51Z
dc.date.available
2023-03-21T04:05:24Z
dc.date.available
2023-03-21T10:33:33Z
dc.date.available
2023-04-14T09:45:51Z
dc.date.issued
2023-04
dc.identifier.issn
1063-8210
dc.identifier.issn
1557-9999
dc.identifier.other
10.1109/TVLSI.2022.3228336
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/604134
dc.description.abstract
High performance and energy efficiency are critical requirements for Internet of Things (IoT) end-nodes. Exploiting tightly coupled clusters of programmable processors (CMPs) has recently emerged as a suitable solution to address this challenge. One of the main bottlenecks limiting the performance and energy efficiency of these systems is the instruction cache architecture due to its criticality in terms of timing (i.e., maximum operating frequency), bandwidth, and power. We propose a hierarchical instruction cache tailored to ultralow-power (ULP) tightly coupled processor clusters where a relatively large cache (L1.5) is shared by L1 private (PR) caches through a two-cycle latency interconnect. To address the performance loss caused by the L1 capacity misses, we introduce a next-line prefetcher with cache probe filtering (CPF) from L1 to L1.5. We optimize the core instruction fetch (IF) stage by removing the critical core-to-L1 combinational path. We present a detailed comparison of instruction cache architectures' performance and energy efficiency for parallel ULP (PULP) clusters. Focusing on the implementation, our two-level instruction cache provides better scalability than existing shared caches, delivering up to 20% higher operating frequency. On average, the proposed two-level cache improves maximum performance by up to 17% compared to the state-of-the-art while delivering similar energy efficiency for most relevant applications.
en_US
dc.language.iso
en
en_US
dc.publisher
IEEE
en_US
dc.subject
Energy efficiency
en_US
dc.subject
instruction cache
en_US
dc.subject
parallel
en_US
dc.subject
prefetch
en_US
dc.subject
ultralow-power (ULP)
en_US
dc.title
Scalable Hierarchical Instruction Cache for Ultralow-Power Processors Clusters
en_US
dc.type
Journal Article
dc.date.published
2023-02-20
ethz.journal.title
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
ethz.journal.volume
31
en_US
ethz.journal.issue
4
en_US
ethz.journal.abbreviated
IEEE Trans. Very Large Scale Integr. (VLSI) Syst.
ethz.pages.start
456
en_US
ethz.pages.end
469
en_US
ethz.grant
Pilot using Independent Local & Open Technologies
en_US
ethz.identifier.wos
ethz.identifier.scopus
ethz.publication.place
New York, NY
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02636 - Institut für Integrierte Systeme / Integrated Systems Laboratory::03996 - Benini, Luca / Benini, Luca
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02636 - Institut für Integrierte Systeme / Integrated Systems Laboratory::03996 - Benini, Luca / Benini, Luca
ethz.grant.agreementno
101034126
ethz.grant.fundername
EC
ethz.grant.funderDoi
10.13039/501100000780
ethz.grant.program
H2020
ethz.date.deposited
2023-03-21T04:05:36Z
ethz.source
WOS
ethz.eth
yes
en_US
ethz.availability
Metadata only
en_US
ethz.rosetta.installDate
2023-04-14T09:45:52Z
ethz.rosetta.lastUpdated
2024-02-02T21:39:13Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Scalable%20Hierarchical%20Instruction%20Cache%20for%20Ultralow-Power%20Processors%20Clusters&rft.jtitle=IEEE%20Transactions%20on%20Very%20Large%20Scale%20Integration%20(VLSI)%20Systems&rft.date=2023-04&rft.volume=31&rft.issue=4&rft.spage=456&rft.epage=469&rft.issn=1063-8210&1557-9999&rft.au=Chen,%20Jie&Loi,%20Igor&Flamand,%20Eric&Tagliavini,%20Giuseppe&Benini,%20Luca&rft.genre=article&rft_id=info:doi/10.1109/TVLSI.2022.3228336&
 Search print copy at ETH Library

Files in this item

FilesSizeFormatOpen in viewer

There are no files associated with this item.

Publication type

Show simple item record