Show simple item record

dc.contributor.author
He, Xiaoxi
dc.contributor.author
Gao, Dawei
dc.contributor.author
Zhou, Zimu
dc.contributor.author
Tong, Yongxin
dc.contributor.author
Thiele, Lothar
dc.date.accessioned
2021-09-21T06:35:56Z
dc.date.available
2021-09-21T03:19:33Z
dc.date.available
2021-09-21T06:35:56Z
dc.date.issued
2021-08
dc.identifier.isbn
978-1-4503-8332-5
en_US
dc.identifier.other
10.1145/3447548.3467271
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/506209
dc.description.abstract
Many mobile applications demand selective execution of multiple correlated deep learning inference tasks on resource-constrained platforms. Given a set of deep neural networks, each pre-trained for a single task, it is desired that executing arbitrary combinations of tasks yields minimal computation cost. Pruning each network separately yields suboptimal computation cost due to task relatedness. A promising remedy is to merge the networks into a multitask network to eliminate redundancy across tasks before network pruning. However, pruning a multitask network combined by existing network merging schemes cannot minimise the computation cost of every task combination because they do not consider such a future pruning. To this end, we theoretically identify the conditions such that pruning a multitask network minimises the computation of all task combinations. On this basis, we propose Pruning-Aware Merging (PAM), a heuristic network merging scheme to construct a multitask network that approximates these conditions. The merged network is then ready to be further pruned by existing network pruning methods. Evaluations with different pruning schemes, datasets, and network architectures show that PAM achieves up to 4.87x less computation against the baseline without network merging, and up to 2.01x less computation against the baseline with a state-of-the-art network merging scheme.
en_US
dc.language.iso
en
en_US
dc.publisher
Association for Computing Machinery
en_US
dc.subject
Deep learning
en_US
dc.subject
Network pruning
en_US
dc.subject
Multitasking inference
en_US
dc.title
Pruning-Aware Merging for Efficient Multitask Inference
en_US
dc.type
Conference Paper
dc.date.published
2021-08-14
ethz.book.title
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21)
en_US
ethz.pages.start
585
en_US
ethz.pages.end
595
en_US
ethz.event
27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)
en_US
ethz.event.location
Online
en_US
ethz.event.date
August 14-18, 2021
en_US
ethz.identifier.wos
ethz.identifier.scopus
ethz.publication.place
New York, NY
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02640 - Inst. f. Technische Informatik und Komm. / Computer Eng. and Networks Lab.::03429 - Thiele, Lothar (emeritus) / Thiele, Lothar (emeritus)
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02640 - Inst. f. Technische Informatik und Komm. / Computer Eng. and Networks Lab.::03429 - Thiele, Lothar (emeritus) / Thiele, Lothar (emeritus)
ethz.date.deposited
2021-09-21T03:19:50Z
ethz.source
SCOPUS
ethz.eth
yes
en_US
ethz.availability
Metadata only
en_US
ethz.rosetta.installDate
2021-09-21T06:36:02Z
ethz.rosetta.lastUpdated
2023-02-06T22:35:02Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Pruning-Aware%20Merging%20for%20Efficient%20Multitask%20Inference&rft.date=2021-08&rft.spage=585&rft.epage=595&rft.au=He,%20Xiaoxi&Gao,%20Dawei&Zhou,%20Zimu&Tong,%20Yongxin&Thiele,%20Lothar&rft.isbn=978-1-4503-8332-5&rft.genre=proceeding&rft_id=info:doi/10.1145/3447548.3467271&rft.btitle=Proceedings%20of%20the%2027th%20ACM%20SIGKDD%20Conference%20on%20Knowledge%20Discovery%20&%20Data%20Mining%20(KDD%20'21)
 Search print copy at ETH Library

Files in this item

FilesSizeFormatOpen in viewer

There are no files associated with this item.

Publication type

Show simple item record