dcbench: A Benchmark for Data-Centric AI Systems
dc.contributor.author
Eyuboglu, Sabri
dc.contributor.author
Karlaš, Bojan
dc.contributor.author
Ré, Christopher
dc.contributor.author
Zhang, Ce
dc.contributor.author
Zou, James
dc.date.accessioned
2022-07-13T08:38:49Z
dc.date.available
2022-07-11T03:05:56Z
dc.date.available
2022-07-13T08:38:49Z
dc.date.issued
2022-06
dc.identifier.isbn
978-1-4503-9375-1
en_US
dc.identifier.other
10.1145/3533028.3533310
en_US
dc.identifier.uri
http://hdl.handle.net/20.500.11850/557202
dc.description.abstract
The development workflow for today's AI applications has grown far beyond the standard model training task. This workflow typically consists of various data and model management tasks. It includes a "data cycle"aimed at producing high-quality training data, and a "model cycle"aimed at managing trained models on their way to production. This broadened workflow has opened a space for already emerging tools and systems for AI development. However, as a research community, we are still missing standardized ways to evaluate these tools and systems. In a humble effort to get this wheel turning, we developed dcbench, a benchmark for evaluating systems for data-centric AI development. In this report, we present the main ideas behind dcbench, some benchmark tasks that we included in the initial release, and a short summary of its implementation.
en_US
dc.language.iso
en
en_US
dc.publisher
Association for Computing Machinery
en_US
dc.subject
Benchmarks
en_US
dc.subject
data-centric AI
en_US
dc.subject
AI development
en_US
dc.title
dcbench: A Benchmark for Data-Centric AI Systems
en_US
dc.type
Conference Paper
dc.date.published
2022-06-12
ethz.book.title
Proceedings of the 6th Workshop on Data Management for End-To-End Machine Learning (DEEM 2022)
en_US
ethz.pages.start
9
en_US
ethz.size
4 p.
en_US
ethz.event
6th Workshop on Data Management for End-To-End Machine Learning (DEEM 2022)
en_US
ethz.event.location
Philadelphia, PA, USA
en_US
ethz.event.date
June 12, 2022
en_US
ethz.notes
Held in conjunction with the 2022 ACM SIGMOD/PODS Conference
en_US
ethz.identifier.scopus
ethz.publication.place
New York, NY
en_US
ethz.publication.status
published
en_US
ethz.date.deposited
2022-07-11T03:06:08Z
ethz.source
SCOPUS
ethz.eth
yes
en_US
ethz.availability
Metadata only
en_US
ethz.rosetta.installDate
2022-07-13T08:38:55Z
ethz.rosetta.lastUpdated
2022-07-13T08:38:55Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=dcbench:%20A%20Benchmark%20for%20Data-Centric%20AI%20Systems&rft.date=2022-06&rft.spage=9&rft.au=Eyuboglu,%20Sabri&Karla%C5%A1,%20Bojan&R%C3%A9,%20Christopher&Zhang,%20Ce&Zou,%20James&rft.isbn=978-1-4503-9375-1&rft.genre=proceeding&rft_id=info:doi/10.1145/3533028.3533310&rft.btitle=Proceedings%20of%20the%206th%20Workshop%20on%20Data%20Management%20for%20End-To-End%20Machine%20Learning%20(DEEM%202022)
Files in this item
Files | Size | Format | Open in viewer |
---|---|---|---|
There are no files associated with this item. |
Publication type
-
Conference Paper [33039]