Show simple item record

dc.contributor.author
Gruenheid, Anja
dc.contributor.author
Kossmann, Donald
dc.contributor.author
Sukriti, Ramesh
dc.contributor.author
Widmer, Florian
dc.date.accessioned
2017-09-22T10:46:22Z
dc.date.available
2017-06-10T15:50:22Z
dc.date.available
2017-09-22T10:46:22Z
dc.date.issued
2012-09
dc.identifier.uri
http://hdl.handle.net/20.500.11850/65601
dc.identifier.doi
10.3929/ethz-a-009761323
dc.description.abstract
There are several computational tasks for which human help is useful. One such task is entity resolution: human experts can help decide whether two customer profiles describe the same customer. Since crowdsourcing is expensive, the goal is to ask as few questions as possible. At the same time, high-quality results can only be achieved if several experts are asked for their opinion and for confirmation. This paper shows how to address this cost/quality trade-off and how to tolerate and resolve errors from the crowd. Specifically, it shows how to exploit mathematical properties of the is-same-entity-as relation, such as symmetry, transitivity, and anti-transitivity, to improve both cost and quality. The results of extensive experiments provide surprising insights on how best to crowdsource entity resolution and other classification problems.
en_US
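Illustrative note on the abstract: the sketch below is one possible way to see how symmetry, transitivity, and anti-transitivity of an is-same-entity-as relation can save crowd questions. It is not taken from the report. The class `EntityGraph` and its methods `record` and `infer` are hypothetical names, and the union-find structure is an assumption chosen for brevity. The sketch also assumes error-free answers, so it does not cover the error tolerance the abstract mentions.

```python
class EntityGraph:
    """Hypothetical helper: stores crowd answers and infers new ones."""

    def __init__(self):
        self.parent = {}   # union-find forest over record ids
        self.differs = {}  # cluster root -> roots known to be different entities

    def find(self, x):
        # Register unseen records as their own singleton cluster.
        if x not in self.parent:
            self.parent[x] = x
            self.differs[x] = set()
        # Walk to the root, halving the path on the way.
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def infer(self, a, b):
        """Return True/False if the answer follows from stored answers, else None."""
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return True    # symmetry + transitivity: same cluster
        if rb in self.differs[ra]:
            return False   # anti-transitivity: clusters known to differ
        return None        # unknown: this pair would need a crowd question

    def record(self, a, b, same):
        """Store one answer; reject it if it contradicts earlier answers."""
        known = self.infer(a, b)
        if known is not None and known != same:
            raise ValueError(f"answer for ({a}, {b}) contradicts earlier answers")
        ra, rb = self.find(a), self.find(b)
        if same and ra != rb:
            # Merge cluster rb into ra and carry over its "different" edges.
            self.parent[rb] = ra
            for r in self.differs.pop(rb):
                self.differs[r].discard(rb)
                self.differs[r].add(ra)
                self.differs[ra].add(r)
        elif not same:
            self.differs[ra].add(rb)
            self.differs[rb].add(ra)


g = EntityGraph()
g.record("A", "B", True)
g.record("B", "C", True)
g.record("C", "D", False)
print(g.infer("A", "C"))  # inferred from A=B and B=C
print(g.infer("D", "A"))  # inferred from A=C and C!=D
print(g.infer("A", "E"))  # not inferable, would have to be asked
```

In this sketch, a pair is sent to the crowd only when `infer` returns `None`; every pair it can answer is a question saved.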
dc.language.iso
en
en_US
dc.publisher
ETH, Department of Computer Science, Systems Group
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
DATABASE MANAGEMENT + DATABASE ADMINISTRATION (INFORMATION SYSTEMS)
en_US
dc.subject
INFORMATION MANAGEMENT (MANAGEMENT OF COMPUTER SYSTEMS)
en_US
dc.subject
SPECIAL PROGRAMMING METHODS
en_US
dc.subject
SPEZIELLE PROGRAMMIERMETHODEN
en_US
dc.subject
INFORMATIONSMANAGEMENT (MANAGEMENT VON COMPUTERSYSTEMEN)
en_US
dc.subject
DATENBANKVERWALTUNG + DATENBANKADMINISTRATION (INFORMATIONSSYSTEME)
en_US
dc.title
Crowdsourcing Entity Resolution
en_US
dc.type
Report
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2013
ethz.title.subtitle
When is A=B?
en_US
ethz.journal.title
Technical Report / ETH Zurich, Department of Computer Science
ethz.journal.volume
785
en_US
ethz.size
34 p.
en_US
ethz.code.ddc
0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.nebis
009761323
ethz.publication.place
Zürich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02663 - Institut für Computing Platforms / Institute for Computing Platforms::03689 - Kossmann, Donald (ehemalig)
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02663 - Institut für Computing Platforms / Institute for Computing Platforms::03689 - Kossmann, Donald (ehemalig)
ethz.date.deposited
2017-06-10T15:52:34Z
ethz.source
ECOL
ethz.source
ECIT
ethz.identifier.importid
imp59366b3bc154d32785
ethz.identifier.importid
imp5936507f1d38d79695
ethz.ecolpid
eth:6833
ethz.ecitpid
pub:104509
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2017-07-18T16:46:49Z
ethz.rosetta.lastUpdated
2018-11-05T20:02:30Z
ethz.rosetta.exportRequired
true
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Crowdsourcing%20Entity%20Resolution&rft.jtitle=Technical%20Report%20/%20ETH%20Zurich,%20Department%20of%20Computer%20Science&rft.date=2012-09&rft.volume=785&rft.au=Gruenheid,%20Anja&Kossmann,%20Donald&Sukriti,%20Ramesh&Widmer,%20Florian&rft.genre=report&