Show simple item record

dc.contributor.author
Chiosa, Monica
dc.contributor.supervisor
Alonso, Gustavo
dc.contributor.supervisor
Zhang, Ce
dc.contributor.supervisor
Josipovic, Lana
dc.contributor.supervisor
Preußer, Thomas B.
dc.date.accessioned
2024-01-25T11:45:22Z
dc.date.available
2024-01-24T14:23:54Z
dc.date.available
2024-01-25T10:37:32Z
dc.date.available
2024-01-25T11:30:36Z
dc.date.available
2024-01-25T11:45:22Z
dc.date.issued
2023
dc.identifier.uri
http://hdl.handle.net/20.500.11850/655231
dc.identifier.doi
10.3929/ethz-b-000655231
dc.description.abstract
The distributed nature of the cloud, regarding resource placement and application execution, incurs large data transfer. As data movement is unavoidable, it lies within the systems’ capacities to ensure its effectiveness. This thesis takes a step towards enhancing data movement with compute capabilities, more precisely with data analytics and privacy computational features, with the ultimate goal of making data movement more efficient and secure using Field Programmable Gate Arrays (FPGAs) based compute engines. From the data analytics perspective, a correlation coefficient compute engine is designed and analyzed. This selection is based on the widespread adoption of correlation across database and ML systems that require knowledge of the correlation relationship between the input data before processing it. The experimental evaluation shows that correlation computation offloaded to an FPGA-based smartNIC can sustain streams arriving at 100 Gbps over an RDMA network, while performing a single pass over the data and requiring an order of magnitude less time for computing compared to CPU or GPU designs. Maintaining the continuity in the data characterization approach, a streaming analytics compute engine is analyzed. The system objective is to sustain a 100 Gbps TCP/IP data rate while characterizing input data (cardinality, second frequency moment, frequency distribution) in a manageable space and with a single pass over the data. In addition to accomplishing its intended objective, the system deployment on a single FPGA can match the performance of 70 CPU cores. From the data privacy perspective, the performance benefits of offloading the I/O path data transformation operations (encryption and decryption) are analyzed, taking into account the requirements of a relational analytics engine (SAP HANA), while providing three levels of security. The findings indicate that substantial benefits can be achieved, both in terms of performance and freeing CPU resources, in the cloud environment, and protecting the data while it is at rest or transmitted over the network.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
FPGA, correlation computation, sketch algorithms, encryption and decryption, data movement, data characterization, data transformation, Pearson correlation coefficient, HyperLogLog, Count-Min, Fast-AGMS, AES, Remote Direct Memory Access (RDMA), TCP/IP
en_US
dc.title
FPGA-Based Systems for Stream Data Analytics and I/O Data Transformations
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2024-01-25
ethz.size
166 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.diss
29551
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02663 - Institut für Computing Platforms / Institute for Computing Platforms::03506 - Alonso, Gustavo / Alonso, Gustavo
en_US
ethz.date.deposited
2024-01-24T14:23:54Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2024-01-25T11:45:23Z
ethz.rosetta.lastUpdated
2024-01-25T11:45:23Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=FPGA-Based%20Systems%20for%20Stream%20Data%20Analytics%20and%20I/O%20Data%20Transformations&rft.date=2023&rft.au=Chiosa,%20Monica&rft.genre=unknown&rft.btitle=FPGA-Based%20Systems%20for%20Stream%20Data%20Analytics%20and%20I/O%20Data%20Transformations
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record