Results

Now showing items 1-5 of 5

TuRaN: True Random Number Generation Using Supply Voltage Underscaling in SRAMs

Yüksel, İsmail Emir; Olgun, Ataberk; Salami, Behzad; et al. (2022)

arXiv

Prior works propose SRAM-based TRNGs that extract entropy from SRAM arrays. SRAM arrays are widely used to store data on-chip in the majority of specialized and general-purpose chips that perform computation. Thus, SRAM-based TRNGs present a low-cost alternative to dedicated hardware TRNGs. However, existing SRAM-based TRNGs suffer from 1) low TRNG throughput, 2) high energy consumption, 3) high TRNG latency, and 4) the inability ...

Working Paper

TargetCall: Eliminating the Wasted Computation in Basecalling via Pre-Basecalling Filtering

Cavlak, Meryem Banu; Singh, Gagandeep; Alser, Mohammed; et al. (2022)

arXiv

Basecalling is an essential step in nanopore sequencing analysis where the raw signals of nanopore sequencers are converted into nucleotide sequences, i.e., reads. State-of-the-art basecallers employ complex deep learning models to achieve high basecalling accuracy. This makes basecalling computationally inefficient and memory-hungry, bottlenecking the entire genome analysis pipeline. However, for many applications, the majority of reads ...

Working Paper

ALP: Alleviating CPU-Memory Data Movement Overheads in Memory-Centric Systems

Mansouri Ghiasi, Nika; Vijaykumar, Nandita; Oliveira, Geraldo F.; et al. (2022)

arXiv

Partitioning applications between NDP and host CPU cores incurs inter-segment data movement overhead, which arises from moving data generated in one segment (e.g., instructions, functions) and used in consecutive segments. Prior works take two approaches to this problem. The first class of works maps segments to NDP or host cores based on the properties of each segment, neglecting the inter-segment data movement overhead. The second ...

Working Paper

Accelerating Time Series Analysis via Processing using Non-Volatile Memories

Fernandez, Ivan; Manglik, Aditya; Giannoula, Christina; et al. (2022)

arXiv

Time Series Analysis (TSA) is a critical workload for consumer-facing devices. Accelerating TSA is vital for many domains as it enables the extraction of valuable information and the prediction of future events. The state-of-the-art algorithm in TSA is the subsequence Dynamic Time Warping (sDTW) algorithm. However, sDTW's computational complexity increases quadratically with the time series' length, resulting in two performance implications. First, ...

Working Paper

RevaMp3D: Architecting the Processor Core and Cache Hierarchy for Systems with Monolithically-Integrated Logic and Memory

Mansouri Ghiasi, Nika; Sadrosadati, Mohammad; Oliveira, Geraldo F.; et al. (2022)

arXiv

Recent nano-technological advances enable the Monolithic 3D (M3D) integration of multiple memory and logic layers in a single chip with fine-grained connections. M3D technology leads to significantly higher main memory bandwidth and shorter latency than existing 3D-stacked systems. We show for a variety of workloads on a state-of-the-art M3D system that the performance and energy bottlenecks shift from the main memory to the core and cache ...

Working Paper

Research Collection
