Bibliometrics
Skip Table Of Content Section
research-article
Open Access
Accelerating Graph Computations on 3D NoC-Enabled PIM Architectures
Article No.: 30, pp 1–16https://doi.org/10.1145/3564290

Graph application workloads are dominated by random memory accesses with the poor locality. To tackle the irregular and sparse nature of computation, ReRAM-based Processing-in-Memory (PIM) architectures have been proposed recently. Most of these ReRAM ...

research-article
Open Access
Virtuoso: Energy- and Latency-aware Streamlining of Streaming Videos on Systems-on-Chips
Article No.: 31, pp 1–32https://doi.org/10.1145/3564289

Efficient and adaptive computer vision systems have been proposed to make computer vision tasks, such as image classification and object detection, optimized for embedded or mobile devices. These solutions, quite recent in their origin, focus on ...

research-article
Design of Synthesis-time Vectorized Arithmetic Hardware for Tapered Floating-point Addition and Subtraction
Article No.: 32, pp 1–35https://doi.org/10.1145/3567423

Energy efficiency has become the new performance criterion in this era of pervasive embedded computing; thus, accelerator-rich multi-processor system-on-chips are commonly used in embedded computing hardware. Once computationally intensive machine ...

research-article
Auto-tuning Fixed-point Precision with TVM on RISC-V Packed SIMD Extension
Article No.: 33, pp 1–21https://doi.org/10.1145/3569939

Today, as deep learning (DL) is applied more often in daily life, dedicated processors such as CPUs and GPUs have become very important for accelerating model executions. With the growth of technology, people are becoming accustomed to using edge devices, ...

research-article
Open Access
Hardware-aware Quantization/Mapping Strategies for Compute-in-Memory Accelerators
Article No.: 34, pp 1–23https://doi.org/10.1145/3569940

The emerging non-volatile memory (eNVM) based mixed-signal Compute-in-Memory (CIM) accelerators are of great interest in today's AI accelerators design due to their high energy efficiency. Various CIM architectures and circuit-level designs have been ...

research-article
GANDSE: Generative Adversarial Network-based Design Space Exploration for Neural Network Accelerator Design
Article No.: 35, pp 1–20https://doi.org/10.1145/3570926

With the popularity of deep learning, the hardware implementation platform of deep learning has received increasing interest. Unlike the general purpose devices, e.g., CPU or GPU, where the deep learning algorithms are executed at the software level, ...

research-article
DDAM: Data Distribution-Aware Mapping of CNNs on Processing-In-Memory Systems
Article No.: 36, pp 1–30https://doi.org/10.1145/3576196

Convolution neural networks (CNNs) are widely used algorithms in image processing, natural language processing and many other fields. The large amount of memory access of CNNs is one of the major concerns in CNN accelerator designs that influences the ...

research-article
A Switching NMOS Based Single Ended Sense Amplifier for High Density SRAM Applications
Article No.: 37, pp 1–14https://doi.org/10.1145/3576198

The demand for single ended static random access memory is growing, driven by the decreasing technology node and increasing processing load. This mandates the need for a single ended sense amplifier to be used along with the memory. Consequently, a single ...

research-article
Inferencing on Edge Devices: A Time- and Space-aware Co-scheduling Approach
Article No.: 38, pp 1–33https://doi.org/10.1145/3576197

Neural Network (NN)-based real-time inferencing tasks are often co-scheduled on GPGPU-style edge platforms. Existing works advocate using different NN parameters for the same detection task in different environments. However, realizing such approaches ...

research-article
Component Fault Diagnosability of Hierarchical Cubic Networks
Article No.: 39, pp 1–19https://doi.org/10.1145/3577018

The fault diagnosability of a network indicates the self-diagnosis ability of the network, thus it is an important measure of robustness of the network. As a neoteric feature for measuring fault diagnosability, the r-component diagnosability ctr(G) of a ...

research-article
Open Access
CNNFlow: Memory-driven Data Flow Optimization for Convolutional Neural Networks
Article No.: 40, pp 1–36https://doi.org/10.1145/3577017

Convolution Neural Networks (CNNs) are widely deployed in computer vision applications. The datasets are large, and the data reuse across different parts is heavily interleaved. Given that memory access (SRAM and especially DRAM) is more expensive in both ...

research-article
Open Access
Multi-Objective Optimization for Safety-Related Available E/E Architectures Scoping Highly Automated Driving Vehicles
Article No.: 41, pp 1–37https://doi.org/10.1145/3582004

Megatrends such as Highly Automated Driving (HAD) (SAE ≥ Level 3), electrification, and connectivity are reshaping the automotive industry. Together with the new technologies, the business models will also evolve, opening up new possibilities and new ...

research-article
Low-energy Pipelined Hardware Design for Approximate Medium Filter
Article No.: 42, pp 1–21https://doi.org/10.1145/3582005

Image and video processing algorithms are currently crucial for many applications. Hardware implementation of these algorithms provides higher speed for large computation applications. Removing noise is often a typical pre-processing step to enhance the ...

research-article
Accurately Measuring Contention in Mesh NoCs in Time-Sensitive Embedded Systems
Article No.: 43, pp 1–34https://doi.org/10.1145/3582006

The computing capacity demanded by embedded systems is on the rise as software implements more functionalities, ranging from best-effort entertainment functions to performance-guaranteed safety-related functions. Heterogeneous manycore processors, using ...

Subjects

Comments

About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!