Hardware Optimization

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 41055 Experts worldwide ranked by ideXlab platform

Jason H Anderson - One of the best experts on this subject based on the ideXlab platform.

  • range and bitmask analysis for Hardware Optimization in high level synthesis
    Asia and South Pacific Design Automation Conference, 2013
    Co-Authors: Marcel Gort, Jason H Anderson
    Abstract:

    We consider the extent to which the bit-level representation of variables can be used to optimize Hardware generated by high-level synthesis (HLS). Two approaches to bit-level Optimization are considered (individually and together): 1) range analysis, and 2) bitmask analysis. Range analysis aims to predetermine min/max ranges for variables to reduce the bitwidth required to represent variables in Hardware. Bitmask analysis characterizes individual bits within a word as either constants (1 or 0), sign bits, or unknowns, where constants/don't-cares permit Hardware to be eliminated under certain conditions. Static compiler-based analysis is contrasted with dynamic profiling-based analysis in terms of their potential to impact area and speed of HLS-generated Hardware. For a set of benchmarks implemented in the Altera Cyclone II FPGA, results show bit-level Optimizations in HLS based on static analysis reduce circuit area by 9%, on average, while additional Optimizations based on dynamic analysis provide 34% area reduction.

  • ASP-DAC - Range and bitmask analysis for Hardware Optimization in high-level synthesis
    2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC), 2013
    Co-Authors: Marcel Gort, Jason H Anderson
    Abstract:

    We consider the extent to which the bit-level representation of variables can be used to optimize Hardware generated by high-level synthesis (HLS). Two approaches to bit-level Optimization are considered (individually and together): 1) range analysis, and 2) bitmask analysis. Range analysis aims to predetermine min/max ranges for variables to reduce the bitwidth required to represent variables in Hardware. Bitmask analysis characterizes individual bits within a word as either constants (1 or 0), sign bits, or unknowns, where constants/don't-cares permit Hardware to be eliminated under certain conditions. Static compiler-based analysis is contrasted with dynamic profiling-based analysis in terms of their potential to impact area and speed of HLS-generated Hardware. For a set of benchmarks implemented in the Altera Cyclone II FPGA, results show bit-level Optimizations in HLS based on static analysis reduce circuit area by 9%, on average, while additional Optimizations based on dynamic analysis provide 34% area reduction.

Timothy G. Constandinou - One of the best experts on this subject based on the ideXlab platform.

  • Adaptive spike detection and Hardware Optimization towards autonomous, high-channel-count BMIs.
    Journal of neuroscience methods, 2021
    Co-Authors: Zheng Zhang, Timothy G. Constandinou
    Abstract:

    Abstract Background The progress in microtechnology has enabled an exponential trend in the number of neurons that can be simultaneously recorded. The data bandwidth requirement is however increasing with channel count. The vast majority of experimental work involving electrophysiology stores the raw data and then processes this offline; to detect the underlying spike events. Emerging applications however require new methods for local, real-time processing. New Methods: We have developed an adaptive, low complexity spike detection algorithm that combines three novel components for: (1) removing the local field potentials; (2) enhancing the signal-to-noise ratio; and (3) computing an adaptive threshold. The proposed algorithm has been optimised for Hardware implementation (i.e. minimising computations, translating to a fixed-point implementation), and demonstrated on low-power embedded targets. Main results The algorithm has been validated on both synthetic datasets and real recordings yielding a detection sensitivity of up to 90%. The initial Hardware implementation using an off-the-shelf embedded platform demonstrated a memory requirement of less than 0.1 kb ROM and 3 kb program flash, consuming an average power of 130μW. Comparison with Existing Methods The method presented has the advantages over other approaches, that it allows spike events to be robustly detected in real-time from neural activity in a completely autonomous way, without the need for any calibration, and can be implemented with low Hardware resources. Conclusion The proposed method can detect spikes effectively and adaptively. It alleviates the need for re-calibration, which is critical towards achieving a viable BMI, and more so with future ‘high bandwidth’ systems’ targeting 1000s of channels.

Oreste Villa - One of the best experts on this subject based on the ideXlab platform.

  • power performance Hardware Optimization for synchronization intensive applications in mpsocs
    Design Automation and Test in Europe, 2006
    Co-Authors: Matteo Monchiero, Gianluca Palermo, Cristina Silvano, Oreste Villa
    Abstract:

    This paper explores Optimization techniques of the synchronization mechanisms for MPSoCs based on complex interconnect (Network-on-Chip), targeted at future power-efficient systems. The proposed solution is based on the idea of locally performing synchronization operations which require the continuous polling of a shared variable, thus featuring large contention (e. g. spin locks). We introduce a HW module, the Synchronization-operation Buffer (SB), which queues and manages the requests issued by the processors. Experimental validation has been carried out by using GRAPES, a cycle-accurate performance/power simulation platform. For 8-processor target architecture, we show that the proposed solution achieves up to 40% performance improvement and 30% energy saving with respect to synchronization based on directory-based coherence protocol.

  • DATE - Power/Performance Hardware Optimization for Synchronization Intensive Applications in MPSoCs
    Proceedings of the Design Automation & Test in Europe Conference, 2006
    Co-Authors: Matteo Monchiero, Gianluca Palermo, Cristina Silvano, Oreste Villa
    Abstract:

    This paper explores Optimization techniques of the synchronization mechanisms for MPSoCs based on complex interconnect (Network-on-Chip), targeted at future power-efficient systems. The proposed solution is based on the idea of locally performing synchronization operations which require the continuous polling of a shared variable, thus featuring large contention (e. g. spin locks). We introduce a HW module, the Synchronization-operation Buffer (SB), which queues and manages the requests issued by the processors. Experimental validation has been carried out by using GRAPES, a cycle-accurate performance/power simulation platform. For 8-processor target architecture, we show that the proposed solution achieves up to 40% performance improvement and 30% energy saving with respect to synchronization based on directory-based coherence protocol.

Marcel Gort - One of the best experts on this subject based on the ideXlab platform.

  • range and bitmask analysis for Hardware Optimization in high level synthesis
    Asia and South Pacific Design Automation Conference, 2013
    Co-Authors: Marcel Gort, Jason H Anderson
    Abstract:

    We consider the extent to which the bit-level representation of variables can be used to optimize Hardware generated by high-level synthesis (HLS). Two approaches to bit-level Optimization are considered (individually and together): 1) range analysis, and 2) bitmask analysis. Range analysis aims to predetermine min/max ranges for variables to reduce the bitwidth required to represent variables in Hardware. Bitmask analysis characterizes individual bits within a word as either constants (1 or 0), sign bits, or unknowns, where constants/don't-cares permit Hardware to be eliminated under certain conditions. Static compiler-based analysis is contrasted with dynamic profiling-based analysis in terms of their potential to impact area and speed of HLS-generated Hardware. For a set of benchmarks implemented in the Altera Cyclone II FPGA, results show bit-level Optimizations in HLS based on static analysis reduce circuit area by 9%, on average, while additional Optimizations based on dynamic analysis provide 34% area reduction.

  • ASP-DAC - Range and bitmask analysis for Hardware Optimization in high-level synthesis
    2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC), 2013
    Co-Authors: Marcel Gort, Jason H Anderson
    Abstract:

    We consider the extent to which the bit-level representation of variables can be used to optimize Hardware generated by high-level synthesis (HLS). Two approaches to bit-level Optimization are considered (individually and together): 1) range analysis, and 2) bitmask analysis. Range analysis aims to predetermine min/max ranges for variables to reduce the bitwidth required to represent variables in Hardware. Bitmask analysis characterizes individual bits within a word as either constants (1 or 0), sign bits, or unknowns, where constants/don't-cares permit Hardware to be eliminated under certain conditions. Static compiler-based analysis is contrasted with dynamic profiling-based analysis in terms of their potential to impact area and speed of HLS-generated Hardware. For a set of benchmarks implemented in the Altera Cyclone II FPGA, results show bit-level Optimizations in HLS based on static analysis reduce circuit area by 9%, on average, while additional Optimizations based on dynamic analysis provide 34% area reduction.

Antonio Teixeira - One of the best experts on this subject based on the ideXlab platform.