Microarchitectures

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 26319 Experts worldwide ranked by ideXlab platform

Antonio Gonzalez - One of the best experts on this subject based on the ideXlab platform.

  • CGO - Heterogeneous Clustered VLIW Microarchitectures
    International Symposium on Code Generation and Optimization (CGO'07), 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

  • heterogeneous clustered vliw Microarchitectures
    Symposium on Code Generation and Optimization, 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

  • IPDPS - Inherently workload-balanced clustered microarchitecture
    19th IEEE International Parallel and Distributed Processing Symposium, 2005
    Co-Authors: Jaume Abella, Antonio Gonzalez
    Abstract:

    The performance of clustered Microarchitectures relies on steering schemes that try to find the best trade-off between workload balance and inter-cluster communication penalties. In previously proposed clustered processors, reducing communication penalties and balancing the workload are opposite targets, since improving one usually implies a detriment in the other. In this paper we propose a new clustered microarchitecture that can minimize communication penalties without compromising workload balance. The key idea is to arrange the clusters in a ring topology in such a way that results of one cluster can be forwarded to the neighbor cluster with a very short latency. In this way, minimizing communication penalties is favored when the producer of a value and its consumer are placed in adjacent clusters, which also favors workload balance. The proposed microarchitecture is shown to outperform a state-of-the-art clustered processor. For instance, for an 8-cluster configuration and just one fully pipelined unidirectional bus, 15% speedup is achieved on average for FP programs.

  • on chip interconnects and instruction steering schemes for clustered Microarchitectures
    IEEE Transactions on Parallel and Distributed Systems, 2005
    Co-Authors: J.-m. Parcerisa, Antonio Gonzalez, J. Sahuquillo, J. Duato
    Abstract:

    Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection networks for clustered superscalar Microarchitectures. This new class of interconnects has demands and characteristics different from traditional multiprocessor networks. In particular, in a clustered microarchitecture, a low intercluster communication latency is essential for high performance. We propose some point-to-point cluster interconnects and new improved instruction steering schemes. The results show that these point-to-point interconnects achieve much better performance than bus-based ones, and that the connectivity of the network together with effective steering schemes are key for high performance. We also show that these interconnects can be built with simple hardware and achieve a performance close to that of an idealized contention-free model.

  • efficient interconnects for clustered Microarchitectures
    International Conference on Parallel Architectures and Compilation Techniques, 2002
    Co-Authors: J.-m. Parcerisa, Antonio Gonzalez, J. Sahuquillo, J. Duato
    Abstract:

    Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection networks for clustered Microarchitectures. This new class of interconnects has different demands and characteristics than traditional multiprocessor networks. In a clustered microarchitecture, a low inter-cluster communication latency is essential for high performance.We propose point-to-point interconnects together with an effective latency-aware instruction steering scheme and show that they achieve much better performance than bus-based interconnects. The results show that the connectivity of the network together with latency-aware steering schemes are key for high performance. We also show that these interconnects can be built with simple hardware and achieve a performance close to that of an idealized contention-free model.

David Kaeli - One of the best experts on this subject based on the ideXlab platform.

  • CGO - Heterogeneous Clustered VLIW Microarchitectures
    International Symposium on Code Generation and Optimization (CGO'07), 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

  • heterogeneous clustered vliw Microarchitectures
    Symposium on Code Generation and Optimization, 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

Jarkko Niiranen - One of the best experts on this subject based on the ideXlab platform.

  • lattice structures as thermoelastic strain gradient metamaterials evidence from full field simulations and applications to functionally step wise graded beams
    Composites Part B-engineering, 2019
    Co-Authors: Sergei Khakalo, Jarkko Niiranen
    Abstract:

    Abstract The present work investigates the mechanical and thermomechanical bending response of beam structures possessing a triangular lattice microarchitecture. The validity of generalized continuum models, in general, and the associated dimensionally reduced models for functionally step-wise-graded microarchitectural beams, in particular, is approved by full-field finite element simulations. Most importantly, the necessity of the temperature gradient in the Helmholtz free energy is substantiated. The corresponding strong and weak forms for the associated Bernoulli–Euler and Timoshenko models of functionally graded beams are derived. The effective classical thermoelastic properties of a metamaterial with a triangular lattice microarchitecture are defined by means of computational homogenization. The additional length scale parameter involved in the generalized beam models, and associated to the particular triangular microarchitecture, is calibrated by fitting the mechanical bending responses of a series of lattice beams to the analytical solutions of the corresponding theoretical models. Strongly size-dependent mechanical and size-independent thermal bending responses are observed for both thin and thick beams with triangular lattice Microarchitectures. Finally, different lattice beams with varying Microarchitectures are introduced and shown to behave as generalized functionally step-wise-graded beams with respect to the higher-order elastic modulus, i.e., the length scale parameter varying in the direction of the beam axis.

Jianlin Shi - One of the best experts on this subject based on the ideXlab platform.

Alex Aletà - One of the best experts on this subject based on the ideXlab platform.

  • CGO - Heterogeneous Clustered VLIW Microarchitectures
    International Symposium on Code Generation and Optimization (CGO'07), 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

  • heterogeneous clustered vliw Microarchitectures
    Symposium on Code Generation and Optimization, 2007
    Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli
    Abstract:

    Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.