Microarchitectures - Explore the Science & Experts

The Experts below are selected from a list of 26319 Experts worldwide ranked by ideXlab platform

Antonio Gonzalez - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

International Symposium on Code Generation and Optimization (CGO'07), 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article
heterogeneous clustered vliw Microarchitectures

Symposium on Code Generation and Optimization, 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article
IPDPS - Inherently workload-balanced clustered microarchitecture

19th IEEE International Parallel and Distributed Processing Symposium, 2005

Co-Authors: Jaume Abella, Antonio Gonzalez

Abstract:

The performance of clustered Microarchitectures relies on steering schemes that try to find the best trade-off between workload balance and inter-cluster communication penalties. In previously proposed clustered processors, reducing communication penalties and balancing the workload are opposite targets, since improving one usually implies a detriment in the other. In this paper we propose a new clustered microarchitecture that can minimize communication penalties without compromising workload balance. The key idea is to arrange the clusters in a ring topology in such a way that results of one cluster can be forwarded to the neighbor cluster with a very short latency. In this way, minimizing communication penalties is favored when the producer of a value and its consumer are placed in adjacent clusters, which also favors workload balance. The proposed microarchitecture is shown to outperform a state-of-the-art clustered processor. For instance, for an 8-cluster configuration and just one fully pipelined unidirectional bus, 15% speedup is achieved on average for FP programs.

15 days free trial to Access Article
on chip interconnects and instruction steering schemes for clustered Microarchitectures

IEEE Transactions on Parallel and Distributed Systems, 2005

Co-Authors: J.-m. Parcerisa, Antonio Gonzalez, J. Sahuquillo, J. Duato

Abstract:

Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection networks for clustered superscalar Microarchitectures. This new class of interconnects has demands and characteristics different from traditional multiprocessor networks. In particular, in a clustered microarchitecture, a low intercluster communication latency is essential for high performance. We propose some point-to-point cluster interconnects and new improved instruction steering schemes. The results show that these point-to-point interconnects achieve much better performance than bus-based ones, and that the connectivity of the network together with effective steering schemes are key for high performance. We also show that these interconnects can be built with simple hardware and achieve a performance close to that of an idealized contention-free model.

15 days free trial to Access Article
efficient interconnects for clustered Microarchitectures

International Conference on Parallel Architectures and Compilation Techniques, 2002

Co-Authors: J.-m. Parcerisa, Antonio Gonzalez, J. Sahuquillo, J. Duato

Abstract:

Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection networks for clustered Microarchitectures. This new class of interconnects has different demands and characteristics than traditional multiprocessor networks. In a clustered microarchitecture, a low inter-cluster communication latency is essential for high performance.We propose point-to-point interconnects together with an effective latency-aware instruction steering scheme and show that they achieve much better performance than bus-based interconnects. The results show that the connectivity of the network together with latency-aware steering schemes are key for high performance. We also show that these interconnects can be built with simple hardware and achieve a performance close to that of an idealized contention-free model.

15 days free trial to Access Article

David Kaeli - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

International Symposium on Code Generation and Optimization (CGO'07), 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article
heterogeneous clustered vliw Microarchitectures

Symposium on Code Generation and Optimization, 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article

Jarkko Niiranen - One of the best experts on this subject based on the ideXlab platform.

lattice structures as thermoelastic strain gradient metamaterials evidence from full field simulations and applications to functionally step wise graded beams

Composites Part B-engineering, 2019

Co-Authors: Sergei Khakalo, Jarkko Niiranen

Abstract:

Abstract The present work investigates the mechanical and thermomechanical bending response of beam structures possessing a triangular lattice microarchitecture. The validity of generalized continuum models, in general, and the associated dimensionally reduced models for functionally step-wise-graded microarchitectural beams, in particular, is approved by full-field finite element simulations. Most importantly, the necessity of the temperature gradient in the Helmholtz free energy is substantiated. The corresponding strong and weak forms for the associated Bernoulli–Euler and Timoshenko models of functionally graded beams are derived. The effective classical thermoelastic properties of a metamaterial with a triangular lattice microarchitecture are defined by means of computational homogenization. The additional length scale parameter involved in the generalized beam models, and associated to the particular triangular microarchitecture, is calibrated by fitting the mechanical bending responses of a series of lattice beams to the analytical solutions of the corresponding theoretical models. Strongly size-dependent mechanical and size-independent thermal bending responses are observed for both thin and thick beams with triangular lattice Microarchitectures. Finally, different lattice beams with varying Microarchitectures are introduced and shown to behave as generalized functionally step-wise-graded beams with respect to the higher-order elastic modulus, i.e., the length scale parameter varying in the direction of the beam axis.

15 days free trial to Access Article

Jianlin Shi - One of the best experts on this subject based on the ideXlab platform.

controlled construction of monodisperse la2 moo4 3 yb tm Microarchitectures with upconversion luminescent property

Journal of Physical Chemistry C, 2008

Co-Authors: Zhenxing Chen, Na Zhang And, Jianlin Shi

Abstract:

Monodisperse La2(MoO4)3:Yb,Tm Microarchitectures with uniform waxberry-like morphology have been successfully constructed in large scale by a facile surfactant-assisted hydrothermal route, in which sodium lauryl sulfate (SLS) was used as the structure directing agent. It was found that the pH value was a crucial factor controlling the phase composition and purity, which was unaffected by surfactant SLS. The growth process of these waxberry-shaped Microarchitectures has been examined in detail, and it was proved that a special dissolution−recrystallization transformation mechanism as well as a preferential adsorption of SLS process was responsible for the morphology evolution of the La2(MoO4)3:Yb,Tm Microarchitectures. A mechanism for the formation of the waxberry-shape La2(MoO4)3:Yb,Tm Microarchitectures was put forward. The specially shaped architectures showed blue up-conversion emission properties.

15 days free trial to Access Article
synthesis and characterization of uniform spindle shaped Microarchitectures self assembled from aligned single crystalline nanowires of lanthanum phosphates

Crystal Growth & Design, 2007

Co-Authors: Lingxia Zhang, Hangrong Chen, Zile Hua, Jianlin Shi

Abstract:

A new synthetic approach using the Pluronic P123-assisted hydrothermal reaction of lanthanum phosphate and europium-doped lanthanum phosphate has been developed, which results in the formation of uniform spindle-shaped Microarchitectures most probably by a self-assembly process. Our results reveal that the obtained spindle-shaped Microarchitectures consist of several tens of aligned single-crystalline nanowires with smooth, well-defined facets and highly uniform morphologies. These well-defined spindle-shaped Microarchitectures show greatly enhanced photoluminescence in these compounds when compared to their counterparts of disordered arrangements. A possible formation mechanism for these spindle-shaped Microarchitectures is presented and discussed.

15 days free trial to Access Article
controlled construction of uniform pompon shaped Microarchitectures self assembled from single crystalline lanthanum molybdate nanoflakes

Langmuir, 2007

Co-Authors: Na Zhang, Hangrong Chen, Zile Hua, Jianlin Shi

Abstract:

Uniform three-dimensional La2(MoO4)3 nanostructures with a pompon shape have been successfully constructed by a simple surfactant-free hydrothermal approach via self-assembly from single-crystalline nanoflakes. The formation of the uniform pompon-shaped La2(MoO4)3 Microarchitectures is closely related to the presence of a proper amount of ammonium ions, and it is proposed that the pompon-shaped microarchitecture forms through an electrostatic attraction/repulsion effect between the oppositely charged flat surface and the edge of nanoflakes. Without the introduction of ammonium ions, no pompon-shaped Microarchitectures can be formed, and while under the presence of excess ammonium ions, the nanoflakes on the micropompons become amorphous, twisted, and rugged. The novel Microarchitectures of the product can be successfully modified from spherical to columelliform by using a mixed solvent of water/ethanol. This simple and efficient method may provide a practical reference to the controlled synthesis of other...

15 days free trial to Access Article

Alex Aletà - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

International Symposium on Code Generation and Optimization (CGO'07), 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article
heterogeneous clustered vliw Microarchitectures

Symposium on Code Generation and Optimization, 2007

Co-Authors: Alex Aletà, Josep Maria Codina, Antonio Gonzalez, David Kaeli

Abstract:

Increasing performance, while at the same time reducing power consumption, is a major design tradeoff in current microprocessors. In this paper, we investigate the potential of using a heterogeneous clustered VLIW microarchitecture. In the proposed microarchitecture, each cluster, the interconnection network and the supporting memory hierarchy can run at different frequencies and voltages. Some of the clusters can then be configured to be performanceoriented and run at high frequency, while the other clusters can be configured to be low-power-oriented and run at lower frequencies, thus reducing overall consumption. For this heterogeneous design to be effective, we need to select the most suitable frequencies and voltages for each component. We propose a scheme to choose these parameters based on a model that estimates the energy consumption and the execution time of floating-point codes at compile time. Finally, we present a modulo scheduling technique based on graph partitioning that exploits the opportunities presented on heterogeneous clustered Microarchitectures. Results show that the Energy-Delay2 product (ED2) can be significantly reduced by 15% on average for a microarchitecture with 4-clusters and by as much as 35% for selected programs.

15 days free trial to Access Article

Discover everything there is to know about the scientific topic Microarchitectures with ideXlab!

Antonio Gonzalez - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

heterogeneous clustered vliw Microarchitectures

IPDPS - Inherently workload-balanced clustered microarchitecture

on chip interconnects and instruction steering schemes for clustered Microarchitectures

efficient interconnects for clustered Microarchitectures

David Kaeli - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

heterogeneous clustered vliw Microarchitectures

Jarkko Niiranen - One of the best experts on this subject based on the ideXlab platform.

lattice structures as thermoelastic strain gradient metamaterials evidence from full field simulations and applications to functionally step wise graded beams

Jianlin Shi - One of the best experts on this subject based on the ideXlab platform.

controlled construction of monodisperse la2 moo4 3 yb tm Microarchitectures with upconversion luminescent property

synthesis and characterization of uniform spindle shaped Microarchitectures self assembled from aligned single crystalline nanowires of lanthanum phosphates

controlled construction of uniform pompon shaped Microarchitectures self assembled from single crystalline lanthanum molybdate nanoflakes

Alex Aletà - One of the best experts on this subject based on the ideXlab platform.

CGO - Heterogeneous Clustered VLIW Microarchitectures

heterogeneous clustered vliw Microarchitectures