Parallel Realization - Explore the Science & Experts

The Experts below are selected from a list of 321 Experts worldwide ranked by ideXlab platform

Junjie Yang - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

International Conference on Multimedia and Expo, 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article
ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article

Xiangwen Wang - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

International Conference on Multimedia and Expo, 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article
ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article

Min Chen - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

International Conference on Multimedia and Expo, 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article
ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article

Li Song - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

International Conference on Multimedia and Expo, 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article
ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013

Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang

Abstract:

The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

15 days free trial to Access Article

Piotr Ostalczyk - One of the best experts on this subject based on the ideXlab platform.

ECC - Parallel Realization of the variable-, fractional-order difference equation

2019 18th European Control Conference (ECC), 2019

Co-Authors: Piotr Ostalczyk

Abstract:

In the paper a block–matrix form of the variable–, fractional–order linear time–invariant difference equation is investigated. It is proved that similar to the state–space representation but with linearly increasing matrix–vector dimensions is useful in the difference equation solution. The block diagram of the system is described by the Jordan–like canonical form. It uses the variable–, fractional–order integrators and proportional blocks. The proposed approach simplifies numerical calculations of solutions due to a possibility of Parallel computations. Analysis is supported by the numerical example.

15 days free trial to Access Article

Discover everything there is to know about the scientific topic Parallel Realization with ideXlab!

Junjie Yang - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

Xiangwen Wang - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

Min Chen - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

Li Song - One of the best experts on this subject based on the ideXlab platform.

Paralleling variable block size motion estimation of hevc on cpu plus gpu platform

ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform

Piotr Ostalczyk - One of the best experts on this subject based on the ideXlab platform.

ECC - Parallel Realization of the variable-, fractional-order difference equation