Parallel Realization

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 321 Experts worldwide ranked by ideXlab platform

Junjie Yang - One of the best experts on this subject based on the ideXlab platform.

  • Paralleling variable block size motion estimation of hevc on cpu plus gpu platform
    International Conference on Multimedia and Expo, 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

  • ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform
    2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

Xiangwen Wang - One of the best experts on this subject based on the ideXlab platform.

  • Paralleling variable block size motion estimation of hevc on cpu plus gpu platform
    International Conference on Multimedia and Expo, 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

  • ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform
    2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

Min Chen - One of the best experts on this subject based on the ideXlab platform.

  • Paralleling variable block size motion estimation of hevc on cpu plus gpu platform
    International Conference on Multimedia and Expo, 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

  • ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform
    2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

Li Song - One of the best experts on this subject based on the ideXlab platform.

  • Paralleling variable block size motion estimation of hevc on cpu plus gpu platform
    International Conference on Multimedia and Expo, 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

  • ICME Workshops - Paralleling variable block size motion estimation of HEVC on CPU plus GPU platform
    2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2013
    Co-Authors: Xiangwen Wang, Li Song, Min Chen, Junjie Yang
    Abstract:

    The emerging HEVC standard supports up to 12 variable block sizes ranging from 4×8/8×4 to 64×64 to conduct motion estimation (ME) and motion compensation (MC). This feature contributes considerable coding gain compared with 7 variable block sizes in H.264/AVC at the cost of huge computational complexity. In the test model HM, ME with variable block sizes (VBSME) may be called up to 425 times for the mode decision procedure of one CTU (Coding Tree Unit). Obviously, VBSME becomes the bottleneck for real time encoding. In this paper, we focus on Parallel Realization architecture design of VBSME in HEVC. Firstly, an efficient Parallel encoder framework is proposed for CPU plus GPU platform. With the framework, VBSME, fractional-pixel image interpolation and border padding processes run on GPU without burden on the host CPU. Secondly, for workload balance between CPU and GPU, a fast Prediction Unit partition mode decision algorithm is also proposed. Lastly, the Parallel Realization strategy of VBSME on GPU is improved for ME compression performance improvement. Experimental results based on the NVIDIA's C2050 GPU show that the speed of the VBSME strategy on GPU is about 113 times faster than the one on CPU.

Piotr Ostalczyk - One of the best experts on this subject based on the ideXlab platform.

  • ECC - Parallel Realization of the variable-, fractional-order difference equation
    2019 18th European Control Conference (ECC), 2019
    Co-Authors: Piotr Ostalczyk
    Abstract:

    In the paper a block–matrix form of the variable–, fractional–order linear time–invariant difference equation is investigated. It is proved that similar to the state–space representation but with linearly increasing matrix–vector dimensions is useful in the difference equation solution. The block diagram of the system is described by the Jordan–like canonical form. It uses the variable–, fractional–order integrators and proportional blocks. The proposed approach simplifies numerical calculations of solutions due to a possibility of Parallel computations. Analysis is supported by the numerical example.