会议专题

GOP-Level Parallelization of the H.264 Decoder without a Start-Code Scanner

Recent researches on parallelization of H.264 video decoders focused on fine-grain methods. These works led to designs having very short latencies and good memory usage. However, they could not reach the scalability of Group of Pictures (GOP) level approaches although assuming a well-designed entropy decoder which can feed the increasing number of parallel working cores. We would like to introduce a GOP-level approach due to its high scalability, mentioning solution approaches for the well-known latency and memory issues. Our design revokes the need to a scanner for GOP start-codes which was used in the earlier methods. This approach lets all the cores work on the decoding task. Although the performance on shared memory systems is subject to improve, we have observed a one-to-one linear speedup in parallel working nodes. We have tested our method using a cluster of 5 machines each having 2 processors with 4 cores. The decoding is 5 times faster when we run only one process in each machine, that is we saw one-to-one linear speedup when there is no memory shortage. We observed a maximum of 11 times speedup when using all of the 40 cores distributed among 5 machines.

Parallel and Distributed Processing H.264 Decoder Image/Video Processing and Coding High-Performance Computing Video Compression

Ahmet Giirhanh Charlie Chung-Ping Chen Shih-Hao Hung

Graduate Institute of Electronics Engineering National Taiwan University Taipei, Taiwan 10617, R.O.C Graduate Institute of Networking and Multimedia National Taiwan University Taipei, Taiwan 10617, R.O

国际会议

2010 2nd International Conference on Signal Processing System(2010年信号处理系统国际会议 ICSPS 2010)

大连

英文

2308-2311

2010-07-05(万方平台首次上网日期,不代表论文的发表时间)