This paper investigates the scalability of macroblock (MB) level parallelization of the H.264 decoder for High Definition (HD) applications. The study has three parts. First, we present a formal model for predicting the maximum performance that can be obtained, taking into account the variable processing time of tasks and thread synchronization overhead. Second, we describe an implementation on a real multiprocessor architecture, including a comparison of different scheduling strategies and a profiling analysis that identifies the performance bottlenecks. Finally, we use a trace-driven simulation methodology to identify acceleration opportunities for removing the main bottlenecks, covering the acceleration potential of the entropy decoding stage and of thread synchronization and scheduling. Our study presents a quantitative analysis of the main bottlenecks of the application and estimates the acceleration levels required to make the MB-level parallel decoder scalable.
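As a rough illustration of the kind of upper bound such a model starts from, the sketch below computes the ideal wavefront limits for one frame. It is not the paper's formal model: it assumes a constant, unit decode time per MB and no synchronization overhead, which are exactly the simplifications the paper relaxes. It also assumes the usual H.264 intra-frame dependencies (each MB waits for its left, up-left, up, and up-right neighbours). The function name `wavefront_bounds` and the 120x68 MB frame size (1920x1088 pixels) are illustrative choices.

```python
from collections import Counter


def wavefront_bounds(width_mbs, height_mbs):
    """Ideal MB-level parallelism bounds for one frame.

    Assumptions (not taken from the paper): every MB takes one time slot,
    synchronization is free, and each MB depends on its left, up-left,
    up, and up-right neighbours.
    """
    # start[y][x] = earliest time slot in which MB (x, y) can be decoded.
    start = [[0] * width_mbs for _ in range(height_mbs)]
    for y in range(height_mbs):
        for x in range(width_mbs):
            deps = [(x - 1, y), (x - 1, y - 1), (x, y - 1), (x + 1, y - 1)]
            ready = [
                start[dy][dx] + 1
                for dx, dy in deps
                if 0 <= dx < width_mbs and 0 <= dy < height_mbs
            ]
            # An MB with no neighbours inside the frame starts at slot 0.
            start[y][x] = max(ready, default=0)

    # Count how many MBs share each time slot.
    per_slot = Counter(slot for row in start for slot in row)
    total = width_mbs * height_mbs
    critical_path = max(per_slot) + 1  # number of time slots needed
    return {
        "total_mbs": total,
        "critical_path": critical_path,
        "max_parallel_mbs": max(per_slot.values()),
        "max_speedup": total / critical_path,
    }


# Full HD frame: 1920x1088 pixels = 120x68 macroblocks of 16x16 pixels.
bounds = wavefront_bounds(120, 68)
print(bounds)
```

Under these idealized assumptions, a 120x68 MB frame needs 254 time slots and exposes at most 60 independent MBs at once, so the speedup cannot exceed about 32 regardless of core count. Variable MB decode times and synchronization costs lower the achievable figure further.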
Citation: Alvarez, M. [et al.]. Scalability of Macroblock-level parallelism for H.264 decoding. A: International Conference on Parallel and Distributed Systems. "The IEEE Fifteenth International Conference on Parallel and Distributed Systems (ICPADS-2009)". Shenzhen: IEEE Computer Society Publications, 2010, p. 236-243.