ii Working with modern architectures for high performance applications is increasingly more diffi-cult for programmers as the complexity of both the system architectures and software continue to increase. The level of hand tuning and native adaptations required to achieve high performance comes at the cost of limiting the portability of the software. For instance, we show that a compute intensive DCT algorithm performs better on graphic processors than the best algorithm for x86. In particular, limited portability is true for cyclic multimedia workloads, a set of programs that run continuously with strict requirements for high performance and low latency. An example of a typical multimedia workload is a pipeline of many small image processi...
This paper aims to provide a quantitative understanding of the performance of image and video proces...
This research aims to explore possible solutions to improvementof performance in multimedia processo...
Implementing a real-time image-processing algorithm on a serial processor is difficult to achieve b...
Processor architectures have been evolving quickly since the introduction of the central processing ...
In this document, we study the characteristics of various multimedia applications in an attempt to g...
Multimedia applications are an increasingly important workload for a large range of systems in-cludi...
As applications such as image processing, audio playback, and video decompression become more popula...
The main challenge for reducing the design effort cost of complex systems on chip is to pursue more ...
Current High Performance Embedded Architectures offer architectural improvements over previous gener...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
Multimedia applications are an increasingly important workload for general-purpose processors. This ...
In this thesis, I investigate issues in the development of continuous media (CM) applications. CM ap...
Multimedia systems combine the digital form of images, graphics, audio, electronic signals, or video...
Concurrently exploring both algorithmic and architectural optimizations is a new design paradigm. Th...
The well-known wave-front parallelization is proposed for parallel H.264/AVC videoprocessing. Under ...
This paper aims to provide a quantitative understanding of the performance of image and video proces...
This research aims to explore possible solutions to improvementof performance in multimedia processo...
Implementing a real-time image-processing algorithm on a serial processor is difficult to achieve b...
Processor architectures have been evolving quickly since the introduction of the central processing ...
In this document, we study the characteristics of various multimedia applications in an attempt to g...
Multimedia applications are an increasingly important workload for a large range of systems in-cludi...
As applications such as image processing, audio playback, and video decompression become more popula...
The main challenge for reducing the design effort cost of complex systems on chip is to pursue more ...
Current High Performance Embedded Architectures offer architectural improvements over previous gener...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
Multimedia applications are an increasingly important workload for general-purpose processors. This ...
In this thesis, I investigate issues in the development of continuous media (CM) applications. CM ap...
Multimedia systems combine the digital form of images, graphics, audio, electronic signals, or video...
Concurrently exploring both algorithmic and architectural optimizations is a new design paradigm. Th...
The well-known wave-front parallelization is proposed for parallel H.264/AVC videoprocessing. Under ...
This paper aims to provide a quantitative understanding of the performance of image and video proces...
This research aims to explore possible solutions to improvementof performance in multimedia processo...
Implementing a real-time image-processing algorithm on a serial processor is difficult to achieve b...