Abstract—The host-SIMD style heterogeneous multi-processor architecture offers high computing performance and user-friendly programmability. It explores both task level parallelism and data level parallelism by the on-chip multiple SIMD co-processors. For embedded DSP applications with predictable computing feature, this architecture can be further optimized for performance, implementation cost and power consumption. The optimization could be done by improving the SIMD processing efficiency and reducing redundant memory accesses and data shuffle operations. This paper introduces one effective approach by designing a software programmable multi-bank memory system for SIMD processors. Both the hardware architecture and software programming mo...
Numerous applications gather increasing amounts of data, which have to be managed and queried. Diffe...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
Recent extensions to microprocessor instruction sets are intended to speed-up multimedia algorithms ...
Processor clock frequencies and the related performance improvements recently stagnated due to sever...
Abstract. The rapid growth of multimedia applications has been putting high pressure on the processi...
Abstract- This paper targets data-parallel applications which are also computa tion-intensive. It pr...
The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient archit...
Energy efficiency has become one of the most important topics in computing. To meet the ever increas...
Abstract. Reconfigurable Architecture (RA), which provides extremely high en-ergy efficiency for cer...
Coarse-Grained Reconfigurable Architecture (CGRA) is a very promising platform that provides fast tu...
Reconfigurable Architecture (RA), which provides extremely high energy efficiency for certain domain...
Energy efficiency is one of the most important metrics in embedded processor design. The use of wide...
Today’s multimedia and DSP applications impose requirements on performance and power consumption tha...
Most of today’s commodity processors have single-instruction multiple-data (SIMD) instructions built...
Modern CPUs have instructions that allow basic operations to be performed on several data elements i...
Numerous applications gather increasing amounts of data, which have to be managed and queried. Diffe...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
Recent extensions to microprocessor instruction sets are intended to speed-up multimedia algorithms ...
Processor clock frequencies and the related performance improvements recently stagnated due to sever...
Abstract. The rapid growth of multimedia applications has been putting high pressure on the processi...
Abstract- This paper targets data-parallel applications which are also computa tion-intensive. It pr...
The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient archit...
Energy efficiency has become one of the most important topics in computing. To meet the ever increas...
Abstract. Reconfigurable Architecture (RA), which provides extremely high en-ergy efficiency for cer...
Coarse-Grained Reconfigurable Architecture (CGRA) is a very promising platform that provides fast tu...
Reconfigurable Architecture (RA), which provides extremely high energy efficiency for certain domain...
Energy efficiency is one of the most important metrics in embedded processor design. The use of wide...
Today’s multimedia and DSP applications impose requirements on performance and power consumption tha...
Most of today’s commodity processors have single-instruction multiple-data (SIMD) instructions built...
Modern CPUs have instructions that allow basic operations to be performed on several data elements i...
Numerous applications gather increasing amounts of data, which have to be managed and queried. Diffe...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
Recent extensions to microprocessor instruction sets are intended to speed-up multimedia algorithms ...