We examine the importance of problem formulation for the solution of large-scale optimization problems on high-performance architectures. We use limited memory variable metric methods to illustrate performance issues. We show that the performance of these algorithms is drastically affected by application implementation. Model applications are drawn from the MINPACK-2 test problem collection, with numerical results from a super-scalar architecture (IBM RS6000/370), a vector architecture (CRAY-2), and a massively parallel architecture (Intel DELTA). Key words. optimization, large-scale, limited memory, variable metric, performance evaluation, vector architecture, parallel architecture. AMS subject classifications. 65Y05, 65Y20, 65K05, 65K...
In this paper we explore the characteristics of numerically intensive programs and explore their e...
Scientific programs are typically characterized as floating-point intensive loop-dominated tasks wit...
In this paper we analyze the effect of compiler optimizations on fine grain parallelism in scalar pr...
The recent explosion in size and complexity of datasets and the increased availability of computatio...
Abstract. I consider the problem of the domain-specific optimization of programs. I review different...
The basic architectures of vector and parallel computers and their properties are presented followed...
Modeling and analysis techniques are used to investigate the performance of a massively parallel ver...
The area of parallel and distributed computing has grown very fast in the past few decades with the ...
Big data processing has recently gained a lot of attention both from academia and industry. The term...
The next frontier of high performance computing is the Exascale, and this will certainly stand as a ...
Vector architectures have long been the architecture of choice for numerical high performance comput...
While parallel computing offers an attractive perspective for the future, developing efficient paral...
During the last decade the scientific computing community has optimized many applications for execu...
233 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1987. The peak performance of a mul...