The computation of a one-dimensional FFT on a c-dimensional torus multicomputer is analyzed. Different approaches are proposed which differ in the way they use the interconnection network. The first approach is based on the multidimensional index mapping technique for the FFT computation. The second approach starts from a hypercube algorithm and then embeds the hypercube onto the torus. The third approach reduces the communication cost of the hypercube algorithm by pipelining the communication operations. A novel methodology to pipeline the communication operations on a torus is proposed. Analytical models are presented to compare the different approaches. This comparison study shows that the best approach depends on the number of dimension...
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercube-connected mul...
AbstractThe present paper begins with a survey of various up-to-date parallel 3-D FFT algorithms and...
Abstract—In high-performance computing on distributed-memory systems, communication often represents...
The computation of a one-dimensional FFT on a c-dimensional torus multicomputer is analyzed. Differe...
In this paper, the computation of a one-dimensional FFT on a c-dimensional torus multicomputer is an...
In this paper, the problem of computing a one-dimensional FFT on a c-dimensional torus multicomputer...
The fast Fourier transform (FFT) is of intense interest to the scientific community. Its utility in...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercubeconnected mult...
In high-performance computing on distributed-memory systems, communication often represents a signif...
This paper addresses the problem of monodimensional (1D) FFT parallel computation on constant-valenc...
In this chapter we describe the architecture of a torus interconnect and its implementation on FPGAs...
Complex two-dimensional FFT's up to size 256 X 256 points were implemented on the Intel iPSC/System...
The Three Dimensional Fast Fourier Transform (3D-FFT) is commonly used to solve the partial differen...
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercube-connected mul...
AbstractThe present paper begins with a survey of various up-to-date parallel 3-D FFT algorithms and...
Abstract—In high-performance computing on distributed-memory systems, communication often represents...
The computation of a one-dimensional FFT on a c-dimensional torus multicomputer is analyzed. Differe...
In this paper, the computation of a one-dimensional FFT on a c-dimensional torus multicomputer is an...
In this paper, the problem of computing a one-dimensional FFT on a c-dimensional torus multicomputer...
The fast Fourier transform (FFT) is of intense interest to the scientific community. Its utility in...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercubeconnected mult...
In high-performance computing on distributed-memory systems, communication often represents a signif...
This paper addresses the problem of monodimensional (1D) FFT parallel computation on constant-valenc...
In this chapter we describe the architecture of a torus interconnect and its implementation on FPGAs...
Complex two-dimensional FFT's up to size 256 X 256 points were implemented on the Intel iPSC/System...
The Three Dimensional Fast Fourier Transform (3D-FFT) is commonly used to solve the partial differen...
In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercube-connected mul...
AbstractThe present paper begins with a survey of various up-to-date parallel 3-D FFT algorithms and...
Abstract—In high-performance computing on distributed-memory systems, communication often represents...