Overcoming the GPU Memory Limitation on FDTD Through the Use of Overlapping Subgrids
The method Finite Difference Time Domain (FDTD) is widely used in electromagnetic simulations. Since this method is a data intensive and compulation intensive problem, there are a lot of initiatives to improve the scalability and the performance of the FDTD. Specifically the use of GPU to accelerate the FDTD is in focus, which has a good cost-benefit, offering a speedup of hundreds of times if compared to the traditional CPU computation. Nevertheless, the current implementations of FDTD in CPU need GPU devices with memory capacity to allocate the roll data needed for the simulation, which in many cases could require additional investments to increase the domain of the simulation. This work proposes a solution of FDTD over GPU that uses overlapping subgrids to increase the domain of the FDTD simulation through the use of CPU memory. The redundancy of the overlapping subgrids is used to minimize the memory traffic between GPU and CPU, increasing the performance. Tests of performance show that the solution combines the power of GPU with the memory facility of the CPU, allowing the accomplishment of simulations that require more memory than what is offered by the GPU device at a damaging performance cost lower than 10%.
Leonardo Mattes Sergio Kofuji
Laboratorio de Sistemas Integraveis (LSI) - Universidade de Sao Paulo (USP) Av. Prof. Luciano Gualberto, 158. Trav.3. CEP: 05508-900 - Sao Paulo - SP - Brasil
国际会议
成都
英文
1536-1539
2010-05-08(万方平台首次上网日期,不代表论文的发表时间)