TY - GEN
T1 - Non-threaded and threaded approaches to Multirail communication with uDAPL
AU - Cai, Jie
AU - Rendell, Alistair P.
AU - Strazdins, Peter E.
PY - 2009
Y1 - 2009
N2 - uDAPL is a portable, platform-independent communication library that provides RDMA as well as send/recv operations. Several well-known software packages, such as Open MPI, MVAPICH2, Intel MPI, and Cluster OpenMP, have sought to take advantage of uDAPL's portability. However, network bandwidth limitations can still be a bottleneck for applications using this software. Using a "multirail" network is one way to bypass this bottleneck. In this paper, we design a non-threaded and a threaded approach to improve the performance of uDAPL over multirail-configured clusters. The two approaches are evaluated on an InfiniBand cluster with different multirail configurations. The results show that the threaded approach improves uni-directional bandwidth by 33% and 148% on the multi-port and multi-HCA configured networks respectively, and the non-threaded approach improves uni-directional bandwidth by ∼90% on the multi-HCA configured network. Similar improvements are achieved for bi-directional bandwidth.
AB - uDAPL is a portable, platform-independent communication library that provides RDMA as well as send/recv operations. Several well-known software packages, such as Open MPI, MVAPICH2, Intel MPI, and Cluster OpenMP, have sought to take advantage of uDAPL's portability. However, network bandwidth limitations can still be a bottleneck for applications using this software. Using a "multirail" network is one way to bypass this bottleneck. In this paper, we design a non-threaded and a threaded approach to improve the performance of uDAPL over multirail-configured clusters. The two approaches are evaluated on an InfiniBand cluster with different multirail configurations. The results show that the threaded approach improves uni-directional bandwidth by 33% and 148% on the multi-port and multi-HCA configured networks respectively, and the non-threaded approach improves uni-directional bandwidth by ∼90% on the multi-HCA configured network. Similar improvements are achieved for bi-directional bandwidth.
UR - http://www.scopus.com/inward/record.url?scp=73449136872&partnerID=8YFLogxK
U2 - 10.1109/NPC.2009.10
DO - 10.1109/NPC.2009.10
M3 - Conference contribution
SN - 9780769538372
T3 - NPC 2009 - 6th International Conference on Network and Parallel Computing
SP - 233
EP - 239
BT - NPC 2009 - 6th International Conference on Network and Parallel Computing
T2 - NPC 2009 - 6th International Conference on Network and Parallel Computing
Y2 - 19 October 2009 through 21 October 2009
ER -