Skip to content

razor1991/hucx

 
 

Repository files navigation



Unified Communication X

Unified Communication X (UCX) provides an optimized communication layer for Message Passing (MPI), PGAS/OpenSHMEM libraries and RPC/data-centric applications.

UCX utilizes high-speed networks for inter-node communication, and shared memory mechanisms for efficient intra-node communication.

Using UCX

Release Builds

Building UCX is typically a combination of running "configure" and "make". Execute the following commands to install the UCX system from within the directory at the top of the tree:

$ ./autogen.sh
$ ./contrib/configure-release --prefix=/where/to/install
$ make -j8
$ make install

NOTE: Compiling support for various networks or other specific hardware may require additional command line flags when running configure.

Developer Builds

$ ./autogen.sh
$ ./contrib/configure-devel --prefix=$PWD/install-debug

*** NOTE: Developer builds of UCX typically include a large performance penalty at run-time because of extra debugging code.

Running internal unit tests

$ make -C test/gtest test

Build RPM package

$ contrib/buildrpm.sh -s -b

Build DEB package

$ dpkg-buildpackage -us -uc

Build Doxygen documentation

$ make docs

OpenMPI and OpenSHMEM installation with UCX

Wiki page

MPICH installation with UCX

Wiki page

UCX Performance Test

Start server:

$ ./src/tools/perf/ucx_perftest -c 0

Connect client:

$ ./src/tools/perf/ucx_perftest <server-hostname> -t tag_lat -c 1

Note: the -c flag sets CPU affinity. If running both commands on same host, make sure you set the affinity to different CPU cores.

Our Community

Licenses

UCX is licensed as:

Contributor Agreement and Guidelines

In order to contribute to UCX, please sign up with an appropriate Contributor Agreement.

Follow these instructions when submitting contributions and changes.

UCX Publications

To reference UCX in a publication, please use the following entry:

@inproceedings{shamis2015ucx,
  title={UCX: an open source framework for HPC network APIs and beyond},
  author={Shamis, Pavel and Venkata, Manjunath Gorentla and Lopez, M Graham and Baker, Matthew B and Hernandez, Oscar and Itigin, Yossi and Dubman, Mike and Shainer, Gilad and Graham, Richard L and Liss, Liran and others},
  booktitle={2015 IEEE 23rd Annual Symposium on High-Performance Interconnects},
  pages={40--43},
  year={2015},
  organization={IEEE}
}

To reference the UCX website:

@misc{openucx-website,
    title = {{The Unified Communication X Library}},
    key = {{{The Unified Communication X Library}},
    howpublished = {{\url{http://www.openucx.org}}}
}

UCX Architecture

Component Role Description
UCP Protocol Implements high-level abstractions such as tag-matching, streams, connection negotiation and establishment, multi-rail, and handling different memory types
UCT Transport Implements low-level communication primitives such as active messages, remote memory access, and atomic operations
UCS Services A collection of data structures, algorithms, and system utilities for common use
UCM Memory Intercepts memory allocation and release events, used by the memory registration cache

Supported Transports

Supported CPU Architectures

Huawei Optimization Introduction

Based on performance consideration, UCX DO NOT provide the functionalities related to transmission security.

There are three optimized collective operations:

  • MPI_Allreduce
  • MPI_Bcast
  • MPI_Barrier

New algorithms are as follows:

  • Binomial tree
  • Ring
  • Recursive
  • Topo-aware Binomial tree
  • Topo-aware K-nomial tree
  • Topo-aware Recursive + Binomial(intra)
  • Topo-aware Recursive + K-nomial(intra)

Select specific algorithm with parameters which is showed in the table below.

Bcast:

UCX_BUILTIN_BCAST_ALGORITHM Algorithm
1 Binomial tree
2 Topo-aware Binomial tree
3 Topo-aware K-nomial tree
4 Topo-aware K-nomial tree + Binomial tree(intra)

Allreduce:

UCX_BUILTIN_ALLREDUCE_ALGORITHM Algorithm
1 Recursive
2 Topo-aware Recursive + Binomial(intra)(Node)
3 Topo-aware Recursive + Binomial(intra)(Socket)
4 Ring
5 Topo-aware Recursive + K-nomial (intra)(Node)
6 Topo-aware Recursive + K-nomial (intra)(Socket)
7 Topo-aware K-nomial(Node)
8 Topo-aware K-nomial(Socket)

Barrier:

UCX_BUILTIN_BARRIER_ALGORITHM Algorithm
1 Recursive
2 Topo-aware Recursive + Binomial(intra)(Node)
3 Topo-aware Recursive + Binomial(intra)(Socket)
4 Topo-aware Recursive + K-nomial (intra)(Node)
5 Topo-aware Recursive + K-nomial (intra)(Socket)
6 Topo-aware K-nomial(Node)
7 Topo-aware K-nomial(Socket)

Packages

No packages published

Languages

  • C 60.5%
  • C++ 35.3%
  • M4 1.4%
  • Java 1.4%
  • Shell 0.7%
  • Makefile 0.5%
  • Other 0.2%