Skip to content

andrasq/node-qrpc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Qrpc

Build Status Coverage Status

Qrpc is a very fast remote procedure call package.

Features:

  • low latency, 0.05 ms round trip
  • high throughput, 0.013 ms per call avg
  • efficient connection sharing with call and response multiplexing
  • out-of-order call completion
  • a single call can receive multiple responses, to stream response in chunks
  • request/response or one-way message sending
  • message is any serializable json object or a Buffer
  • can use user-specified serialization function
  • wire protocol is newline terminated strings
  • fast transfers of binary data in Buffers
  • familiar middleware-like handler(req, res, next) message handler signature
  • familiar function(err, ret) client callback signature
  • familiar createServer / listen / connect usage
  • can communicate over sockets, streams, or pretty much anything with read and write methods
  • very small, with minimal dependencies

Summary

The rpc system consists of one or more clients making calls and a server that processes the calls and sends back responses. Each call includes an optional argument; each response returns a data item and an optional error.

Any of the serializable JavaScript data types may be sent and received (numbers, strings, arrays, objects, booleans, null), as well as Buffers. Special objects (Date, RegExp, etc) lose their special properties across the rpc call. Undefined can not be serialized, and undefined fields are omitted.

Calls are transmitted and run in the order made (even though they may finish out of order). In particular, this means that receipt of a stream of one-way messages can be explicitly verified by inserting a callback-ed call into the message stream. (Eg the verifier in a logging server could make an rpc call to flush the data, thus ensuring that data has been both received and persisted.)

Server:

var qrpc = require('qrpc')
var server = qrpc.createServer()
server.addHandler('echo', function(req, res, next) {
    // echo sends two response messages
    var err = null
    res.write('test ran!')      // first response
    next(err, req.m)            // final response
})
server.listen(1337, function() {
    console.log("qrpc listening on port 1337")
})

Client:

var client = qrpc.connect(1337, 'localhost', function() {
    client.call('echo', {a: 1, b: 'test'}, function(err, ret) {
        // client callback is invoked on every response message
        console.log("reply from server:", ret)
        // => reply from server: 'test ran!'
        // => reply from server: { a: 1, b: 'test' }
    })
    client.call('echo', new Buffer("test"), function(err, ret) {
        console.log("reply from server:", ret)
        // => reply from server: 'test ran!'
        // => reply from server: <Buffer 74 65 73 74>
    })
})

Benchmark

Qrpc can sustain 200k calls per second. Full end-to-end throughput measured at the client is around 160k round-trip calls per second. Timings on a 32-bit i7-6700k Skylake at 4410 MHz running Linux 3.16-amd64. (64-bit kernel with 32-bit apps: double the memory of a full 64-bit system for free!)

$ node-v6.10.2 test/benchmark.js

rpc: listening on 1337
echo data: { a: 1, b: 2, c: 3, d: 4, e: 5 }
----
parallel: 50000 calls in 314 ms
series: 20000 calls in 697 ms
send to endpoint: 100000 in 38 ms
retrieved 100000 data in 245 ms
retrieved 20000 1k Buffers in 82 ms
parallel 1k buffers: 20000 in 228 ms
wrapped parallel: 50000 calls in 452 ms
parallel 1K object: 20000 calls in 1032 ms
logged 100000 200B lines in 483 ms syncing every 250 lines

Here are the original timings on the old AMD 3600 MHz Phenom II running the same Linux 3.16.0-amd64:

$ node-v0.10.29 test/benchmark.js

rpc: listening on 1337
echo data: { a: 1, b: 2, c: 3, d: 4, e: 5 }
parallel: 50000 calls in 780 ms
series: 20000 calls in 1130 ms
send to endpoint: 100000 in 87 ms
retrieved 100000 data in 825 ms
retrieved 20000 1k Buffers in 342 ms

The parallel rate is call throughput -- the rpc server decodes the calls, process them, and encodes and send the response. The times shown above include the client-side formatting and sending of the rpc messages; the server time to process the 50000 calls (read request, decode, dispatch, process, encode, write response) is 250 ms; the 780 ms shown above includes the client time to format and send send the request and to read, decode and deliver the response.

The series time is handling latency -- it is all-inclusive back-to-back round-trip time; each call is made only after the previous response has been received.

The send rate is the number of calls processed by the server, not including data prep time (but yes including the time to transmit, decode, and dispatch). Send-only mode is one-way message passing: the data will be acted on by the server but no acknowledgement, status or error is returned.

Retrieval is getting multiple responses for one request, for chunked data fetching. These tests make one call for every 10,000 responses, one data item (or one Buffer) per response.

The logging benchmark ships 200-byte log lines to the server with one-way messages, and every 250 lines makes a round-trip rpc call to sync them, ie to ensure that all preceding lines have been successfully received and persisted. qrpc servers process calls in transmission order, so for the sync to have been received all preceding calls must have been received as well.

Note that the logging test uses socket.setNoDelay() to turn off the Nagle algorithm on the client side. With Nagle enbaled, only 25 sync calls get through per second, ie the throughput drops from 200k lines/sec to 25 * 250 = 6.25k lines/sec.

Qrpc Server

The server listens for incoming messages, processes them, and returns the responses.

Calls are tagged with a the handler name string. Each handler appears similar to a framework middleware step, taking a request, response and next callback.

The response is returned via the next() callback, or via res.write() and res.end(). Any number of write() calls may be used, each will send a response message that will be pass to the client call's callback. end() and next() both send a final message and close the call. Once the call is closed, no more responses must be sent.

write() and end() return data. Errors may be returned with next(). Qrpc restores error objects so the client callback receives instanceof Error (note: these errors occurred in the handler code on the server, not on the client)

Calls may be kept open indefinitely, but each open call uses memory while holding on to the callback.

Calls do not time out. Write errors on the server side will not be noticed by the client. Timeout-based error handling is up to the application. (But see the Todo list below)

server = qrpc.createServer( [options][, callback(socket)] )

Create a new server. Returns the QrpcServer object.

The callback, if specified, will be invoked on every connection to the rpc server with the connected socket. This makes it possible for the server to tune the socket settings.

Options:

  • json_encode - the object serializer function to use (default JSON.stringify)
  • json_decode - the object deserializer to use (default JSON.parse)

server.addHandler( handlerName, handlerFunction(req, res, next) )

Define the code that will process calls of type handlerName.

A handler receives 3 parameters just like a middleware stack function: the call object (req), the response object (res), and a callback next that can be used to return errors and/or data.

The req object has a field .m that contains the object passed to the call, if any, and a field .id that is the unique caller-side id of the call.

The res object has methods write(data [,cb]) and end([data] [,cb]) that reply to the caller with the provided data. Each reply will be delivered to the client callback. Buffers are sent and received as base64 Buffers, other objects as JSON serialized strings. End() will send the reply, if any, then close the call. After the call is closed, no more replies can be sent.

server.addHandlerNoResponse( handlerName, handlerFunction(req, res) )

Define the code that will handle messages of type handlerName.

NoResponse handlers are endpoints for one-way messages, not full RPC calls. They process messages, but do not return a response to the caller. One-way message passing is a much more efficient way to push data for eg reporting or stats delivery. The handler function should not declare the next argument as a reminder that no response will be returned to the caller. A no-op next function is passed to the handler, though, just in case.

Calling endpoints is one-way message passing: the data will be acted on by the server but no acknowledgement, status or error is returned. This offers a very fast pipelined path to ship data.

NOTE: The client must not pass a callback function when calling a message endoint, because this would leak memory. Endpoint calls will never be closed by the sender, so the callback context would never be freed.

server.listen( port, [whenListening()] )

Start listening for calls. Incoming calls will invoke the appropritae handlers.

If the whenListening callback is provided, it will be invoked once the server is listening for incoming calls.

server.close( )

Stop listening for calls.

var server = qrpc.createServer()
server.addHandler('echo', function(req, res, next) {
    // echo server, return our arguments
    next(null, req.m)
})
server.listen(1337)

server.wrap( object [,methods] [,options] )

Make the methods of the object callable by rpc. Adds handlers for the methods whose names are in the methods array (all function properties by default). The handlers will expect the method arguments list in req.m and will return a list with all arguments returned by the method callback.

To wrap functions, pass an object with the functions as named properties.

Options:

  • prefix - build the rpc handler name by prepending prefix to the method name. The default is just the method name.

Qrpc Client

The client makes calls to the server, and consumes the responses. A single request can result in more than one response; qrpc sends all requests and responses over a single socket (multiplexes) and steers each response to its correct destination.

client = qrpc.connect( port|options, [host,] whenConnected(clientSocket) )

Connect to the qrpc server listening on host:port (or 'localhost':port if host is not specified). Returns the QrpcClient object.

If provided, the newly created net.socket will be passed to the whenConnected callback for socket configuration and tuning.

Once connected, calls may be made with client.call()

Instead of port (or port and host) an options object can be passed which is then passed to net.connect().

Options:

  • port - port to connect to (required, no default)
  • host - host to connect to (default 'localhost')
  • plus all other valid net.connect() options

In addition, QrpcClient recognizes:

  • json_encode - the object serializer function to use (default JSON.stringify)
  • json_decode - the object deserializer to use (default JSON.parse)

client.call( handlerName, [data,] [callback(err, replyData)] )

Invoke the handler named handlerName, and return the server reply via the callback. Handlers are registered on the server with addHandler(). Data is optional; if any data is specified, it is passed in the call to the server in req.m unless it is a Buffer, which is passed in req.b.

Omitting the callback sends a one-way message to the server. Any response received from the server will be discarded.

Note: Buffers maybe be sent only standalone. Sending an object that has as a Buffer as a property will arrive as a JSON.stringified Buffer stringand not as an instanceof Buffer.

client.close( )

Disconnect from the qrpc server. Any subsequent calls will return a "write after end" error to their callback.

client = qrpc.connect(1337, 'localhost', function whenConnected() {
    client.call('echo', {i: 123, t: 'test'}, function(err, ret) {
        console.log("echo =>", err, ret)
    })
}

// produces "echo => null, { i: 123, t: 'test' }"

client.wrap( object [,methods] [,options] )

Return an object with methods names as in the methods array (or all function properties on object) that will invoke the corresponding handlers on the qrpc server.

To wrap functions, pass an object with the functions as named properties.

Options:

  • prefix - build the rpc handlerName by prepending prefix to the method name

Under The Hood

Qrpc is implemented as streaming message passing with call multiplexing. Each message is a newline terminated string sent to the server. Messages are batched for efficiency, and arrive at the server in large chunks. The server splits the stream, decodes each message, and invokes each handler.

Messages are sent to named handlers; the handlers are registered with the server using the addHandler method. Every handler takes the same arguments, a middleware stack-like (req, res, next): the request bundle (the message itself), the response object, and a callback that can return a response and is used to return errors.

The response object can be used to send replies back to the caller. The write method sends a reply message and keeps the call open for more replies later. The end method sends an optional final reply and closes the call. End without an argument just closes the call by sending a special empty message back to the client. Calling the handler callback cb(err, data) is equivalent to calling end(data), except it will return the error too, if any. A single call can result in more than once response; coordinating the responses is up to the application.

Responses arrive at the caller in the same newline terminated text format, also in batches, over the same bidirectional connection. The calling library splits the batches, decodes the responses, demultiplexes the replies to match them to their originating call, and invokes the call's callback function with the error and data from reply message.

Calls are multiplexed, and may complete out of order. Each message is tagged with a unique call id to identify which call's callback is to process the reply.

The RPC service is implemented using the QrpcServer and QrpcClient classes. They communicate over any bidirectional EventEmitter stream or object with a write() method. A customized RPC can be built over non-socket non-socket streams, which is how the unit tests work.

To build an rpc service using QrpcServer and QrpcClient on top of net sockets the way qrpc does:

// create qrpc server
var server = new qrpc.QrpcServer()
var netServer = net.createServer(function(socket) {
    // have the server read rpc calls from the first arg
    // and write responses to the second arg
    server.setSource(socket, socket)
})
server.setListenFunc(function(port, cb){ netServer.listen(port, cb) })
server.setCloseFunc(function(){ netServer.close() })


// create qrpc client to talk to the server
var client = new qrpc.QrpcClient()
var socket = net.connect(1337, 'localhost')
client.setTarget(socket, socket)

Message Format

Qrpc requests and responses are sent as simple serialized json objects, one per line:

{
    v: 1,               // protocol version, 1: json bundle
    id: id,             // unique call id to match replies to calls
    n: name,            // call name string, in request only
    e: error            // returned error, in response only
    s: status           // response status, one of
                        //   'ok' (on write()),
                        //   'end' (on end()),
                        //   'err' (server error; means end)
    m: message          // object payload, in call or response
    b: blob             // base64 encoded buffer payload
}

Related Work

  • qrpc - 60k calls / sec round-trip, 1m messages / sec dispatched
  • rpc-stream - 16k calls / sec
  • dnode - 14k calls / sec light load, throughput drops sharply with load
  • fast - 12k calls / sec
  • X mrpc - npm install failed (C++ compile errors)
  • X kamote - hangs on concurrent calls (v0.0.2)
  • X fast-rpc - just a placeholder since 2013 (v0.0.0)

Todo

  • more unit tests
  • server should periodically yield to the event loop
  • support call timeouts for more convenient error detection and cleanup (client-side and server-side both) Make server track running calls with timestamp, set done calls = undefined, periodically copy over object to gc properties, only the "olds" object could have timed-out calls.
  • think about how to gc or time out callbacks that have been abandoned by the server (call not closed)
  • maybe make the the client and server pipable event emitters
  • provide a client.send() method to send to an endpoint without a callback
  • ? allow pre- and post-handler functions to be registered, for shared processing (eg for authentication and stats logging)
  • ? allow multi-step handlers (ie an array of functions, each taking req-res-next)
  • option to auto-reconnect if connection drops
  • way to test whether connection was dropped since last test (counter? or connection id?) to check whether datagrams got there ok
  • Use Cases readme section to discuss rpc, send datagrams to server (addHandlerNoResponse), data retrieval (multiple replies from server)
  • support qrpc.* out-of-band calls for client-server metadata, setup/config/etc
  • support request cancellation (qrpc.cancel(id) call, processed asap out-of-band)

About

quicker nodejs remote procedure call

Resources

License

Stars

Watchers

Forks

Packages

No packages published