Message brokers

04 Oct 2017

When microservice architectures become bigger, it’s important to think about the scalability of the system. When the processing power of the architecture isn’t sufficient, it can be scaled up vertically by adding more RAM or Faster CPU’s. This vertical approach is relatively easy because we don’t have to change our code. At some point the limitations of vertical scaling are met, and at this time, it’s time to think about horizontal scaling.

Another disadvantage is downtime caused by vertical scaling.

Horizontal scaling is achieved by adding more servers to perform computations. Horizontal scaling is considerably more difficult than vertical scaling because the architecture must allow horizontal scaling1.

In order to achieve horizontal scaling, microservices architectures are often deployed which make scaling easier2. Indirect messaging is used to ensure that services are decoupled and integration patterns are used in order to achieve horizontal scalability.

In modern microservice based architectures, messaging between services is proxied through message brokers. In this article we will investigate the advantages, differences and need for different messaging platforms.

Message brokers

A message broker is a service deployed within the cluster. This service routes messages between services. Messages sent between services and the broker are often send through an open protocol3, and automatically encoded/decoded by client libraries. Often times libraries are provided for multiple popular programming languages, which negates the need for language uniformity throughout the distributed system.

There are a lot of platforms on the market and in order to choose, knowing the differences is important. Let’s look at some common differences between messaging platforms:

  1. Guarantees: Some message queues guarantee messages are distributed FIFO while others do not. Some queues even guarantee at-once-delivery, at-most-once-delivery or other options.

  2. Persistence: Most message brokers have the ability to temporarily persist messages to disk which makes sure that no messages are lost even if the broker itself fails. Apache Kafka takes this to the extreme with its ability to persist messages indefinitely. This makes Kafka a suitable solution for event sourcing.

  3. Distributed: Some messaging queues have the ability to run multiple broker instances in sync. Distribution eliminates single points of failure, thus increasing reliability.

  4. Cloud: It is possible to let 3ᵈ parties host broker. Services like Amazon SQS and iron.io are available only as cloud platforms, while popular open-source solutions like RabbitMQ have hosted solutions. For most platforms you’ll have to host your own infrastructure.

  5. Protocols: Some queues use standardized protocols, whilst others do not.

  6. Throughput: Some queues are more focused on reliability and features while others prioritize low latency and high throughput.

performance is dependent on a number of factors, for each service we’ll use (relatively): LOW, MEDIUM or HIGH as vectors. Also note that cloud provisioned services have additional network latency.

  1. Pricing: Most messaging solutions are open-source and free to implement, Pricing is mostly relevant for hosted solutions (and how that compares to hosting it yourself).

Platforms

Apache ActiveMQ

ActiveMQ is a mature and popular open source messaging platform. It features client libraries for a wide array of modern programming languages and communicates using open protocols. Messages can be persisted in a relational database via the JDBC tools. ActiveMQ also supports distributed deployment, although limited.

Advantages: stable and proven, client libraries for different languages, open messaging protocols, advanced integration patterns build in, REST api for retrieving performance metrics, open-source

Disadvantages: One of the slower queues, client library is part of JMS specification there is also Spring integration but no mature standalone implementation

Persistence Distributed Cloud Protocols Throughput Price
donevia JDBC done clear OpenWire, Stomp, AMQP, MQTT LOW -

Amazon Simple Queue Service

Amazon’s simple queue service is a hosted queuing solutions which is easy to setup. SQS was the first service added in Amazon’s AWS. The main selling point for SQS is it’s elimination of operational costs of running own hardware.

Advantages: simple to setup, secure, scales elastically, AWS integration, at least and exactly once guarantees

Disadvantages: closed-source, self-hosting is not possible, not very feature rich

Persistence Distributed Cloud Protocols Throughput Price
clear - done HTTPS, TLS MEDIUM ~$.40-$.50 per million requests.

Apache Apollo

Apache Apollo is a message queue which was build from the foundations of Apache ActiveMQ. Apollo was made with performance and easy-of-use in mind. Apollo scales exceptionally well on multi-core systems. It can communicate in a wide variety of open protocols and has modern web/REST support.

Advantages: fast in multi-core setups, open protocols, open-source, REST endpoints

Disadvantages: lack of popularity, green-fields technology

Persistence Distributed Cloud Protocols Throughput Price
done done clear OpenWire, Stomp, AMQP, MQTT HIGH -

Beanstalkd

Beanstalkd is a lightweight messaging queue which uses its own protocol for message delivery. It has clients for most popular languages, and features disk persistence via binlog files (optional). Beanstalkd sees each running instance as only one active job queue, but can uses so called tubes to support multiple job queues.

Advantages: lightweight, fast, easy to configure

Disadvantages: Almost no features, Clients and monitoring tools are all third party.

Persistence Distributed Cloud Protocols Throughput Price
done via Binlog clear clear HTTP HIGH -

Iron MQ

Iron MQ is a cloud provider that also delivers queue services. The business model is exactly the same as with Amazon SQS, but the queue server is more feature rich.

Iron MQ sends messages via HTTP in JSON format and handles authentication. Library wrappers are available for Go, Ruby, .NET, Java, PHP, Python an JavaScript(Node). It features: monitoring via a web dashboard, high availability and data persistence.

Advantages: Libraries in some popular languages, Different types of queue’s (error queues) Disadvantages: Very expensive, latency introduced by being a cloud technology

Persistence Distributed Cloud Protocols Throughput Price
done - done proprietary MEDIUM at least $24.99/wk

RabbitMQ

RabbitMQ is made by Pivotal and is a very popular open-source message broker. RabbitMQ runs on the Erlang VM, and is programmed using the actor model. RabbitMQ is over 10 years old and is considered to be very stable and feature rich.

RabbitMQ provides libraries for most modern languages. It also sends messages via AMQP which is one of the open standards, other open protocols are also available.

Advantages: Different guarantee and routing strategies to choose from, can be distributed, open messaging standards, management UI and tools

Disadvantages: apparently not suited for real-time messaging4, not the fastest tool on the market

Persistence Distributed Cloud Protocols Throughput Price
clear done clear AMQP, STOMP, MQTT, HTTP MEDIUM FREE or variable prices in the cloud

Apache Kafka

Apache Kafka calls itself a distributed streaming platform. The model used by Kafka is slightly different than other message brokers, but still very feasible to use as one.

Normal message brokers let you choose either a queuing or publish-subscribe models, Apache Kafka combines both5, levering advantages of both. This ability makes it possible to create very resilient systems (through publish-subscribe) that also scale well (through queuing).

Apache Kafka defines one or more brokers as a cluster. It’s possible that Kafka sends acknowledgments to publishers once messages are successfully replicated throughout the cluster. This ensures that messages are not lost when a Kafka node dies. New nodes can be easily added to the cluster and load balancing is abstracted away from the developer.

Advantages: combines queuing and publish-subscribe, extremely fast, distributed

Disadvantages: Only Java API’s officially available (unofficial libraries in more language), adds complexity if you don’t need it’s features.

Persistence Distributed Cloud Protocols Throughput Price
done done clear proprietary HIGH -

ZeroMQ

ZeroMQ isn’t really a broker system, but is still relevant on our list. ZeroMQ can be seen as a library that layers on top of standard TCP sockets. Additional features provided by ZeroMQ are things like: request/response (with load balancing), publish-subscribe, and more.

advantages: extremely fast, handles receiving and sending messages asynchronously, doesn’t introduce a single point of failure (broker), open-source, multi-platform, very simple API

disadvantages: considerably more effort to implement, integration patterns should be implemented as part of the application, otherwise it’s just direct communication.

In the last article we looked at the disadvantages of direct communication.

Persistence Distributed Cloud Protocols Throughput Price
? ? clear INPROC, IPC, MULTICAST, TCP HIGH -

Conclusion

In this article we just looked at a few of the options available. When looking for a messaging platform it’s important to think about the requirements and choose accordingly. This is the end of these blog posts about communication technologies to use for microservice systems.


  1. Scaling 

  2. Because microservices separate smaller software components from each other, scaling these smaller parts is easier. This improved granularity enables us to more precisely scale specific parts of the application. 

  3. Like STOMP and AMQP 

  4. see 

  5. More information can be found here 


About the author

profile_pic

My name is Thomas Kleinendorst. I'm currently doing my internship at Bottomline where I'm investigating Reactive technologies and how they fit in their new architecture.