Publish Subscribe Model

Publish/Subscribe is a useful model which, although it is not new, is receiving increasing interest and attention.

"Publish and Subscribe is a well-established communications paradigm that allows any number of publishers to communicate with any number of subscribers asynchronously and anonymously via an event channel." -- DavidHoulding page088 DrDobbsJournal July 2000


Some are familiar with the concept of a "list-server", where a group of users subscribe to a particular topic from a given server and are notified and updated via email. It is a form of MultiCasting? which emulates what we see in the broadcast of television and radio programs to potentially large audiences. Whose desired effect is to deliver high value information to a mass-audience. This is precisely what the PublishSubscribeModel seeks to do. The PublishSubscribeModel lies at the core of EventDrivenInformationProcessing?. This model is used extensively in embedded systems where components need to respond in real time to events on a list. (See example, below.)

Pub/sub is handy for decoupling parts of a system, too; LindaLanguage, for example, is a sort of pub/sub. Spread, XmlBlaster, and mod_pubsub all provide pub/sub.


Use in Embedded Systems

Publish/Subscribe is very powerful in event driven embedded technologies. Suppose a system has an EventHandler that wants to send out Notification X whenever this event occurs. You have a priority component, Component A, which is there almost all the time and has to see this message right away. You also have a bunch of dynamic secondary components (B, C, and D) which may be there or not. They all have to receive the message sooner or later. Let's look at some possible scenarios:

Nothing is very time critical. Publisher sends out Note X to A, B, C, D... ad nauseum. B, C, and D all begin reacting to X. In the mean time, A gets all panicky and generates event Y. Notification Y (effectively a Cancel X) goes out to B, C, et al, and these components all roll their eyes, mutter something about these kids not being able to make up their minds, and toss the work they did to process the X note. Done.

Some things are time critical. X goes out as a conditional X. This means that a follow-up message will come out within a certain maximum time to confirm or cancel the X event. A gets a chance to chew on the X note and then generate a Confirm or Cancel. B, C, and D can all get their house in order in case this is the real thing. If they get a Confirm follow-up they complete the processing of X. Otherwise they get a Cancel and they can drop the whole thing. That finishes that.

Time critical and dynamic. The worst case is where A is dynamic as well as the rest of the subscribers and time is critical. Without A in the picture Note X just goes out to whoever else is there. When A comes into the picture it has to register with the X event handler. Then the X handler sends a conditional X notification. The business with the Confirm or Cancel still applies.


Publish and subscribe should be used very carefully. There are some problems in distributed systems:

Incomplete subscriber list. When you publish, are you publishing to everyone that is supposed to get the message? This is a system start/failover issue. Different nodes come up at different times, which means they will register at different times. Data can be lost.

Fast publisher, slow subscriber. Publish is a push technology, which means a chatty sender can overload the system. This may also cause memory and network problems (drops). Message order. Message order is often important. If the publisher maintains order, then how does it deal with a situation where one or more consumers have drops? The entire cohort will be held back waiting for retransmission. Distributed systems (of any kind, even multiple threads using the same processor but operating at different dispatch priorities) can have the kinds of problems listed here (and many more beside).


Asked in DeclarativeGui: What is a "subscription"?

A subscription is a request for a callback upon update to some state. Doing so allows low-latency update processing without the bandwidth and processing costs associated with polling. (Polling has a periodic query+reply cost, and might often reply that there has been no change, whereas subscription has a one-time subscribe+unsubscribe cost and replies with the current state only after at least one change. Polling and subscriptions are equally subject to powerful exponential-strength optimizations via caching at network boundaries.) The trade-off is complexity of subscription management, and slightly bulkier state management (due to maintaining lists of subscriptions for each cell). For very large state values, subscriptions can be optimized by just returning the deltas from the last callback - a compression mechanism DataDeltaIsolation. In general one may also subscribe to events that might never be committed to state, but that possibility isn't necessary - or even desirable - in this particular context.

As a note: the reason for rejecting event subscriptions in the context of keeping document definitions up-to-date for DeclarativeGui (or any document model, really) is that event subscriptions are not robust, they cannot be regenerated after partial network failure or disruption, even one missed event can be a big problem. They also resist ZoomableUserInterface where subscriptions are disabled for objects you aren't looking at any longer. This is actually true in general: event subscriptions (e.g. to signals, or to particular changes in state) aren't robust, whereas state subscriptions can simply be regenerated from scratch at any time and thus are very robust. It's worth keeping in mind when designing DistributedSystems atop PublishSubscribeModel.


CategoryPattern


EditText of this page (last edited May 14, 2009) or FindPage with title or text search