Serendipitous events

Podcast host Neal Ford and Zhamak Dehghani | Podcast guest Evan Bottcher and Erik Dörnenburg

February 08, 2019 | 17 min 25 sec

Brief summary

We’re increasingly seeing a trend of organizations exposing events — particularly business domain events — before knowing who the consumers are or what the specific applications are, in the hope that people elsewhere in the organization can discover these events and create value, without us directly orchestrating it.

Podcast Transcript

Neal Ford:

Welcome to the Thoughtworks podcast. This is Neal Ford.

Zhamak Dehghani:

And I'm Zhamak Dehghani.

Neal Ford:

And we're here at the TAB face-to-face meeting, and as often happens, we have interesting discussions that come up as we're face-to-face. So this morning, we're joined by a couple of the Doppler members that put together the Technology Radar.

Erik Dörnenburg:

Erik Dörnenburg, the head of technology in Germany.

Evan Bottcher:

Evan Bottcher, tech lead from Melbourne, Australia.

Neal Ford:

And we're going to talk this morning about serendipitous events. So, Zhamak.

Zhamak Dehghani:

We are seeing this emerging trend around the idea of exposing events, particularly business domain events, before knowing who the consumers are or what the specific applications are, and hoping for discovery of those events by other people in the organization, and creation of value using those events without us directly orchestrating it. And I can imagine that this can be quite a heated discussion because it sounds like things that we don't want to do, like creating big upfront architecture. But I want to open the discussion by perhaps giving some examples of seeing this theme in organizations, and look at the pros and cons of it.

Zhamak Dehghani:

To give a small example that we have seen within Thoughtworks: the events that identify the submission of time sheets. We all put in our time allocated to different projects at the end of the week, and the team behind the system that captures that has decided to expose events saying that time sheets have been submitted. And the follow-on effect of that has been that other teams have found value in those events and created other applications consuming them, such as calculating the leave that you have, or the payroll, or defining where the tax allocation for your project might be. So this is a small example, but I'm opening it to our guests to see if they have seen these things in the technology landscape.
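The time sheet example can be sketched roughly as follows. This is only an illustration, not the actual Thoughtworks system: the `TimesheetSubmitted` fields, the in-memory `EventBus`, and the two consumers are all assumptions made for the sketch. The point it shows is that the publisher never names its consumers, so new ones can attach later without any change to the publishing side.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, DefaultDict, List


@dataclass(frozen=True)
class TimesheetSubmitted:
    # Hypothetical fields; the conversation does not describe a schema.
    employee_id: str
    project: str
    hours: float


class EventBus:
    """Tiny in-memory stand-in for an event stream (an assumption)."""

    def __init__(self) -> None:
        self._subscribers: DefaultDict[type, List[Callable]] = defaultdict(list)

    def subscribe(self, event_type: type, handler: Callable) -> None:
        self._subscribers[event_type].append(handler)

    def publish(self, event: object) -> None:
        # The publisher has no knowledge of who is listening.
        for handler in self._subscribers[type(event)]:
            handler(event)


bus = EventBus()

# Two consumers that were not planned when the event was first exposed.
hours_by_project: DefaultDict[str, float] = defaultdict(float)
payroll_queue: List[str] = []


def track_project_hours(event: TimesheetSubmitted) -> None:
    hours_by_project[event.project] += event.hours


def queue_for_payroll(event: TimesheetSubmitted) -> None:
    payroll_queue.append(event.employee_id)


bus.subscribe(TimesheetSubmitted, track_project_hours)
bus.subscribe(TimesheetSubmitted, queue_for_payroll)

bus.publish(TimesheetSubmitted("e1", "radar", 8.0))
bus.publish(TimesheetSubmitted("e2", "radar", 6.5))
```

Adding a third consumer here would be one more `subscribe` call, with no edit to the code that publishes.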

Evan Bottcher:

Well, I think before we talk about where we may have seen it, the reason this is so controversial and so difficult to get a bead on, from my perspective, is that we've for a long time talked about two things. One is emergent design, which is creating software and architecture and design in response to a genuine need, without building a lot of material upfront, a lot of design or a lot of the architecture from the bottom up, hoping for reuse later. And the other is this theme of use before reuse, where we want to see the usage of a particular feature, or piece of software, or data, or whatever it is, in context, delivering some piece of value.

Evan Bottcher:

And then we would only introduce the reusability as it becomes used in more contexts. So this is quite counter to something that we've promoted for a very long time as a way to reduce the waste that we've observed in the software industry. So it's really hard to tackle when I'm working with organizations and they're saying, "Well, what should our strategy be about exposing our data?" I think it is different in different areas.

Erik Dörnenburg:

Yeah, that's a good point. I'm probably the skeptic in this round. I've seen lots of systems being built that are connected by various means, and of course events are one such way. And I think Evan made an important point too when he talked about data. I think what we often want to do today is take data that is available somewhere and do something with that data in a different context.

Erik Dörnenburg:

And we've seen, with the shift towards an architecture that's based on microservices, and the microservices owning their own data stores, a shift to them locking up the data in them, which is good from an encapsulation perspective. But as a response, we also have data lakes, and here it was relatively clear: there was low effort on the teams to put the data into that data lake, to stream the raw data that they had. And it was so obvious to the consumers that it was raw data that they could have this serendipitous moment where they discover data that they can make use of, with little effort upfront and also with an understanding that you're dumping the raw data in there. And I think that worked really well. Where I'm skeptical about this approach is that, while there may be examples such as the one Zhamak has given us, in the vast majority of cases we wouldn't see those effects, because the events need to serve a certain purpose in the system.

Erik Dörnenburg:

And if you design those events with this in mind, that you may create value later, then there's a huge temptation, which we've seen so many times, for people to try to do the perfect thing. They're not waiting for serendipity; they're trying to anticipate, and they're trying to create these adaptable systems that can adapt to anything, and they will put a lot of effort into this. And this is why I'm worried about it. I'm not saying that serendipity can't happen, but I'm concerned that the cost is higher, or too high actually.

Evan Bottcher:

You referred again to data lake architecture versus traditional data warehousing. One of the big costs in data warehousing, which the data lake tried to address, was the enormous effort in modeling the perfect canonicalized form of the data. And I can see that slippery slope towards teams saying, "Okay, I want to expose this time sheet data. Now I've got to spend a lot of time on modeling this event data because I'm going to publish it into the event stream." Especially with these kinds of long-lived, immutable event streaming technologies that they use, changes to that schema have a cost and an impact on consumers. So there are a lot of forces that will drive a lot more upfront design, I think.

Neal Ford:

Erik brings up this distinction about adaptability, and Rebecca and I talked about this distinction in the Building Evolutionary Architectures book, about adaptable versus evolvable systems, because there's a key difference between those two things. An adaptable system is one where you've tried to anticipate what people are going to need and you build adaptive hooks into it, either feature toggles, or configuration, or some other way to adapt it to this future state that you imagined. But the problem with doing too much of that is you end up with the Eclipse configuration dialog, where you can change and fiddle with all sorts of little configuration parameters.

Neal Ford:

And of course that adds to testing and debugging, and you frequently end up building a lot of things that people don't use or don't use very much, even though it seemed like a really good idea at the time. That's an adaptable system, and that's actually counter to building truly evolvable systems, where you are trying to determine what the real usage of this thing is and keep it very grounded in the day-to-day usage of the system as a whole over time, and not get too speculative about it. And so it's tricky to get real value here. So I think the real question is, when do we think this seems like not a bad idea? Are there some indicators that might lead you to do this more in one system than in another kind of system, for example?

Erik Dörnenburg:

Maybe we can take a page out of the book of API design. I mean, we've now totally corrupted, I guess across the entire industry, the term API. When I say API now, what I mean is HTTP endpoints that are RESTful. They don't even have to be RESTful, just an HTTP endpoint that returns some data, and that creates a platform. And as always, we've advocated on the Technology Radar and elsewhere to have product people behind it.

Erik Dörnenburg:

But here, what is happening is exactly that we are designing this. We're designing it with the anticipation of consumers, and that of course takes the serendipity out of it. But maybe there's a middle ground somewhere, where we're saying that at least we take some ideas from understanding what we are trying to do in essence. But as I said, I'm worried, as Evan said, about the slippery slope in this. Yet I believe we have some learnings in that space, and we are trying, I guess, a different way of achieving the same goal: maybe dissolving all the applications and having this platform that we can, with relatively low cost, experiment on in our enterprises.

Zhamak Dehghani:

Maybe there is a subtlety here around planned serendipity, if there is such a thing, where you're creating an environment that encourages that ecosystem behavior, so that people can easily discover that endpoint and learn about it. There is product thinking behind it in terms of evolution. There was a point that you made, Evan, that might be related to that, about the costs and the investments versus the exploratory nature, like if you want to put something out there for exploration. Does that fit into product thinking, or applying product management techniques, when you build something that's somewhat speculative?

Evan Bottcher:

I think there is. My view is that there is space for some speculation, but it has to be very constrained; you're essentially taking a bet. And we have, for a long time, been so brutal about this because of the waste that we've seen across the industry from lots and lots of predictive planning. If we wind that back a little bit and we say, "Okay, we are thinking of these internal business capabilities as products," then some large proportion of the funding and the budget must go to features that are going to be immediately used. That's going to be a strong recommendation. There is space for some small, or I wouldn't say it's necessarily small, but some controlled or constrained investment in taking bets on things that will create value that's not going to be immediately proven, but there's a very strong caution around it.

Evan Bottcher:

I think it's worth pointing out that there are two dimensions to this in the conversations I'm having in this space now. One is, of the dozen interesting domain events, which of them do we expose? And the other is, given that we would normally recommend encapsulating and hiding as much of the data as possible, how much data do you put in? The data lake architecture really says you do straight raw feeds, the low-cost raw feeds, and then defer the cleansing and modeling of it until later. But how much information do you put in the event? That's tricky as well. So it's two dimensions.

Evan Bottcher:

One example I have that drove some of this behavior was around organizations that have hack days. So it's not a business feature that you're building. They have hack teams that are meant to be very exploratory, and they need increased access to the data and information that these systems have. So if you have an organization that does that quite often, you may generate a need to expose certain information, like your example of time sheet data, that isn't actually a proven business feature.

Erik Dörnenburg:

I want to pick up on Evan's point. I guess you were arguing for slack, or for some time that people have that is not directly going to known features, and I think that is a great idea, and you can view it from both angles. Either say, build something that you don't know the value of right now, or, something I've sometimes seen, give people in product teams the chance to respond to something that isn't on the immediate backlog or part of the immediate piece of work that they're doing. And maybe that is another way of approaching this. I remember a number of years ago, we built a humane registry for developers in an organization to discover what data and what services were in the organization in the first place, and that addresses the discoverability. And without the team having published the event, when you have discovered it, if you are in the organization, you have the chance to go to that team, and in the humane registry you maybe get some example data; you can get copies of real data that is not currently streamed.

Erik Dörnenburg:

And it is very clear this is just here to discover. You can then go to the other team, and if that team is given some time to actually help you, you can probably get that serendipity. If that team is completely swamped, and if it is only measured, as we've unfortunately seen in some organizations, by the immediate output, and if they are told, "You have to focus on that release, you can't help your colleagues," then it won't work. But maybe that is another angle too: whether it's hack days, whether it is time allocated for planned serendipity, or whether there is time to respond to requests from your colleagues. That is probably a key thing that we are landing on here.

Zhamak Dehghani:

Yeah, absolutely. I think that discoverability is a big factor in it, enabling people to find things that they can do useful things with. I have a counterexample of that, where one of our large retail clients was super excited about Kafka in the early days of Kafka. So the initiative started from curiosity about the technology, and then they published essentially all the user interactions with the website as events. It was a big effort to get that out there. But nothing came after that, and one reason for it was that nobody knew about it. Nobody knew about these events, nobody knew about their schemas, they couldn't discover them, they couldn't use them. So people were still going around complaining that they had no insight into how customers were interacting with that website.

Neal Ford:

Well, that's always a challenge in organizations: how do you get reuse of the things that are already there?

Zhamak Dehghani:

Absolutely. And I think discovering is one aspect, and paving the path is another: make it easy to use. As you mentioned, you have sample data, you have example code perhaps.

Neal Ford:

Yeah. Make it easy to be successful using your stuff, and then you're a lot more likely to make it widely used.

Erik Dörnenburg:

And again, here is that fine balance between trying to document it upfront and being there when somebody needs it. I mean, we talked about schemas, and in JSON it is getting a little bit better, but most developers that I know, and I include myself in this, find it easiest to look at some sample data to get a rough understanding of what is in there, rather than at a complex schema document that outlines all the possible edge cases. On the other hand, if you only see current sample data, you may not see everything. This might be a system, like a time sheet system, that sees very different usage on certain days of the week. Or, if you look at investment banks, trading patterns can vary over time, within a week, within periods and so on.

Erik Dörnenburg:

And again here, that's why I'm thinking of some preselected sample data, or maybe... They might have written a stub anyway for their own testing. To be able to get that stub and get data from it could be a good idea. It might be striking a middle ground between having a very abstract, complicated schema and observing what is currently being sent.

Evan Bottcher:

So many of these conversations that we're having in this TAB context come all the way back to a blip that was on the previous Radar, which was product management for internal platforms. It's such an important thing, and I think generally we've kind of under-emphasized it. If we have these internal platforms and business capabilities and business applications, then part of that is understanding the customer and their needs. And what we may be describing in this serendipitous use of events, or other forms of APIs and data, is a customer group that's underrepresented, one that the product manager or product owner for that platform can identify. That gives me a little bit more comfort that you can apply the other product management techniques around measuring the value of what you produce and trying to shorten the feedback cycle. So even if you're building something that doesn't have an immediate need, you can still optimize to get that need validated quickly.

Neal Ford:

I think Evan makes an important point. If you're going to engage in this, you should track and see how well you're doing. Are you actually building things that people eventually end up using, or do you end up building a bunch of things that nobody ever uses?

Evan Bottcher:

And turning the thing off. I mean, if you've published 10 event streams, it's costing you time to maintain those schemas and the infrastructure that they're sitting on. If they've never been used, how long do you keep them running for?
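One way to make that tracking concrete is sketched below. It assumes you record, per stream, the time it was last read by any consumer; the stream names, the `last_read` mapping, and the 90-day threshold are all hypothetical and only illustrate the kind of check being discussed.

```python
from datetime import datetime, timedelta
from typing import Dict, List, Optional


def unused_streams(
    last_read: Dict[str, Optional[datetime]],
    now: datetime,
    max_idle: timedelta,
) -> List[str]:
    """Return streams never read, or not read within max_idle."""
    return sorted(
        name
        for name, read_at in last_read.items()
        if read_at is None or now - read_at > max_idle
    )


# Hypothetical bookkeeping: None means no consumer has ever read the stream.
now = datetime(2019, 2, 8)
last_read = {
    "timesheet-submitted": datetime(2019, 2, 7),
    "page-viewed": None,
    "cart-updated": datetime(2018, 6, 1),
}

candidates = unused_streams(last_read, now, max_idle=timedelta(days=90))
```

With these made-up values, `candidates` holds the two streams that nobody has read recently, which is the list you would review before deciding whether to keep paying for them.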

Erik Dörnenburg:

Maybe, since we are writing the Radar, a good thing to also look at here is the idea of the event stream as a source of truth. That can really help. And I've seen that at clients, where you have the ability, and Kafka does this really well, to actually replay all the events from the beginning. That means if you suddenly become a consumer, you have a very good way of testing what would have happened if you had seen the entire stream. But also, if you are deriving or projecting some state from the events, you have a chance to actually do that. It depends. With the time sheets it probably didn't matter because there was no overall state, so that's okay too. But it is another important enabling technique or technology, if you will, that can help you with that approach.
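The replay idea can be illustrated with a small in-memory log. This is a sketch under stated assumptions, not Kafka's API: `EventLog`, `LeaveTaken`, and `project_leave_taken` are hypothetical names, and a real log would be durable and partitioned. It shows a consumer that arrives late, reads from offset 0, and derives the same state it would have had if it had been listening from the start.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, List, Tuple


@dataclass(frozen=True)
class LeaveTaken:
    # Hypothetical event; not taken from any real system.
    employee_id: str
    days: int


class EventLog:
    """Append-only list standing in for a replayable event stream."""

    def __init__(self) -> None:
        self._events: List[LeaveTaken] = []

    def append(self, event: LeaveTaken) -> int:
        self._events.append(event)
        return len(self._events) - 1  # offset of the appended event

    def read_from(self, offset: int = 0) -> Iterator[Tuple[int, LeaveTaken]]:
        for position in range(offset, len(self._events)):
            yield position, self._events[position]


def project_leave_taken(log: EventLog, from_offset: int = 0) -> Dict[str, int]:
    """Fold events into per-employee totals, starting at from_offset."""
    totals: Dict[str, int] = {}
    for _, event in log.read_from(from_offset):
        totals[event.employee_id] = totals.get(event.employee_id, 0) + event.days
    return totals


log = EventLog()
log.append(LeaveTaken("e1", 2))
log.append(LeaveTaken("e2", 1))
log.append(LeaveTaken("e1", 3))

# A consumer created after all three events can still see the full history.
full_state = project_leave_taken(log)
# A consumer that only starts at the latest event misses earlier state.
partial_state = project_leave_taken(log, from_offset=2)
```

The difference between `full_state` and `partial_state` is why replay matters when the consumer derives state; for events with no accumulated state, starting from the current position loses nothing.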

Neal Ford:

Okay, so thanks everyone for joining us this morning. This, like many of the things we end up talking about for the Radar that don't make it onto the Radar, is something that's a really interesting idea that's current, but it's way too complex to get into two or three sentences. And so this gives us a chance to poke around at some of the nuances, and edges, and facets of some of the interesting things that are happening in the tech world. So thanks, Erik. Thanks, Evan.

Zhamak Dehghani:

Thank you.

Neal Ford:

And Zhamak and I will see you again soon.

Rebecca Parsons:

This is Rebecca Parsons, and on the next edition of the Thoughtworks podcast, Mike Mason and I will be speaking with Jonny Leroy and Zhamak Dehghani about architectural governance. That might sound like a dry topic, but I think you'll find it a fascinating conversation, and we hope you'll listen in.
