Erlang Solutions were kind enough to invite me to speak at their user group last month. The talk was pretty high level and goes into some of the problems we wanted to solve, technology choices and discusses our new time series database - Dalmatiner DB.
We've started to use a new time-series database called Dalmatiner DB which relies on the filesystem for features like compression and consistency checking. Initially this database was written for SmartOS which has native ZFS support but given the maturity of ZFS on Linux we thought we'd run our setup on Linux.
Monitoring at scale is a hard task so we often get asked by people what our architecture looks like. The reality is that it's constantly changing over time. This blog aims to capture our current design based upon what we've learnt to date. It may all be different given another year. To provide some background we initially started Dataloop.IO just under 18 months ago. Before then we had all been involved in creating SaaS products at various companies where monitoring and deployments were always a large part of our job.
Create a monitoring system that dev and ops can easily setup without intimate knowledge of a monitoring product, supportive of organisational shifts to micro services where people who may want to add a monitoring check might not even know what Nagios is. Agents should pop up and collect config automatically, and deregister gracefully when environments shutdown. The agent should run Nagios scripts written in any language. We’re aiming for dropbox level of simplicity!