Unraveling Time Series Data for Root Cause Analysis (RCA) | Ranjan Kumar

I’m sure many of you can relate to the feeling of being overwhelmed by a sea of data, especially when it comes to time series data. I recently came across a Reddit post from a user who’s struggling to make sense of their data for a Root Cause Analysis (RCA) system. They’re tasked with building a system that can identify the cause of any apparent issue or degradation in a microservices app, but they’re not sure where to start.

The Data Conundrum

The data is comprised of logs, metrics, and traces from the microservices app, along with some cluster info. The data is scattered across multiple folders, each containing files with injection time, logs, and metrics data. The folders are named after the type of issue, such as ‘carts_cpu’, but the data itself doesn’t provide a clear label for the issue.

The Challenge

The user is struggling to choose an appropriate algorithm for their RCA system. They want a model that can understand the causal relationships in the data, rather than just reading the data row by row. The problem is that the data spikes and then returns to normal, making it difficult to identify a clear pattern or label.

A Possible Approach

One possible approach to this problem is to use a time series analysis algorithm that can handle causal relationships. One such algorithm is the Vector Autoregression (VAR) model, which can model the relationships between multiple time series variables.

Another approach could be to use a machine learning algorithm that’s specifically designed for time series data, such as the Prophet algorithm or the LSTM algorithm.

The Importance of Data Preprocessing

Before diving into any algorithm, it’s essential to preprocess the data to ensure it’s clean and consistent. This may involve handling missing values, normalizing the data, and transforming the data into a suitable format for the chosen algorithm.

Final Thoughts

Building an RCA system can be a complex task, especially when dealing with time series data. However, by choosing the right algorithm and preprocessing the data correctly, it’s possible to uncover the causal relationships in the data and build a reliable RCA system.

If you’re facing a similar challenge, I hope this post has been helpful. Remember to take your time, and don’t be afraid to ask for help along the way.

The Data Conundrum

The Challenge

A Possible Approach

The Importance of Data Preprocessing

Final Thoughts

Leave a Comment Cancel Reply