Increased error response rate

Resolved · Partial outage

We have implemented safeguards to prevent a recurrence of yesterday's cluster issues. We're closing this incident and will keep an eye on the metrics.

Wed, Mar 4, 2026, 08:50 AM


Affected components

Mar 3, 2026, 09:44 AM to 09:54 AM

Admin Dashboard

AI Agents

Public API & MCP

Customer Portal

Updates

Resolved

We have implemented safeguards to prevent a recurrence of yesterday's cluster issues. We're closing this incident and will keep an eye on the metrics.

Wed, Mar 4, 2026, 08:50 AM

Monitoring

The issue affecting our Aurora cluster is fixed and jobs have caught up. We are still investigating why the cluster behaved this way so that we can prevent it from happening again.

Tue, Mar 3, 2026, 09:54 AM (22 hours earlier)

Identified

Our jobs are piling up due to an issue with our infrastructure. We are implementing countermeasures.

Tue, Mar 3, 2026, 09:50 AM

Investigating

We are investigating an issue affecting our application: we are seeing an increase in error rates and timeouts.

Tue, Mar 3, 2026, 09:44 AM