Berlioz Monitoring

Full-Stack Observability Solution for Microservices

Complete real-time observability platform combining automatic Python error capture, structured storage, WebSocket broadcasting, AI analysis with Claude (Bedrock), and Next.js 15 dashboard for microservices diagnosis and analysis.

Berlioz Monitoring Preview

Berlioz Monitoring is a full-stack observability solution developed to track, diagnose, and analyze Python microservices errors in real-time. It combines backend instrumentation, PostgreSQL persistence, real-time streaming via WebSocket, and intelligent AI analysis to contextualize and quickly resolve issues.

Core Features

๐Ÿ“Š

Real-Time Monitoring

Live stream of microservices errors with instant updates via WebSocket. Global paginated view of all logs with sorting and filtering by date, type, status, microservice, and severity.

๐Ÿ“ˆ

Analytics & Metrics

System health indicators: total error count, top affected microservices, top exception types, HTTP code distribution, temporal trends with automatic statistical anomaly detection.

๐Ÿค–

Integrated AI Analysis

Automatic AI analysis generation via Claude (Bedrock) with real-time SSE streaming. Error classification (Python type, functional category, severity level) and root cause analysis (RCA) based on stack trace and local variables.

๐Ÿ”

Advanced Debugging

Complete visualization of Python stack traces (up to 18 frames), local variable inspection, environment context (middleware, scope, sender), and automatic error categorization (IntegrityError, ImportError, etc.).

๐Ÿ”Ž

Filtering & Search

Advanced filtering system by date, microservice, error type, status (resolved/unresolved), severity. Text search in logs and stack traces to quickly locate specific issues.

โš™๏ธ

Microservices Management

Complete list of microservices (iSearch, iAssist, iSummarize, File Manager, etc.) with independent analysis: error rates, average severity, frequency, trends per service.

๐Ÿ””

Notifications & Alerts

Automatic alert system via Email and Slack for critical events. Statistical anomaly detection (e.g., +900% errors) with real-time notifications for immediate triage.

๐ŸŽจ

Visual Themes

Dark/light mode support with dynamic adaptation of chart colors and contrasts. Professional dark theme (amber/orange palette) and clean light theme for optimal visual experience.

Interface Showcase

Berlioz Monitoring Dashboard 1

View

Berlioz Monitoring Dashboard 2

View

Berlioz Monitoring Dashboard 3

View

Berlioz Monitoring Dashboard 4

View

Berlioz Monitoring Dashboard 5

View

Technical Architecture

FrontendBackendDatabaseInfrastructureAI Integration

Detailed Modules

๐Ÿ“‹Log Management

Unified ingestion via Celery/Redis with structured payloads and error UUIDs. Persistent logs API with pagination, text search, and advanced filters. Detailed view with complete stack trace and local variables.

๐Ÿ“ŠAdmin Dashboard

Centralized dashboard with global log view, interactive analytics charts (histograms, pie charts, bar charts), trend indicators, and real-time system health metrics.

๐Ÿ”งMicroservices View

Independent analysis per microservice: error rates, exception type distribution, temporal evolution, identification of most critical services for intervention prioritization.

๐Ÿง AI Insights

Automatic AI analysis generation with SSE streaming directly in modal. Executive summary, categorization (database, HTTP, validation, test), severity level (low, moderate, critical), and recommendations.

๐Ÿ‘ฅSupport Agent Interface

Merges console logs, screenshots, and backend logs for complete diagnosis. User guidance or automatic ticket creation on severe errors for optimized support workflow.

๐Ÿ””Notifications System

Configurable Email and Slack alerts for critical events. Automatic statistical anomaly detection with customizable thresholds. Real-time notifications for immediate triage.

Strategic Objectives

1

Quickly detect and diagnose Python microservices errors via real-time observability and automatic analysis

2

Contextualize errors through AI to understand root cause and accelerate resolution

3

Provide centralized view of system health with advanced analytics and anomaly detection

4

Facilitate error triage and prioritization through intelligent classification and proactive notifications

5

Scale observability for complex microservices ecosystems with high-performance distributed architecture

Sami Ben Hassine โ€“ Tech Lead & Software Architect | Python, FastAPI & Next.js Expert