April 2026 Roundup: Proactive RTC Alerts, Deterministic Conversational Graphs, and Intelligent Context Management.

Welcome to the April edition of the VideoSDK Monthly Updates! This month, we have focused on moving from reactive monitoring to proactive reliability and giving AI agents a structured "brain" for complex business logic.

From the launch of RTC Alerts to the new Conversational Graph support and the ContextWindow for smarter memory, April marks a significant leap in our AI and infrastructure capabilities. Let's dive into what we shipped!

Proactive RTC Alerts

In any real-time application, quality is the product. Unlike traditional systems, RTC failures are often subtle degradations - choppy video, breaking audio, or lagging screenshare. To solve this, we are introducing VideoSDK Alerts.

RTC Alerts shift monitoring from passive observation to active detection. Instead of waiting for user complaints, you can now define metric-driven thresholds that trigger instant notifications.

Core Capabilities of RTC Alerts:

  • Metric-Driven Monitoring: Build alerts based on Jitter, Latency, and Packet Loss across Camera or Screen Share streams.
  • Dimensional Filtering: Scope alerts by Region (e.g., APAC, US-East), OS, Browser, or SDK to isolate specific environmental issues.
  • Aggregation Logic: Use Max (worst-case), Average, or Percentiles to define what constitutes a failure.
  • Noise Reduction: Configure Minimum Session Impact (to ignore single-user outliers) and Time-Based Validation (to ignore transient spikes).
  • Historical Validation: Test your alert logic against past data before deployment to ensure accuracy.
  • Multi-Channel Delivery: Direct integration with Slack, PagerDuty, and Webhooks.

▶️ Overview Video


AI Agents: Rigid Control & Large-Scale Automation

Conversational Graphs: Deterministic Workflows

Agents SDK v1.0.6 introduces directed graph support. This allows you to build multi-turn agents where you define the exact transitions between states (e.g., "Greeting" → "Booking" → "Payment"). The engine follows your routing exactly - the LLM handles the conversation, but never makes unauthorized routing decisions.

📄 Read the Conversational Graphs docs

▶️ Watch Introduction Video

Batch Calling: Voice Outreach at Scale

Our new Batch Calling infrastructure allows you to trigger hundreds of AI-powered voice calls simultaneously from the dashboard. This is designed for high-concurrency outbound use cases like automated reminders and large-scale customer surveys.

Video SDK Image

AI-Native Development: Agents MCP Server

We've launched the official Agents MCP Server. This allows you to provide the full, real-time context of our Agents documentation to AI coding assistants like Claude, Cursor, and VS Code. By connecting your AI to our docs, you can generate implementation code that follows our latest v1 standards and patterns automatically.

👉 View the Agents MCP Server Setup Guide


Agent Intelligence: Smarter Memory & Gated Speech

ContextWindow: Intelligent Memory

Agents SDK v1.0.3 introduces the ContextWindow. Instead of "blindly" truncating the oldest messages when token limits are reached, this feature uses history compression and summarization. It preserves the last N turns raw to maintain conversation quality while summarizing older history.

python
from videosdk.agents import ContextWindow

pipeline = Pipeline(
    ....
    context_window=ContextWindow(
        max_tokens=4000,
        max_context_items=20,
        keep_recent_turns=3,          # Keep last 3 turns raw
        max_tool_calls_per_turn=10,   # Prevent infinite loops
    ),
)

Startup Speech Gating

To prevent the user from interrupting an agent's initial greeting, we have implemented Early Speech Gating (v1.0.7). User audio is now gated until the agent has initiated its first say() or reply(), ensuring a professional and clear start to every interaction.

Read full release notes on GitHub


Observability & Metrics

Unified Observability Configuration

You can now pass all observability settings - recording, traces, metrics, and logs - directly at session startup (v1.0.9).

python
await session.start(
   wait_for_participant=True,
   observability=ObservabilityOptions(
       recording=RecordingOptions(video=True),        
       traces=TracesOptions(export_url="https://otlp-endpoint"),
       logs=LoggingOptions(level=["INFO", "DEBUG"]),  
   ),
)

Pipeline & Metrics Hooks

Agents SDK v1.0.7 adds deep visibility hooks. Use @pipeline.metrics.on("stt") or @pipeline.metrics.on("realtime") to capture latency and token usage. You can also now use session.get_context_history to access session data for debugging or post-processing.

Native SDK Agent Metrics

Real-time agent performance data is now exposed directly in the client-side SDKs. Monitor latency, tokens, and component health via:

Hardware-Silence Detection

Available in JS SDK v0.7.2 and React SDK v0.9.1, the audio-input-silence event triggers when a microphone is publishing but no audio signal is detected. This allows you to alert users to hardware mutes or Bluetooth issues immediately.


Interactive Transcripts & Ecosystem

Word-level Timestamps & Interim Transcripts

Agents SDK v1.0.9 adds support for streaming agent transcripts with word-level timestamps (for Cartesia and ElevenLabs). Additionally, you can now enable interim user results to show text on the UI as the user is still speaking.

python
# Opt-in to word-level sync
tts = CartesiaTTS(word_timestamps=True)
# Opt-in to interim user results (v1.0.10)
stt = DeepgramSTT(forward_interim_transcripts=True)

Core SDK Stability & Performance


Dashboard Power-ups

  • Agent Logs: A dedicated tab in session details to view raw execution logs for debugging.
  • Composite Recording: Download a unified view of agent-user interactions in a single MP4.
  • Batch Calling & Alerts: Centralized UI to manage large-scale campaigns and configure proactive quality alerts.

📚 New Content & Resources

Featured Videos


✨ Community Spotlight

SDK Sketches

Video SDK Image
This month's sketch: Mass calling leads and getting only 3-4% of them qualified?....save your time for the important clients by giving the boring calls to agent with batch-calling

Build with April's Updates

Configure RTC Alerts and Batch Calling via the dashboard, or upgrade to Agents SDK v1.0.10 today.

Dashboard Join our Discord

Happy building!
Team VideoSDK