Portfolio Details

Each project in this portfolio reflects my hands-on experience in solving real-world problems using data science, machine learning, and full-stack development. From predictive models to intelligent web applications, these solutions are built with a focus on scalability, functionality, and impact.

Project Information

  • Project Name: Deputy - Local-First AI Meeting & Interview Copilot
  • Category: Agentic AI Systems & Real-time AI Assistants
  • System Type: Desktop AI Copilot (Electron + Rust + Multi-LLM Orchestration)
  • Github Repo: Link
  • Architecture: Local-first, BYOK-enabled, Multi-provider AI system

Technologies Used

This project is built using Electron with React + TypeScript for the desktop interface, Rust for low-level audio capture and DSP processing, and Node.js for orchestration logic. It integrates multiple AI providers including OpenAI, Gemini, Claude, Groq, and local Ollama models. The system uses SQLite with vector indexing (sqlite-vec) for local memory storage and retrieval, along with streaming STT APIs such as Deepgram, Google STT, Soniox, Azure Speech, and ElevenLabs.

Details of Project

Deputy is a local-first AI copilot designed to assist users during live meetings, interviews, and technical discussions. Unlike traditional note-taking tools, it operates in real time by capturing system and microphone audio, generating live transcripts, and providing contextual AI suggestions directly inside a stealth overlay interface.

The system solves the challenge of thinking, listening, and responding simultaneously by converting live conversation context into actionable intelligence such as suggested answers, follow-up questions, clarifications, and summaries.

It supports dual-channel audio processing, real-time speech-to-text streaming, screenshot-based context understanding, and multi-mode AI reasoning (Answer, Clarify, Follow-up, Code Hint, Brainstorm, Recap). All conversation data is stored locally, enabling privacy-first usage with optional cloud enhancement through BYOK (Bring Your Own Key).

Project Features

  • Real-time dual-channel audio capture (system audio + microphone separation)
  • Live transcription using multi-provider STT (Google, Deepgram, OpenAI, Azure, Soniox, ElevenLabs)
  • AI-powered overlay assistant for instant interview and meeting support
  • Multiple AI modes: Answer, Clarify, Follow-up, Code Hint, Brainstorm, Recap
  • Screenshot understanding for context-aware reasoning during discussions
  • Local-first memory system with SQLite + vector embeddings (RAG-based retrieval)
  • Multi-provider LLM routing (OpenAI, Gemini, Claude, Groq, Ollama)
  • Stealth overlay UI with always-on-top and pass-through interaction mode
  • Secure credential handling with in-memory scrubbing and OS key storage
  • Meeting history dashboard with searchable transcript and export support

Technical Implementation

The system is architected as a multi-layer desktop intelligence engine. The UI layer is built in Electron with React, while the performance-critical audio pipeline is implemented in Rust using N-API bindings for low-latency capture and DSP processing. A central orchestration engine manages transcription streams, AI routing, and context memory generation.

Incoming audio is processed through a dual-stream architecture separating system audio and microphone input. These streams are transcribed in real time using pluggable STT providers with fallback chains and retry logic. The resulting text is fed into a rolling context engine that maintains conversation state and prevents repetition.

A local RAG pipeline chunks and embeds meeting data into SQLite vector storage, enabling semantic retrieval across both active and historical sessions. AI responses are generated through a mode-based intent classifier that dynamically selects response strategies depending on user actions such as answering, clarifying, or summarizing.

The system also includes reliability engineering features such as provider failover, streaming cancellation control, epoch-based summarization for long sessions, and crash-safe queue recovery to ensure uninterrupted real-time performance.