Summary

I care more about building things right than building things fast. I research before I code, I understand before I implement, and I don't move on until it's something I'd be proud to show. I'm drawn to backend and systems because I like problems that have answers—not opinions.

Open Source

mem0ai/mem0 (53K+★) — Memory Layer for AI Agents
  • Fixed critical graph memory KeyError cascade affecting multi-agent systems. Implemented defensive handling across memory search pipeline, preventing production crashes that broke session continuity (#3524)
  • Resolved AWS authentication serialization flaw in enterprise deployments. Designed safe fallback mechanism ensuring graceful degradation under network failures (#3544)
  • Fixed PyPI compliance (PEP 639) enabling 1000+ dependency managers to resolve package correctly. Demonstrates ability to work in large established codebases with peer review rigor (#3522)

Featured Projects

  • TokBurn — Developer Tooling · github.com/lsvishaal/tokburn
    Built and published CLI tool that parses Claude Code's local JSONL session files to surface token waste patterns and cost attribution via a FastAPI dashboard. Zero external API calls — fully local.
    • Launch post hit 67K+ views on r/ClaudeCode; featured in Awesome Claude Tools. Community user analyzed 1,765 sessions revealing $5,209 equivalent API cost — the data story drove adoption.
    • Handles edge cases in Claude Code's session format across versions; published on PyPI (pip install tokburn). Zero external API calls — fully local analysis.
    Python · FastAPI · PyPI · Async Python
  • Document Intelligence System — RAG Pipeline · github.com/lsvishaal/constructure-rag
    Built production RAG pipeline for construction document Q&A with source citations.
    • Designed hybrid retrieval system (vector + BM25 + Reciprocal Rank Fusion) to balance recall vs latency. Handles exact code/identifier matching that pure semantic search misses. Hybrid retrieval completes <100ms; full Q&A responses 5-10 seconds with local LLM inference
    • Optimized ingestion with structure-aware chunking (300 words, 75-word overlap) achieving 30 pages/sec throughput. Reduced multi-document processing from hours to minutes, enabling practical production deployment
    • Production-hardened with LRU query caching (1hr TTL), Docker Compose deployment, structured logging, and comprehensive test suite. Cached queries return in <10ms
    FastAPI · Qdrant · Ollama (Qwen 2.5 7B) · PyMuPDF · BGE-small-v1.5
  • Multi-Agent Code Review System — LLM Orchestration · github.com/lsvishaal/Lyzer_PR_Review
    Automated code review using LLM orchestration to catch issues humans miss in initial reviews.
    • Designed 4-agent architecture (logic, readability, performance, security) with specialized responsibilities and parallel execution. Each agent independently analyzes code changes, with deduplication and severity-based aggregation to consolidate findings across domains
    • Achieved high-precision security analysis using specialized prompts grounded in OWASP Top 10 and CWE standards. Validated on real-world GitHub PRs with structured output enabling human-in-the-loop review workflows
    • Integrated with GitHub API for live PR reviews and standalone offline analysis via raw diff input. Supports multiple programming languages with automatic language detection and binary file filtering
    • Production-grade observability: Structured JSON logging with request tracing, Prometheus metrics (request rate, latency, error tracking), Grafana dashboards, Docker Compose orchestration across 5 services with health checks and non-root execution. 60+ unit and integration tests covering agents, diff parsing, GitHub API client, and endpoints
    FastAPI · LLM Agents (Ollama) · Pydantic · GitHub API · Prometheus · Docker · Async Python
  • AstroHomes — Real Estate Platform (Live) · astrohomes.in
    Built production platform for property management startup, enabling non-technical client to manage 50+ listings without code changes.
    • Designed JSON-driven CMS enabling non-technical client to manage 50+ properties independently. Reduced update cycle from 30 min to 6 min (80% faster), eliminating scaling bottleneck
    • Implemented responsive design with accessibility compliance (WCAG 2.1 AA). 95+ Lighthouse scores, ensuring usability on rural connections and older devices
    • Built analytics dashboard tracking performance, traffic, and conversions. Client used insights to optimize portfolio focus on top revenue-driving properties
    Next.js · TypeScript · Tailwind CSS · React Query · Vercel

Technical Skills

Backend & APIs
Python 3.12, FastAPI, async/await, REST API design, Pydantic, type hints
AI/ML Systems
RAG pipelines, vector databases (Qdrant), LLM orchestration (Ollama), semantic chunking, hybrid retrieval (BM25 + RRF)
Infrastructure
Docker, Docker Compose, PostgreSQL, MongoDB, structured logging, Prometheus, GitHub Actions
Testing
pytest, integration testing, 90%+ coverage practice

Education

B.Tech CSE, SRM Institute of Science and Technology (Expected May 2026)