Skip to content
View arindampaul1993's full-sized avatar
  • Bengaluru

Block or report arindampaul1993

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arindampaul1993/README.md

Arindam Paul

Senior backend engineer (10 years) building production-grade AI systems. I focus on the infrastructure that makes LLM applications reliable at scale — evaluation harnesses, retrieval pipelines, observability, and resilience patterns. The kind of engineering that separates a demo from a product.

AI Engineering Portfolio

Repository What It Demonstrates
ShipIt Capstone: Production AI Readiness ScannerLive Demo
agent-exercises Agentic AI patterns — tool use, multi-step reasoning, ReAct loops, human-in-the-loop, multi-agent orchestration
eval-exercises LLM evaluation — assertion frameworks, scoring rubrics, LLM-as-judge, comparative eval, regression testing, end-to-end pipelines
rag-exercises Production RAG — chunking strategies, BM25, hybrid search, reranking, query transformation, retrieval metrics
observability-exercises LLM observability — token tracking, caching, latency monitoring, streaming, budget controls, structured logging, dashboards
resilience-exercises LLM resilience — retry with backoff, circuit breakers, graceful degradation, hallucination detection, prompt injection defense

Technical Focus

  • LLM Infrastructure: Building the systems around the model — not just calling the API, but making it production-ready
  • Reliability Engineering: Circuit breakers, retry budgets, and cost guards applied to non-deterministic AI systems
  • Evaluation & Quality: Measuring LLM output quality systematically, not vibes-based
  • Retrieval Systems: End-to-end RAG pipelines from chunking to evaluation metrics

Stack

Python · Google Gemini API · pytest · distributed systems · event-driven architecture


ShipIt — my capstone project. Paste LLM code, get a production-readiness scorecard. The app itself uses every pattern it checks for (circuit breaker, caching, token budgets, structured logging, RAG).

Popular repositories Loading

  1. php php Public

    Contains basic PHP programs

    PHP

  2. City-without-Crime City-without-Crime Public

    A digital FIR system

    PHP

  3. springRestApp springRestApp Public

    Java

  4. react-tictactoe-app react-tictactoe-app Public

    A basic react app

    JavaScript

  5. spring-boot-app spring-boot-app Public

    Restful CRUD API using Spring Boot, Mysql, Jpa and Hibernate

    Shell

  6. arindampaul1993 arindampaul1993 Public

    Config files for my GitHub profile.