Skip to main content

Tech Stack

Technology Stack

Frontend

  • Framework: Next.js 14 (React 18)
  • UI: Tailwind CSS + shadcn/ui
  • State: Zustand + React Query
  • Real-time: WebSockets (Socket.io)

Backend

  • API: Node.js + Express (REST + GraphQL)
  • Auth: Clerk / Auth0 (SSO support)
  • Payments: Razorpay (India), Stripe (international)
  • Jobs: BullMQ (Redis-based)

Database

  • Primary: PostgreSQL (user data, transactions)
  • Cache: Redis (sessions, leaderboards)
  • Vector: Pinecone / Weaviate (semantic search, job matching)
  • Analytics: ClickHouse (events, usage tracking)

AI/ML

  • LLM APIs: Claude 3.5 Sonnet, GPT-4 (Phase 1)
  • Self-hosted: Llama 3 fine-tuned (Phase 2+)
  • Code Execution: Judge0 / Piston (sandboxed)
  • Embeddings: OpenAI text-embedding-3

Infrastructure

  • Hosting: AWS / GCP (start with AWS)
  • CDN: Cloudflare
  • Monitoring: Datadog / New Relic
  • Logging: ELK Stack

AI Cost Optimization Roadmap

Phase 1 (Month 1-6): API-Based

  • Use Claude 3.5 Sonnet API
  • Cost: ₹1.50-2/question
  • Quick to market, no ML ops

Phase 2 (Month 6-12): Fine-Tuning

  • Fine-tune Llama 3 on expert questions
  • Self-host on AWS EC2/GCP
  • Cost: ₹0.60/question (60% reduction)
  • Requires ML engineer

Phase 3 (Month 12-18): Custom Models

  • Build smaller task-specific models
  • Distill from Llama 3
  • Cost: ₹0.30/question (80% reduction)
  • Requires ML team (2-3 people)

Phase 4 (Month 18+): Edge Optimization

  • Cache common questions/feedback
  • Pre-generate question banks (indexed)
  • Cost: ₹0.10-0.20/question (90% reduction)

Goal: ₹24 crore AI costs → ₹6 crore (18% → 5% of revenue)

Open Source Strategy

Year 1: Closed (Build & Validate)

  • Focus on product-market fit
  • Validate algorithms work
  • Build community trust

Year 2: Open Algorithms

  • IRT/BKT implementations (MIT license)
  • Question generation prompts
  • Dataset of 10K+ questions (Creative Commons)

Year 3: Selective Platform

  • Frontend components (UI library)
  • API SDKs (Python, JavaScript)
  • Self-hosting guides
  • Keep core platform proprietary (non-profit IP)

Why selective:

  • Prevent for-profit clones
  • Maintain mission control
  • Balance transparency with sustainability

Security & Privacy

Data Protection:

  • End-to-end encryption (user data at rest)
  • GDPR/CCPA compliant
  • Regular security audits
  • Bug bounty program (Year 2)

Code Security:

  • Sandboxed code execution (Judge0/Piston)
  • Rate limiting (prevent abuse)
  • DDoS protection (Cloudflare)

Privacy:

  • Minimal data collection
  • User controls (delete data anytime)
  • No selling data (non-profit = no incentive)

Scalability Plan

100K Users (Month 12)

  • 2-3 backend instances
  • PostgreSQL primary + 1 read replica
  • Redis single instance
  • Cost: ₹2-3L/month infra

1M Users (Month 18)

  • 10+ backend instances (auto-scaling)
  • PostgreSQL sharding
  • Redis cluster
  • CDN for static assets
  • Cost: ₹10-15L/month infra

5M Users (Month 24)

  • 50+ backend instances
  • Multi-region deployment (AWS Mumbai + Singapore)
  • Dedicated ML inference servers
  • Cost: ₹40-50L/month infra

Cost per user drops as we scale (economies of scale)