GenAI ChatBot using Bedrock
Case Study: Building a High-Performance GenAI Chatbot for Higher Education Institutions with AWS Bedrock
Client Overview: NoPaperForms
NoPaperForms is a leading unified technology platform for education, enabling universities and institutions to manage admissions, student engagement, and marketing automation. Their chatbot, built on OpenAI APIs, was facing latency issues, prompting them to explore a more efficient solution using AWS Bedrock.
Problem Statement
The existing chatbot struggled with:
- High Latency: OpenAI API responses were slow, affecting user experience.
- Limited Knowledge Base: The chatbot relied on predefined data sources, lacking real-time information updates.
- Scalability Concerns: The system needed to handle a growing number of queries from students, faculty, and parents.
- Inability to Crawl University Data: The chatbot couldn’t dynamically fetch university details from public websites.
Opstree's Solution
Opstree implemented a GenAI chatbot using AWS Bedrock, Anthropic Claude/Sonnet, and AWS Web Crawler to ensure fast, accurate, and scalable knowledge retrieval.
Architecture Overview
Key Components:
- AWS Bedrock: Hosts and runs the Claude/Sonnet LLM model for generating responses.
- Web Crawling Layer: Utilizes AWS Web Crawler to fetch and update knowledge from university websites dynamically.
- Amazon S3: Stores processed knowledge base documents.
- Semantic Parsing & Chunking: Breaks raw text into meaningful sections for better comprehension.
- Streamlit – Provides an interactive UI for querying the knowledge base.
- LiteLLM (LLM Gateway): Optimizes API latency and reduces costs.
- Session Memory & Caching: Tracks conversation history and minimizes redundant API calls.
- Caching Mechanism: Reduces redundant API calls and speeds up responses.
Implementation Breakdown
Phase 1: Setting Up the Knowledge Base
- Web Crawler Deployment
- Configured AWS Bedrock Web Crawler to fetch university data.
- Respected robots.txt in accordance with RFC 9309.
- Limited by AWS Bedrock’s internal constraints on pages crawled and content size.
- Data Processing & Storage 4. Filtered irrelevant content (ads, navigation links). 5. Stored structured data in Amazon S3.
- Semantic Parsing & Chunking
- Applied NLP techniques to segment text into digestible chunks.
- Indexed data for faster retrieval.
Phase 2: AI Model Integration
- Claude/Sonnet Model Deployment
- Integrated AWS Bedrock's Claude & Sonnet models for high-speed response generation.
- Tuned prompt engineering for context-aware replies.
- Latency Reduction Techniques
- Implemented LiteLLM as a gateway to reduce API response time.
- Added caching & session memory to optimize repeat queries.
Phase 3: Performance Optimization & Testing
- Load & Stress Testing
- Simulated high query volume to test scalability.
- Optimized response times from ~7s to ~1.2s.
- Accuracy & Relevance Testing
- Benchmarked Claude/Sonnet responses against OpenAI GPT models.
- Increased query relevance by 35% through synonym expansion & acronym resolution.
- Security & Compliance
- Ensured data privacy with access-controlled S3 storage.
- Encrypted knowledge base to prevent unauthorized access.
Challenges & Solutions
Challenge | Solution |
---|---|
High response time with OpenAI | Moved to Claude/Sonnet on AWS Bedrock with LiteLLM for optimization. |
Data retrieval gaps | Used AWS Web Crawler to dynamically fetch university website data. |
Inconsistent knowledge base updates | Implemented automated crawl scheduling with AWS Lambda triggers. |
Scalability concerns | Leveraged serverless AWS infrastructure for auto-scaling. |
Results & Impact
- Latency reduced from ~7s to ~1.2s (80% improvement).
- Expanded chatbot knowledge base dynamically using web crawling.
- Increased response accuracy by 35% with NLP-based semantic search.
- Reduced API cost by 40% using caching & LiteLLM optimizations.
- Enhanced user engagement with context-aware, real-time responses.
Conclusion & Future Roadmap
By migrating NoPaperForms’ chatbot to AWS Bedrock with Claude/Sonnet, Opstree successfully overachieved the project goals in just 2 weeks. Future improvements include:
- Expanding university coverage through broader web crawling.
- Real-time data updates to keep information current.
- Multilingual support for international students.
With this scalable, AI-driven solution, NoPaperForms can now deliver high-speed, knowledge-rich responses to thousands of education queries, transforming the student experience. 🚀