Internship Report > Workshop > AI Service Architecture > Lambda Functions

Lambda Functions

Overview

The AI Service layer consists of four Lambda functions that power the IELTS learning platform. These functions process requests asynchronously via SQS queues and integrate with Google Gemini API and Amazon Bedrock for AI-powered evaluations.

AI Service Architecture

Lambda Function 1: Writing Evaluator

Evaluates IELTS Writing Task 1 and Task 2 essays using Gemini API with detailed band scoring.

Setting	Value
Function name	`bandup-writing-evaluator`
Runtime	Python 3.11
Memory	1024 MB
Timeout	5 minutes
Trigger	SQS (`bandup-writing-queue`)
AI Model	Google Gemini 2.0 Flash

Core Implementation:

import json
import os
import boto3
import logging
from typing import Dict, Any

logger = logging.getLogger()
logger.setLevel(logging.INFO)

# Import from Lambda layer
from lambda_shared.gemini_client import GeminiClient
from secrets_helper import get_gemini_api_key

def lambda_handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
    """Evaluate IELTS Writing essays using Gemini API."""
    
    # Parse SQS message or API Gateway request
    if is_sqs_event(event):
        request_data, job_id = parse_sqs_message(event)
        update_job_status(job_id, 'processing', 'writing')
    else:
        request_data = json.loads(event.get('body', '{}'))
    
    # Get API key securely from Secrets Manager
    gemini_api_key = get_gemini_api_key()  # Retrieved from AWS Secrets Manager
    gemini_client = GeminiClient(api_key=gemini_api_key)
    
    # Extract request parameters
    user_id = request_data.get('user_id')
    essay_content = request_data.get('essay_content')
    task_type = request_data.get('task_type', 'TASK_2')
    
    # Build evaluation prompt
    prompt = build_writing_prompt(essay_content, task_type)
    
    # Call Gemini API for evaluation
    response = gemini_client.generate_evaluation(
        prompt=prompt,
        feature='writing_task2',
        max_retries=3,
        timeout=60
    )
    
    # Parse and validate band scores
    evaluation = parse_gemini_response(response['content'])
    
    # Build result with IELTS criteria
    result = {
        'session_id': request_data.get('session_id'),
        'overall_band': evaluation.get('overall_band'),
        'task_achievement_band': evaluation['task_achievement']['band'],
        'coherence_band': evaluation['coherence_cohesion']['band'],
        'lexical_band': evaluation['lexical_resource']['band'],
        'grammar_band': evaluation['grammatical_range_accuracy']['band'],
        'feedback': evaluation
    }
    
    # Save to DynamoDB
    dynamodb = boto3.resource('dynamodb')
    table = dynamodb.Table(os.environ.get('DYNAMODB_EVALUATIONS'))
    table.put_item(Item={
        'evaluation_id': result['session_id'],
        'user_id': user_id,
        'evaluation_type': 'writing',
        'status': 'completed',
        **result
    })
    
    return {'statusCode': 200, 'body': json.dumps(result)}

Gemini Prompt Template:

def build_writing_prompt(essay_content: str, task_type: str) -> str:
    return f"""You are an experienced IELTS examiner. Evaluate this essay:

Task Type: {task_type}

ESSAY:
{essay_content}

Evaluate using IELTS band descriptors (1-9, 0.5 increments):
1. Task Achievement - Addresses all parts of task
2. Coherence and Cohesion - Logical organization
3. Lexical Resource - Vocabulary range and accuracy
4. Grammatical Range and Accuracy - Sentence structures

RESPOND IN JSON FORMAT:
{{
  "overall_band": <float>,
  "task_achievement": {{"band": <float>, "feedback": "..."}},
  "coherence_cohesion": {{"band": <float>, "feedback": "..."}},
  "lexical_resource": {{"band": <float>, "feedback": "..."}},
  "grammatical_range_accuracy": {{"band": <float>, "feedback": "..."}},
  "quoted_examples": [{{"quote": "...", "issue": "...", "suggestion": "..."}}]
}}"""

Lambda Function 2: Speaking Evaluator

Evaluates IELTS Speaking using Gemini native audio processing - 72% cheaper and 2x faster than AWS Transcribe alternatives.

Setting	Value
Function name	`bandup-speaking-evaluator`
Runtime	Python 3.11
Memory	2048 MB
Timeout	5 minutes
Trigger	SQS (`bandup-speaking-queue`)
AI Model	Gemini 2.5 Flash (Native Audio)

Core Implementation:

import json
import os
import boto3
import logging
from typing import Dict, Any, Tuple

logger = logging.getLogger()

# Import from Lambda layer
from lambda_shared.gemini_client import GeminiClient
from secrets_helper import get_gemini_api_key

def download_audio_from_s3(audio_url: str) -> Tuple[bytes, str]:
    """Download audio file from S3 and determine MIME type."""
    s3_client = boto3.client('s3')
    
    # Parse S3 URL: s3://bucket-name/path/to/file.mp3
    parts = audio_url.replace('s3://', '').split('/', 1)
    bucket, key = parts[0], parts[1]
    
    response = s3_client.get_object(Bucket=bucket, Key=key)
    audio_bytes = response['Body'].read()
    
    # Determine MIME type from extension
    mime_types = {'.mp3': 'audio/mp3', '.wav': 'audio/wav', '.m4a': 'audio/m4a'}
    ext = '.' + key.split('.')[-1].lower()
    mime_type = mime_types.get(ext, 'audio/mp3')
    
    return audio_bytes, mime_type

def lambda_handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
    """Evaluate IELTS Speaking using Gemini native audio."""
    
    # Parse request
    request_data = parse_request(event)
    
    # Get API key from Secrets Manager
    gemini_api_key = get_gemini_api_key()
    gemini_client = GeminiClient(api_key=gemini_api_key)
    
    # Extract parameters
    audio_url = request_data.get('audio_url')
    part = request_data.get('part', 'PART_1')
    questions = request_data.get('questions', [])
    
    # Step 1: Download audio from S3
    audio_bytes, mime_type = download_audio_from_s3(audio_url)
    logger.info(f"Downloaded {len(audio_bytes)} bytes, MIME: {mime_type}")
    
    # Step 2: Send audio directly to Gemini (ONE API call)
    # No AWS Transcribe needed - Gemini processes audio natively
    evaluation = gemini_client.evaluate_audio(
        audio_bytes=audio_bytes,
        part=part,
        questions=questions,
        mime_type=mime_type,
        max_retries=3,
        timeout=120
    )
    
    # Step 3: Build response with IELTS Speaking criteria
    result = {
        'session_id': request_data.get('session_id'),
        'transcript': evaluation.get('transcript'),
        'duration': evaluation.get('duration_seconds'),
        'overall_band': evaluation.get('overall_band'),
        'fluency_band': evaluation['fluency_coherence']['band'],
        'lexical_band': evaluation['lexical_resource']['band'],
        'grammar_band': evaluation['grammatical_range_accuracy']['band'],
        'pronunciation_band': evaluation['pronunciation']['band'],
        'model_used': 'gemini-2.5-flash-audio',
        'estimated_cost': evaluation['usage']['cost']
    }
    
    # Save to DynamoDB
    save_evaluation(result, request_data.get('user_id'))
    
    return {'statusCode': 200, 'body': json.dumps(result)}

Cost Comparison:

Approach	Cost per 3-min Audio	Latency
Gemini Native Audio	~$0.021	30-45s
AWS Transcribe + LLM	~$0.076	60-90s
Savings	72%	2x faster

Lambda Function 3: Flashcard Generator (RAG)

Generates flashcards from PDF documents using lightweight RAG pipeline with Titan Embeddings (in-memory vector store, optimized for <50MB Lambda package).

Setting	Value
Function name	`bandup-flashcard-generator`
Runtime	Python 3.11
Memory	1024 MB
Timeout	10 minutes
Trigger	SQS (`bandup-flashcard-queue`)
AI Model	Gemini + Amazon Titan Embeddings V2

RAG Pipeline Flow:

┌─────────────┐     ┌──────────────┐     ┌─────────────────┐
│  PDF Upload │ ──▶ │   Chunking   │ ──▶ │ Titan Embeddings│
│     (S3)    │     │ (3000 chars) │     │   (Bedrock)     │
└─────────────┘     └──────────────┘     └────────┬────────┘
                                                  │
                                                  ▼
┌─────────────┐     ┌──────────────┐     ┌─────────────────┐
│  Flashcards │ ◀── │   Gemini     │ ◀── │ In-Memory Store │
│   (JSON)    │     │  Generation  │     │  (Cosine Sim)   │
└─────────────┘     └──────────────┘     └─────────────────┘

Core Implementation:

import json
import os
import boto3
import time
import google.generativeai as genai
from typing import Dict, Any, List
from concurrent.futures import ThreadPoolExecutor, as_completed

logger = logging.getLogger()

# Global instance for warm starts (Lambda optimization)
_rag_instance = None
_s3_client = None

def get_s3_client():
    """Get cached S3 client."""
    global _s3_client
    if _s3_client is None:
        _s3_client = boto3.client('s3')
    return _s3_client

def get_rag_instance(api_key: str):
    """Get cached RAG instance for warm starts."""
    global _rag_instance
    if _rag_instance is None:
        _rag_instance = RAG(
            api_key=api_key,
            chunk_size=int(os.environ.get('RAG_CHUNK_SIZE', '500')),
            chunk_overlap=int(os.environ.get('RAG_CHUNK_OVERLAP', '100'))
        )
        logger.info("Cold start: RAG instance created")
    else:
        logger.info("Warm start: Reusing RAG instance")
    return _rag_instance

def download_pdf_from_s3(bucket: str, key: str) -> str:
    """Download PDF from S3 to /tmp."""
    s3 = get_s3_client()
    local_path = f"/tmp/{key.split('/')[-1]}"
    s3.download_file(bucket, key, local_path)
    return local_path

def lambda_handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
    """RAG-based flashcard generation ."""
    
    start_time = time.time()
    is_async = is_sqs_event(event)
    
    # Parse request
    if is_async:
        request, job_id = parse_sqs_message(event)
        update_job_status(job_id, 'processing')
    else:
        request = json.loads(event.get('body', '{}')) if isinstance(event.get('body'), str) else event
    
    # Get S3 location
    pdf_url = request.get('pdf_url')
    s3_bucket, s3_key = parse_s3_url(pdf_url)
    
    # Get API key from Secrets Manager
    secret_arn = os.environ.get('GEMINI_API_KEY_SECRET_ARN')
    secrets_client = boto3.client('secretsmanager')
    api_key = secrets_client.get_secret_value(SecretId=secret_arn)['SecretString']
    
    # Get parameters
    num_cards = int(request.get('num_cards', 10))
    difficulty = request.get('difficulty', 'MEDIUM')
    question_types = request.get('question_types', ['DEFINITION', 'VOCABULARY', 'COMPREHENSION'])
    
    # Step 1: Download PDF from S3
    local_pdf = download_pdf_from_s3(s3_bucket, s3_key)
    
    # Step 2: Index document with RAG (Titan Embeddings + in-memory store)
    rag = get_rag_instance(api_key)
    rag._vector_store = None  # Reset for new document
    rag._chunks = []
    
    index_result = rag.index_document(local_pdf, document_id=s3_key)
    logger.info(f"Indexed {index_result['chunk_count']} chunks from {index_result['page_count']} pages")
    
    # Step 3: Retrieve relevant chunks (hybrid approach)
    if index_result['chunk_count'] <= 15:
        # Small document: use representative chunks
        chunks = rag.get_representative_chunks(num_chunks=min(10, index_result['chunk_count']))
        retrieval_method = "representative"
    else:
        # Large document: use smart keyword-based queries
        chunks = rag.retrieve_with_smart_queries(top_k_per_query=3)
        retrieval_method = "smart_queries"
    
    # Step 4: Generate flashcards with Gemini
    prompt = generate_flashcards_prompt(chunks, num_cards, difficulty, question_types)
    flashcard_result = call_gemini(prompt, api_key)
    
    # Clean up
    os.remove(local_pdf)
    
    # Build response
    total_time = time.time() - start_time
    response_body = {
        'status': 'success',
        'set_id': request.get('set_id'),
        'user_id': request.get('user_id'),
        'document': {
            's3_bucket': s3_bucket,
            's3_key': s3_key,
            'page_count': index_result['page_count'],
            'chunk_count': index_result['chunk_count']
        },
        'retrieval': {
            'method': retrieval_method,
            'chunks_used': len(chunks),
            'keywords': index_result.get('keywords', [])[:5]
        },
        'flashcards': flashcard_result.get('flashcards', []),
        'total_cards': len(flashcard_result.get('flashcards', [])),
        'metrics': {
            'total_time_ms': round(total_time * 1000)
        }
    }
    
    # Save to DynamoDB (bandup-flashcard-sets table)
    dynamodb = boto3.resource('dynamodb')
    table = dynamodb.Table(os.environ.get('DYNAMODB_FLASHCARD_SETS'))
    table.put_item(Item={
        'set_id': request.get('set_id'),
        'user_id': request.get('user_id'),
        'document_id': s3_key,
        'status': 'completed',
        'flashcards': json.dumps(response_body['flashcards']),
        'total_cards': response_body['total_cards'],
        'page_count': index_result['page_count'],
        'chunk_count': index_result['chunk_count'],
        'created_at': int(time.time())
    })
    
    if is_async:
        return {'statusCode': 200, 'body': 'OK'}
    
    return create_response(200, response_body)

Titan Embeddings with Parallel Processing:

class TitanEmbeddings:
    """Amazon Titan Text Embeddings V2 via Bedrock with parallel processing."""
    
    MODEL_ID = "amazon.titan-embed-text-v2:0"
    
    def __init__(self, region: str = None):
        self.region = region or os.environ.get('BEDROCK_REGION', 'us-east-1')
        self._client = None
    
    @property
    def client(self):
        if self._client is None:
            self._client = boto3.client('bedrock-runtime', region_name=self.region)
        return self._client
    
    def embed(self, text: str) -> List[float]:
        """Get embedding for single text using Titan V2."""
        response = self.client.invoke_model(
            modelId=self.MODEL_ID,
            body=json.dumps({
                "inputText": text[:8000],  # Max input length
                "dimensions": 512,
                "normalize": True
            }),
            contentType="application/json",
            accept="application/json"
        )
        result = json.loads(response['body'].read())
        return result['embedding']
    
    def embed_batch_parallel(self, texts: List[str], max_workers: int = 10) -> List[List[float]]:
        """Embed multiple texts in PARALLEL using ThreadPoolExecutor."""
        embeddings = [None] * len(texts)
        
        with ThreadPoolExecutor(max_workers=max_workers) as executor:
            futures = {executor.submit(self.embed, t): i for i, t in enumerate(texts)}
            for future in as_completed(futures):
                idx = futures[future]
                embeddings[idx] = future.result()
        
        return embeddings

RAG Pipeline (In-Memory):

import math
import fitz  # PyMuPDF

class RAG:
    """Lightweight RAG using Titan Embeddings + in-memory cosine similarity."""
    
    def __init__(self, api_key: str, chunk_size: int = 3000, chunk_overlap: int = 300):
        self.api_key = api_key
        self.chunk_size = chunk_size
        self.chunk_overlap = chunk_overlap
        self._chunks = []
        self._embeddings = []
        self._titan = TitanEmbeddings()
        self._keywords = []
    
    def index_document(self, pdf_path: str, document_id: str = None) -> Dict:
        """Index PDF with Titan V2 embeddings (parallel processing)."""
        # Load PDF pages
        pages = []
        with fitz.open(pdf_path) as doc:
            for page_num, page in enumerate(doc):
                text = page.get_text()
                if text.strip():
                    pages.append({'content': text, 'page': page_num + 1})
        
        # Chunk text with overlap
        self._chunks = []
        for page in pages:
            chunks = self._chunk_text(page['content'])
            for chunk in chunks:
                self._chunks.append({
                    'text': chunk,
                    'page': page['page']
                })
        
        # Extract keywords for smart query generation
        all_text = " ".join([c['text'] for c in self._chunks])
        self._keywords = self._extract_keywords(all_text, top_n=20)
        
        # Generate embeddings in parallel (10 concurrent Bedrock calls)
        texts = [c['text'] for c in self._chunks]
        self._embeddings = self._titan.embed_batch_parallel(texts, max_workers=10)
        
        return {
            'page_count': len(pages),
            'chunk_count': len(self._chunks),
            'keywords': self._keywords[:10]
        }
    
    def _cosine_similarity(self, a: List[float], b: List[float]) -> float:
        """Calculate cosine similarity between two vectors."""
        dot_product = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(x * x for x in b))
        if norm_a == 0 or norm_b == 0:
            return 0.0
        return dot_product / (norm_a * norm_b)
    
    def similarity_search(self, query: str, top_k: int = 5) -> List[Dict]:
        """Search for similar chunks using in-memory cosine similarity."""
        query_embedding = self._titan.embed(query)
        
        # Calculate similarities
        similarities = []
        for i, embedding in enumerate(self._embeddings):
            score = self._cosine_similarity(query_embedding, embedding)
            similarities.append((i, score))
        
        # Sort by similarity (descending) and return top-k
        similarities.sort(key=lambda x: x[1], reverse=True)
        
        results = []
        for rank, (idx, score) in enumerate(similarities[:top_k]):
            chunk = self._chunks[idx]
            results.append({
                'text': chunk['text'],
                'page': chunk['page'],
                'score': score,
                'rank': rank + 1
            })
        return results
    
    def generate_smart_queries(self, num_queries: int = 5) -> List[str]:
        """Generate document-specific queries using extracted keywords."""
        kw = self._keywords
        queries = []
        
        if len(kw) >= 2:
            queries.append(f"definition and explanation of {kw[0]} and {kw[1]}")
        if len(kw) >= 4:
            queries.append(f"key concepts about {kw[2]} {kw[3]}")
        if len(kw) >= 6:
            queries.append(f"important information regarding {kw[4]} {kw[5]}")
        
        return queries[:num_queries]
    
    def retrieve_with_smart_queries(self, top_k_per_query: int = 3) -> List[Dict]:
        """Retrieve chunks using multiple smart queries for better coverage."""
        queries = self.generate_smart_queries()
        seen_texts = set()
        all_results = []
        
        for query in queries:
            results = self.similarity_search(query, top_k=top_k_per_query)
            for r in results:
                if r['text'] not in seen_texts:
                    seen_texts.add(r['text'])
                    all_results.append(r)
        
        return sorted(all_results, key=lambda x: x['score'], reverse=True)
    
    def get_representative_chunks(self, num_chunks: int = 10) -> List[Dict]:
        """Get evenly distributed chunks across document."""
        if len(self._chunks) <= num_chunks:
            return [{'text': c['text'], 'page': c['page'], 'score': 1.0} 
                    for c in self._chunks]
        
        step = len(self._chunks) // num_chunks
        return [{'text': self._chunks[i * step]['text'], 
                 'page': self._chunks[i * step]['page'], 
                 'score': 1.0} 
                for i in range(num_chunks)]

Flashcard Generation Prompt:

def generate_flashcards_prompt(chunks: List[Dict], num_cards: int, difficulty: str, question_types: List[str]) -> str:
    """Build prompt for Gemini flashcard generation."""
    context = "\n\n".join([
        f"[Chunk {i+1}] (Page {c.get('page', '?')}):\n{c['text']}"
        for i, c in enumerate(chunks)
    ])
    
    return f"""Based on the following document excerpts, generate {num_cards} flashcards.

CONTEXT:
{context}

REQUIREMENTS:
- Difficulty: {difficulty}
- Generate exactly {num_cards} flashcards
- Each flashcard should have a clear question and concise answer
- Focus on key concepts, definitions, and important facts
- Use these question types: {", ".join(question_types)}

OUTPUT FORMAT (JSON):
{{
  "flashcards": [
    {{
      "question": "...",
      "answer": "...",
      "type": "DEFINITION",
      "difficulty": "{difficulty}",
      "source_chunk": 1
    }}
  ]
}}

Return ONLY valid JSON."""

def call_gemini(prompt: str, api_key: str) -> Dict:
    """Call Gemini API for flashcard generation."""
    import google.generativeai as genai
    
    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(
        model_name=os.environ.get('GEMINI_MODEL', 'gemini-2.0-flash'),
        generation_config={
            'temperature': 0.3,
            'max_output_tokens': 4096
        }
    )
    
    response = model.generate_content(prompt)
    text = response.text
    
    # Extract JSON if wrapped in markdown
    if '```json' in text:
        text = text.split('```json')[1].split('```')[0]
    
    return json.loads(text.strip())

Lambda Function 4: S3 Upload Handler

Generates presigned URLs for secure file uploads to S3.

Setting	Value
Function name	`bandup-s3-upload`
Runtime	Python 3.11
Memory	256 MB
Timeout	30 seconds
Trigger	API Gateway (sync)

Core Implementation:

import json
import os
import boto3
from datetime import datetime
from typing import Dict, Any

s3_client = boto3.client('s3')

def lambda_handler(event: Dict[str, Any], context: Any) -> Dict:
    """Generate presigned URL for S3 upload."""
    
    request = json.loads(event.get('body', '{}'))
    
    user_id = request.get('user_id')
    filename = request.get('filename')
    content_type = request.get('content_type', 'application/octet-stream')
    upload_type = request.get('upload_type', 'general')
    
    # Determine bucket based on upload type
    bucket_map = {
        'speaking_audio': os.environ.get('S3_BUCKET_AUDIO'),
        'flashcard_pdf': os.environ.get('S3_BUCKET_DOCUMENTS'),
        'writing_essay': os.environ.get('S3_BUCKET_DOCUMENTS'),
    }
    bucket = bucket_map.get(upload_type)
    
    # Generate organized S3 key
    timestamp = datetime.now().strftime('%Y%m%d_%H%M%S')
    key = f"uploads/{upload_type}/{user_id}/{timestamp}_{filename}"
    
    # Generate presigned PUT URL (15 min expiry)
    upload_url = s3_client.generate_presigned_url(
        'put_object',
        Params={'Bucket': bucket, 'Key': key, 'ContentType': content_type},
        ExpiresIn=900
    )
    
    # Generate presigned GET URL (1 hour expiry)
    get_url = s3_client.generate_presigned_url(
        'get_object',
        Params={'Bucket': bucket, 'Key': key},
        ExpiresIn=3600
    )
    
    return {
        'statusCode': 200,
        'headers': {'Content-Type': 'application/json'},
        'body': json.dumps({
            'upload_url': upload_url,
            'get_url': get_url,
            'file_url': f"s3://{bucket}/{key}",
            'expires_in': 900
        })
    }

Secure Secrets Management

All Lambda functions use AWS Secrets Manager to retrieve API keys:

# secrets_helper.py (in Lambda Layer)
import boto3
import os
from functools import lru_cache

@lru_cache(maxsize=1)
def get_gemini_api_key() -> str:
    """Retrieve Gemini API key from Secrets Manager (cached)."""
    client = boto3.client('secretsmanager')
    secret_arn = os.environ.get('GEMINI_API_KEY_SECRET_ARN')
    
    response = client.get_secret_value(SecretId=secret_arn)
    return response['SecretString']

Security Best Practices:

Never hardcode API keys in Lambda code
Use AWS Secrets Manager for all sensitive credentials
Rotate secrets regularly using automatic rotation
Use IAM roles with least-privilege permissions

IAM Role for Lambda Functions

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "BedrockAccess",
      "Effect": "Allow",
      "Action": ["bedrock:InvokeModel"],
      "Resource": "arn:aws:bedrock:*:*:foundation-model/amazon.titan-embed-text-v2*"
    },
    {
      "Sid": "DynamoDBAccess",
      "Effect": "Allow",
      "Action": ["dynamodb:PutItem", "dynamodb:GetItem", "dynamodb:Query"],
      "Resource": [
        "arn:aws:dynamodb:*:*:table/bandup-evaluations",
        "arn:aws:dynamodb:*:*:table/bandup-flashcard-sets"
      ]
    },
    {
      "Sid": "S3Access",
      "Effect": "Allow",
      "Action": ["s3:GetObject", "s3:PutObject"],
      "Resource": "arn:aws:s3:::bandup-*/*"
    },
    {
      "Sid": "SQSAccess",
      "Effect": "Allow",
      "Action": ["sqs:ReceiveMessage", "sqs:DeleteMessage"],
      "Resource": "arn:aws:sqs:*:*:bandup-*-queue"
    },
    {
      "Sid": "SecretsAccess",
      "Effect": "Allow",
      "Action": ["secretsmanager:GetSecretValue"],
      "Resource": "arn:aws:secretsmanager:*:*:secret:bandup/*"
    },
    {
      "Sid": "CloudWatchLogs",
      "Effect": "Allow",
      "Action": ["logs:CreateLogStream", "logs:PutLogEvents"],
      "Resource": "arn:aws:logs:*:*:log-group:/aws/lambda/bandup-*"
    }
  ]
}

DynamoDB Tables

Lambda functions store results in two DynamoDB tables:

Table	Used By	Purpose
`bandup-evaluations`	Writing + Speaking Evaluators	Stores IELTS band scores, feedback, transcripts
`bandup-flashcard-sets`	Flashcard Generator	Stores generated flashcards and document metadata

Evaluations Table Schema (Writing & Speaking):

# Used by Writing Evaluator
table.put_item(Item={
    'evaluation_id': session_id,      # Partition Key
    'user_id': user_id,               # Sort Key
    'evaluation_type': 'writing',     # 'writing' or 'speaking'
    'status': 'completed',
    'overall_band': '7.0',
    'task_achievement_band': '7.0',   # Writing only
    'fluency_band': '6.5',            # Speaking only
    'pronunciation_band': '7.0',      # Speaking only
    'transcript': '...',              # Speaking only
    'feedback': json.dumps(feedback),
    'created_at': timestamp
})

Flashcard Sets Table Schema:

# Used by Flashcard Generator
table.put_item(Item={
    'set_id': set_id,                 # Partition Key
    'user_id': user_id,               # Sort Key
    'document_id': document_id,
    'status': 'completed',
    'flashcards': json.dumps(flashcards),
    'total_cards': 10,
    'page_count': 5,
    'chunk_count': 12,
    'created_at': timestamp
})

Environment Variables

Variable	Description	Example
`GEMINI_API_KEY_SECRET_ARN`	Secrets Manager ARN	`arn:aws:secretsmanager:...:secret:bandup/gemini-api-key`
`DYNAMODB_EVALUATIONS`	Evaluations table (Writing + Speaking)	`bandup-evaluations`
`DYNAMODB_FLASHCARD_SETS`	Flashcard sets table	`bandup-flashcard-sets`
`S3_BUCKET_AUDIO`	Audio bucket	`bandup-audio-bucket`
`S3_BUCKET_DOCUMENTS`	Documents bucket	`bandup-documents-bucket`
`BEDROCK_REGION`	Bedrock region for Titan	`us-east-1`
`RAG_CHUNK_SIZE`	Chunk size for RAG	`3000`
`RAG_CHUNK_OVERLAP`	Chunk overlap	`300`
`GEMINI_MODEL`	Gemini model name	`gemini-2.0-flash`

Deploy Lambda Functions


cd rag_flashcard
pip install -r requirements.txt -t package/
cp lambda_handler.py rag_pipeline.py package/
cd package && zip -r ../function.zip . && cd ..

# Create Lambda function
aws lambda create-function \
    --function-name bandup-flashcard-generator \
    --runtime python3.11 \
    --handler lambda_handler.lambda_handler \
    --role arn:aws:iam::${AWS_ACCOUNT_ID}:role/bandup-lambda-role \
    --timeout 600 \
    --memory-size 1024 \
    --zip-file fileb://function.zip \
    --environment Variables="{
        GEMINI_API_KEY_SECRET_ARN=arn:aws:secretsmanager:${AWS_REGION}:${AWS_ACCOUNT_ID}:secret:bandup/gemini-api-key,
        BEDROCK_REGION=us-east-1,
        RAG_CHUNK_SIZE=3000
    }"

# Add SQS trigger
aws lambda create-event-source-mapping \
    --function-name bandup-flashcard-generator \
    --event-source-arn arn:aws:sqs:${AWS_REGION}:${AWS_ACCOUNT_ID}:bandup-flashcard-queue \
    --batch-size 1

Next Steps

Proceed to DynamoDB to configure the database tables.