Sebastián Sifontes

Future Blog Post

2199-01-01T00:00:00+01:00

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Safe Alembic Migrations in Production: A Practical Guide

2026-05-14T00:00:00+02:00

Running database migrations in production is one of those tasks that’s routine until it isn’t. Here’s the approach I use with Alembic to keep migrations safe and reversible.

The Golden Rules

1. Never edit a migration that has already been applied to production.

If a migration is live, create a new one to fix it. Editing applied migrations breaks the revision chain and causes alembic upgrade head to fail or apply unexpected changes.

2. Always test the downgrade.

Write downgrade() properly, not just pass. You’ll need it the day a deploy goes wrong:

def downgrade():
    op.drop_column('users', 'new_column')  # explicit
    # NOT just: pass

3. Check current revision before deploying.

alembic current
alembic history --verbose

Always verify the database is at the expected revision before running upgrade head in CI/CD.

The Pattern I Use in Docker

In ViraClip, migrations run as a one-shot step before the API starts:

# docker-compose.yaml
backend:
  command: >
    sh -c "alembic upgrade head &&
           uvicorn src.main:app --host 0.0.0.0 --port 8000"
  depends_on:
    postgres:
      condition: service_healthy

This ensures the schema is always up to date when the API starts, and the service_healthy condition prevents migrations from running before PostgreSQL is ready.

Dangerous Operations to Avoid

Operation	Why dangerous	Safe alternative
`DROP COLUMN` immediately	Old code still references it	Deprecate first, drop in next deploy
Rename column in one step	Breaks existing queries	Add new column, migrate data, drop old
Add NOT NULL without default	Fails on existing rows	Add with default, then remove default

Zero-Downtime Strategy

For tables with millions of rows, use op.execute() with ALTER TABLE ... ADD COLUMN and a DEFAULT to avoid locking:

def upgrade():
    op.execute(
        "ALTER TABLE videos ADD COLUMN processed BOOLEAN DEFAULT FALSE NOT NULL"
    )

This is faster than the ORM-generated equivalent and avoids table locks on PostgreSQL 11+.

Building ViraClip: My Journey Creating an AI Video Processing Platform

2026-05-14T00:00:00+02:00

For the past several months, I’ve been building ViraClip — an AI-powered platform that automates video creation and processing for social media content.

The Problem

Content creators and marketers spend hours manually editing videos, adding subtitles, resizing for different platforms (TikTok, Instagram Reels, YouTube Shorts), and generating engaging clips. I wanted to automate all of this.

The Stack

ViraClip is built with a multi-service architecture:

Backend: FastAPI (Python) for the core API
Video Processing: FFmpeg with a custom pool manager for concurrent processing
AI Integration: Groq API, OpenAI, and custom ComfyUI workflows
Queue: Redis for job management and real-time progress tracking
Database: PostgreSQL with Alembic migrations
Orchestration: Docker Compose with dedicated services for each component
Rust Agent: A self-healing watchdog agent that monitors and restores services

Key Challenges

FFmpeg Concurrency

Running multiple FFmpeg processes simultaneously is tricky. Too many concurrent jobs crash the server; too few and the queue backs up. I built a custom ffmpeg_pool.py that manages worker slots dynamically based on CPU and memory availability.

ComfyUI Integration

Integrating ComfyUI for AI video generation required building a custom workflow executor that translates API requests into ComfyUI node graphs, monitors progress via WebSocket, and handles GPU memory gracefully.

Self-Healing Architecture

The Rust agent (rust-agent/) monitors all services and automatically restarts failed containers, notifies via webhook, and runs diagnostics — all without human intervention.

What’s Next

LTX video model integration for high-quality AI video generation
Multi-tenant support for agency clients
Automated content pipelines with n8n integration

If you’re building something similar or want to collaborate, feel free to reach out via GitHub.

Docker Multi-Service Architecture: Lessons from Running 6 Containers in Production

2026-05-14T00:00:00+02:00

Running a production system with 6+ Docker containers taught me more about resource management and failure modes than any tutorial ever could. Here’s what I learned building ViraClip’s infrastructure.

The Setup

ViraClip runs as a Docker Compose stack with these services:

services:
  backend:    # FastAPI API server
  worker:     # FFmpeg processing workers
  comfyui:    # AI video generation
  redis:      # Job queue + progress tracking
  postgres:   # Primary database
  rust-agent: # Self-healing watchdog

Lesson 1: Resource Limits Are Not Optional

Without mem_limit and cpus constraints, ComfyUI will happily consume all available RAM when generating video, killing every other container. Always set explicit limits:

comfyui:
  mem_limit: 8g
  cpus: '4'

Lesson 2: Health Checks Save Your Sanity

Docker’s depends_on only waits for a container to start, not to be ready. A PostgreSQL container is “running” 3 seconds before it accepts connections. Add proper health checks:

postgres:
  healthcheck:
    test: ["CMD-SHELL", "pg_isready -U $POSTGRES_USER"]
    interval: 5s
    timeout: 5s
    retries: 10

Lesson 3: The Rust Watchdog Pattern

Instead of relying solely on Docker’s restart policies, I built a Rust agent that:

Polls health endpoints every 30s
Runs diagnostic commands on failure
Attempts a graceful restart before forcing one
Sends a webhook notification with full context

This gives much richer failure information than a bare restart: always.

Lesson 4: Redis as the Source of Truth for Job State

When a worker crashes mid-job, you need to know exactly where it failed. Storing granular progress in Redis (not just “running/done”) lets you resume or retry intelligently:

redis.hset(f"job:{job_id}", mapping={
    "status": "processing",
    "step": "ffmpeg_encode",
    "progress": 67,
    "started_at": timestamp
})

Lesson 5: Shared Volumes Need Clear Ownership

When multiple containers write to the same volume (e.g., the output/ directory), file permission conflicts are inevitable. Use a single writer pattern — one container writes, others read via API.

Building a robust multi-service system is mostly about designing for failure gracefully. Each of these lessons came from a real production outage — the best kind of teacher.

FastAPI Background Tasks vs Celery: When to Use Each

2026-05-14T00:00:00+02:00

FastAPI ships with a built-in BackgroundTasks system. It’s tempting to use it for everything async — but there’s a clear line where you should switch to a proper task queue like Celery or Redis workers.

FastAPI BackgroundTasks: What It Is

BackgroundTasks runs a function in the same process after the HTTP response is sent. It shares memory with the web server and runs in the same event loop.

@app.post("/send-email")
async def send_email(background_tasks: BackgroundTasks, email: str):
    background_tasks.add_task(send_notification, email)
    return {"status": "queued"}

When BackgroundTasks Is Fine

Sending a welcome email after registration
Logging an event to a database
Invalidating a cache entry
Sending a webhook notification

Anything that’s fast, stateless, and non-critical. If it fails, the user doesn’t need to know.

When You Need a Real Queue

Scenario	Use
Task takes > 5 seconds	Redis/Celery worker
Task uses lots of CPU/RAM	Separate worker process
Task needs retry logic	Celery with exponential backoff
User needs progress tracking	Redis + WebSocket
Task must survive server restart	Persistent queue

For ViraClip, video rendering takes 30–300 seconds and uses all available CPU. Running that inside FastAPI would block every other request. It lives in a dedicated worker container.

The Pattern I Use

HTTP request comes in → validate input → create job record in PostgreSQL
Push job ID to Redis queue
Return 202 Accepted with job ID immediately
Separate worker container picks up job, processes it, updates Redis state
Frontend polls /jobs/{id}/status or subscribes to WebSocket

This keeps your API server snappy and your workers scalable independently.

GitHub Actions CI/CD for Python Projects: My Minimal Setup

2026-05-14T00:00:00+02:00

After setting up CI/CD for ViraClip and agente_seguros_ai, I’ve distilled my GitHub Actions setup to the smallest config that still gives me real value.

What I Actually Need from CI

For a solo developer on a SaaS product, CI doesn’t need to be complex. My requirements:

Run tests on every push to main and on PRs
Build and push Docker image on merge to main
Notify me if anything breaks

That’s it. No staging environments, no approval gates — just fast feedback.

The Workflow

# .github/workflows/ci.yml
name: CI

on:
  push:
    branches: [main]
  pull_request:
    branches: [main]

jobs:
  test:
    runs-on: ubuntu-latest
    services:
      postgres:
        image: postgres:15
        env:
          POSTGRES_PASSWORD: testpass
          POSTGRES_DB: testdb
        options: >-
          --health-cmd pg_isready
          --health-interval 5s
          --health-timeout 5s
          --health-retries 10
      redis:
        image: redis:7
        options: >-
          --health-cmd "redis-cli ping"
          --health-interval 5s

    steps:
      - uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: '3.11'
          cache: 'pip'

      - name: Install dependencies
        run: pip install -r requirements.txt

      - name: Run Alembic migrations
        run: alembic upgrade head
        env:
          DATABASE_URL: postgresql://postgres:testpass@localhost/testdb

      - name: Run tests
        run: pytest tests/ -v --tb=short
        env:
          DATABASE_URL: postgresql://postgres:testpass@localhost/testdb
          REDIS_URL: redis://localhost:6379

Key Decisions

Use services: for dependencies — GitHub Actions can spin up Postgres and Redis as sidecar containers. No mocking, no SQLite workarounds — your tests run against the real thing.

Cache pip dependencies — cache: 'pip' in setup-python cuts install time from ~45s to ~5s on repeat runs.

Run migrations in CI — This catches broken migrations before they hit production. If alembic upgrade head fails, the build fails.

Docker Build on Merge

  build:
    needs: test
    if: github.ref == 'refs/heads/main'
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Build and push Docker image
        uses: docker/build-push-action@v5
        with:
          push: true
          tags: ghcr.io/$:latest

This only runs after tests pass and only on main — safe and simple.

Redis as a Job Queue: Patterns I Use in Production

2026-05-14T00:00:00+02:00

After running Redis as a job queue in ViraClip for months, I’ve settled on a set of patterns that make async video processing reliable and observable.

Why Redis Over a Dedicated Queue

Tools like Celery + RabbitMQ are powerful but heavy. For a single-product SaaS with predictable load, Redis gives you 90% of what you need with far less operational overhead. The key is being disciplined about your data structures.

Pattern 1: Hash-Based Job State

Don’t store job state as a JSON blob in a string key. Use a Hash so you can update individual fields atomically:

# Set initial state
redis.hset(f"job:{job_id}", mapping={
    "status": "queued",
    "created_at": time.time(),
    "user_id": user_id,
    "type": "video_render"
})

# Update just one field mid-processing
redis.hset(f"job:{job_id}", "status", "processing")
redis.hset(f"job:{job_id}", "progress", 42)

Pattern 2: Sorted Sets for Priority Queues

A plain LPUSH/RPOP list queue doesn’t support priority. Sorted sets do:

# Enqueue with priority score (lower = higher priority)
redis.zadd("job_queue", {job_id: priority_score})

# Dequeue the highest priority job
job_id = redis.zpopmin("job_queue", count=1)

Pattern 3: Pub/Sub for Real-Time Progress

For live progress bars in the UI, I publish updates to a channel and the frontend subscribes via WebSocket:

# Worker publishes progress
redis.publish(f"job:progress:{job_id}", json.dumps({
    "progress": 67,
    "step": "encoding",
    "eta_seconds": 12
}))

Pattern 4: TTL on Everything

Always set a TTL on job keys. Completed jobs accumulate fast and will eventually fill your Redis memory:

# After job completes, keep state for 24h for UI polling
redis.expire(f"job:{job_id}", 86400)

Pattern 5: Dead Letter Queue

Jobs that fail repeatedly go to a dead letter list so they don’t block the main queue:

if job_attempts >= MAX_RETRIES:
    redis.lpush("job_queue:dead", job_id)
    redis.hset(f"job:{job_id}", "status", "failed")

These five patterns together give you a production-grade async system with zero external dependencies beyond Redis itself.

Building a WhatsApp Lead Qualification Bot with n8n and LLMs

2026-05-14T00:00:00+02:00

For the agente_seguros_ai project, I built a WhatsApp bot that qualifies insurance leads automatically — 24/7, no human needed unless the lead is hot. Here’s how the architecture works.

Why n8n

n8n is a self-hostable workflow automation tool (think Zapier, but open source and deployable on your own VPS). It handles the WhatsApp webhook, conversation state, and routing logic without writing a single line of code for those parts.

I only write code for the parts that need it: the LLM prompt, the database writes, and the lead scoring.

The Flow

WhatsApp message
    ↓
n8n webhook trigger
    ↓
Load conversation history from PostgreSQL
    ↓
Build prompt with context + user message
    ↓
Groq API (Llama 3.1) generates response
    ↓
Parse structured data (name, phone, interest)
    ↓
Update PostgreSQL + score lead
    ↓
Send reply via WhatsApp Business API
    ↓
If hot lead → notify agent via Telegram

The Prompt Strategy

The system prompt has two modes:

Qualification mode: The bot asks 4–5 questions naturally (not as a form) to gather: full name, type of insurance needed, current coverage, budget range.
Handoff mode: Once qualified, the bot says a human advisor will contact them and stops generating AI responses.

The trick is injecting the conversation history as context so the model doesn’t repeat questions already answered.

Handling State in PostgreSQL

Each conversation has a state field: qualifying, qualified, handed_off, cold. n8n checks this before deciding whether to call the LLM or just send a static message.

Lessons Learned

Groq is fast enough for real-time chat — typical latency is 400–800ms including the WhatsApp API round-trip.
Structured output matters — ask the LLM to return JSON for extracted data, not free text.
Test edge cases manually — users who answer in voice notes, send images, or write in regional dialect.
n8n’s error handling is weak — wrap your custom function nodes in try/catch and log to a separate table.

Blog Post number 4

2015-08-14T00:00:00+02:00

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Headings are cool

You can have many headings

Aren’t headings cool?

Blog Post number 3

2014-08-14T00:00:00+02:00

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Headings are cool

You can have many headings