IngestIQ

Requirements

System requirements for self-hosting IngestIQ

System Requirements#

Minimum Requirements#

ResourceMinimumRecommended
CPU2 cores4+ cores
RAM4 GB8+ GB
Storage20 GB100+ GB
OSLinux (x86_64)Ubuntu 22.04+, Debian 12+

For Production#

ResourceRecommended
CPU8+ cores
RAM16+ GB
Storage500+ GB SSD
Network1 Gbps

Software Requirements#

Required#

Docker

Docker 20.10+ and Docker Compose v2

Node.js

Node.js 18+ (for local development)

Optional#

  • PostgreSQL client - Direct database access
  • Redis CLI - Queue monitoring
  • nats-cli - Event debugging

API Keys Required#

You'll need API keys from:

ServicePurposeGet Key
OpenAIEmbeddingsplatform.openai.com
Google AIDocument parsingaistudio.google.com

Optional API Keys#

ServicePurpose
Google OAuthGoogle Drive connector
FirecrawlWeb scraping (optional)

Network Requirements#

Ports#

PortServiceRequired
3000API ServerYes
5432PostgreSQLInternal
5433Vector DBInternal
6379RedisInternal
4222NATSInternal
9000MinIOInternal
3001GotenbergInternal

External Access#

  • Outbound: Access to OpenAI API, Google AI API
  • Inbound: Port 3000 for API access

Storage Requirements#

Database Storage#

ComponentSize Estimate
Management DB1-5 GB
Vector DB10-100+ GB (depends on document count)

Object Storage#

Content TypeSize Estimate
Uploaded documentsVaries
Processed files~2x original size

Vector storage grows with document count. Plan for approximately 10KB per document chunk.

Docker Compose Services#

IngestIQ's docker-compose.yml runs:

ServiceImagePurpose
vectordbpostgres:16 + pgvectorVector storage
redisredis:7-alpineJob queue
natsnats:2.10Event messaging
miniominio/minioS3 storage
gotenberggotenberg:8Document conversion

Quick Check#

Verify your system is ready:

# Check Docker
docker --version
docker compose version

# Check Node.js
node --version  # Should be 18+

# Check available memory
free -h

# Check disk space
df -h

Next Steps#

Documentation