Prerequisites

Before setting up the DataSuite ETL system, ensure your development environment meets the following requirements.

System Requirements

Hardware Requirements

  • RAM: 8GB minimum, 16GB recommended for full stack

  • Storage: 20GB free disk space for containers and data

  • CPU: 4+ cores recommended for optimal performance

  • Network: Stable internet connection for downloading container images

Operating System Support

  • Linux: Ubuntu 20.04+, CentOS 8+, RHEL 8+ (recommended)

  • macOS: macOS 10.15+ with Intel or Apple Silicon

  • Windows: Windows 10/11 with WSL2 enabled

Required Software

Docker Platform

Docker Engine: Version 20.10 or later

Docker Compose: Version 2.0 or later

Installation Resources:

Git Version Control

Git: Version 2.30 or later

Network Configuration

Port Requirements

The following ports must be available on your system:

Service
Port
Protocol
Purpose

MySQL

3306

TCP

Database connections

ClickHouse HTTP

8123

TCP

Query interface

ClickHouse Native

9000

TCP

Native protocol

LogStash

5044

TCP

Data ingestion

Airflow Web UI

8080

TCP

Workflow management

Port Conflict Check:

Firewall Configuration

  • Allow inbound connections on required ports for local development

  • Configure Docker daemon to access external registries

  • Ensure Docker containers can communicate with each other

Code Editor

  • VS Code with SQL and Docker extensions

  • IntelliJ IDEA with database plugins

  • Vim/Emacs with SQL syntax highlighting

Database Clients

  • DBeaver - Universal database client

  • MySQL Workbench - MySQL-specific client

  • ClickHouse Client - Native ClickHouse CLI

Terminal/Shell

  • Bash or Zsh with command completion

  • Windows PowerShell or WSL2 on Windows

Validation Checklist

Run through this checklist to ensure your environment is ready:

✅ Docker Validation

✅ Docker Compose Validation

✅ Resource Validation

✅ Network Validation

Common Issues and Solutions

Issue: Docker Permission Denied

Symptoms: permission denied while trying to connect to the Docker daemon socket

Solutions:

Issue: Port Already in Use

Symptoms: port is already allocated errors during startup

Solutions:

Issue: Insufficient Memory

Symptoms: Containers exit with OOMKilled status

Solutions:

  • Increase Docker Desktop memory allocation (Mac/Windows)

  • Close unnecessary applications

  • Reduce number of concurrent containers during development

Issue: WSL2 Configuration (Windows)

Symptoms: Docker commands not found or slow performance

Solutions:

Performance Optimization

Docker Configuration

System Optimization

Next Steps

Once your prerequisites are met:

  1. Quick Start Setup - Get running fast with Docker Compose

  2. Advanced Setup - Custom configuration with individual containers

  3. Service Verification - Confirm everything is working correctly

Getting Help

If you encounter issues during prerequisite setup:

  • System Issues: Consult your operating system documentation

  • Network Issues: Check firewall and antivirus settings

  • Hardware Issues: Verify system meets minimum requirements

The setup process becomes much smoother when prerequisites are properly configured, so take time to validate each requirement before proceeding.

Last updated