Workshop 9: Local Deployment and Observability

Learning Objectives

By the end of this workshop, you will be able to:

Deploy ICN locally - Run ICN via native binary or Docker
Verify health status - Check component health via API endpoints
Monitor with Prometheus - Scrape and query ICN metrics
Configure logging - Adjust log levels and understand log output
Troubleshoot issues - Diagnose common deployment problems

Goal

Deploy ICN locally using Docker or native binaries, verify health endpoints, explore Prometheus metrics, and understand logging and tracing.

Prerequisites

Completed Module 9: Operations
Docker installed (for container deployment) or Rust toolchain (for native)
curl and jq installed

Estimated time

2-3 hours

Related Materials

Module 9: Operations - Background reading
HOMELAB_DEPLOYMENT.md - K3s production deployment
production-hardening.md - Security hardening

Part 1: Deployment Options

Overview

ICN can be deployed in several ways:

Method	Use Case
Native binary	Development, single-node testing
Docker	Local testing, reproducible environments
Docker Compose	Multi-node local clusters
Kubernetes/K3s	Production deployment

Locate deployment files

ls -la deploy/
ls -la deploy/docker/ 2>/dev/null
ls -la deploy/k8s/ 2>/dev/null

Checkpoint

You identified available deployment methods
You know which files configure each method

Part 2: Native Binary Deployment

Steps

Build release binaries:
```
cd icn && cargo build --release
```

Create a data directory:

export ICN_DATA=$(mktemp -d)
export ICN_PASSPHRASE="workshop"

Initialize identity:

./target/release/icnctl --data-dir "$ICN_DATA" id init

Start the daemon:

./target/release/icnd --data-dir "$ICN_DATA"

Configuration file

Create $ICN_DATA/config.toml:

data_dir = "$ICN_DATA"

[network]
listen_addr = "0.0.0.0:4001"
mdns_enabled = true

[gateway]
enabled = true
bind_addr = "0.0.0.0:8080"

[observability]
metrics_port = 9100
health_port = 8081
log_level = "info"

Questions to answer

What ports are used by default?
How do you change the gateway port?
Where are logs written?

Checkpoint

Daemon starts successfully
You can access the gateway

Part 3: Docker Deployment

Steps

Build the Docker image:

docker build -t icn:local -f deploy/docker/Dockerfile .

Run the container:
```
docker run -d \
  --name icn-node \
  -p 8080:8080 \
  -p 9100:9100 \
  -p 8081:8081 \
  -p 4001:4001 \
  -e ICN_KEYSTORE_PASSPHRASE=workshop \
  icn:local
```
Security Note: The example above passes the passphrase via environment variable for simplicity. In production, use Docker secrets or mount a credentials file to avoid exposing passphrases in process listings or container inspect output.
Check logs:
```
docker logs -f icn-node
```

Docker Compose (multi-node)

If available:

cd deploy/docker
docker-compose up -d
docker-compose logs -f

Questions to answer

How are secrets passed to the container?
What volumes are mounted?
How do containers discover each other?

Checkpoint

Docker container running
Gateway accessible from host

Part 4: Health Endpoint

Steps

Check the basic health endpoint:

curl -s http://localhost:8080/v1/health | jq

Basic health response:

{
  "status": "ok",
  "version": "0.1.0"
}

For detailed component checks, use /v1/health/detailed:

curl -s http://localhost:8080/v1/health/detailed | jq

Detailed health response:

{
  "status": "healthy",
  "version": "0.1.0",
  "checks": {
    "cooperative_manager": {"status": "ok", "details": "..."},
    "notification_queue": {"status": "ok", "details": "..."},
    "system": {"status": "ok", "details": "..."}
  }
}

Health check details

Find the health endpoint implementation:

grep -r "health" icn/crates/icn-gateway/src/ --include="*.rs" | head -10

Questions to answer

What components are checked in /v1/health/detailed?
What's the difference between /v1/health and /v1/health/detailed?
How often should health be polled?

Checkpoint

Basic health endpoint returns 200
Detailed health shows component status

Part 5: Prometheus Metrics

Steps

Access the metrics endpoint:

curl -s http://localhost:9100/metrics | head -50

Identify key metric types:
- Counters (total counts)
- Gauges (current values)
- Histograms (distributions)

Key metrics to find

Metric	Type	Description
`icn_gateway_requests_total`	Counter	Total HTTP requests
`icn_gossip_messages_received_total`	Counter	Gossip messages received
`icn_ledger_entries_total`	Counter	Total ledger entries
`icn_trust_edges_total`	Gauge	Current trust edges
`icn_network_peers_connected`	Gauge	Connected peers

Exercise

Search for metric definitions:

grep -r "counter!\|gauge!\|histogram!" icn/crates/icn-obs/src/ --include="*.rs" | head -20

Questions to answer

What naming convention do metrics follow?
How are labels used?
What is the scrape interval for Prometheus?

Checkpoint

Metrics endpoint accessible
You can identify key metrics

Part 6: Prometheus + Grafana Setup (Optional)

Steps

Create prometheus.yml:

global:
  scrape_interval: 15s

scrape_configs:
  - job_name: 'icn'
    static_configs:
      - targets: ['localhost:9100']

Run Prometheus:

docker run -d \
  --name prometheus \
  -p 9090:9090 \
  -v $(pwd)/prometheus.yml:/etc/prometheus/prometheus.yml \
  prom/prometheus

Access Prometheus UI: http://localhost:9090

Run Grafana (optional):

docker run -d \
  --name grafana \
  -p 3000:3000 \
  grafana/grafana

Access Grafana: http://localhost:3000 (admin/admin)

PromQL queries to try

# Request rate over 5 minutes
rate(icn_gateway_requests_total[5m])

# Current connected peers
icn_network_peers_connected

# Gossip message rate by type
rate(icn_gossip_messages_received_total[5m])

Checkpoint

Prometheus scraping ICN metrics
You can query metrics in PromQL

Part 7: Logging Configuration

Steps

Find logging configuration:

grep -r "tracing\|logging" icn/crates/icn-obs/src/ --include="*.rs" | head -15

Test different log levels:

RUST_LOG=debug ./target/release/icnd --data-dir "$ICN_DATA"
RUST_LOG=icn_gossip=trace ./target/release/icnd --data-dir "$ICN_DATA"

Log levels

Level	Use Case
error	Critical failures
warn	Recoverable issues
info	Normal operations
debug	Development debugging
trace	Deep inspection

Log format (JSON)

{
  "timestamp": "2024-01-01T12:00:00Z",
  "level": "INFO",
  "target": "icn_gossip::actor",
  "message": "Subscribed to topic",
  "topic": "ledger:entries"
}

Questions to answer

How do you filter logs by module?
What format is used for timestamps?
How are structured fields included?

Checkpoint

You can adjust log levels
You understand log format

Part 8: Distributed Tracing (Optional)

Steps

Find tracing configuration in icn-obs
Look for OpenTelemetry integration
Understand span structure

Tracing concepts

Request arrives at gateway
  └── Span: "gateway.handle_request"
        └── Span: "ledger.create_entry"
              └── Span: "store.put"
        └── Span: "gossip.announce"
              └── Span: "network.broadcast"

Jaeger setup (optional)

docker run -d \
  --name jaeger \
  -p 16686:16686 \
  -p 6831:6831/udp \
  jaegertracing/all-in-one

Questions to answer

What is a trace vs a span?
How are trace IDs propagated?
What exporter is configured?

Checkpoint

You understand tracing structure
You know how to enable tracing

Part 9: Troubleshooting Checklist

Common issues and diagnostics

Daemon won't start:

# Check port availability
lsof -i :8080
lsof -i :4001

# Check keystore
./target/release/icnctl --data-dir "$ICN_DATA" id show

Gateway not responding:

# Check health
curl -v http://localhost:8080/v1/health

# Check logs
RUST_LOG=debug ./target/release/icnd --data-dir "$ICN_DATA" 2>&1 | head -50

Metrics not available:

# Check metrics endpoint
curl -v http://localhost:9100/metrics

# Verify metrics are enabled in config

Gossip not working:

# Check network connectivity
./target/release/icnctl --data-dir "$ICN_DATA" status

# Check for mDNS
RUST_LOG=icn_net=debug ./target/release/icnd --data-dir "$ICN_DATA"

Checkpoint

You can diagnose common issues
You know which logs to check

Part 10: Production Considerations

Security checklist

TLS certificates configured
Keystore passphrase secured
Rate limiting enabled
CORS origins restricted

Monitoring checklist

Prometheus scraping metrics
Alerting configured for critical metrics
Log aggregation in place
Dashboards created for key metrics

Backup checklist

Keystore backed up securely
Sled database backup strategy
Configuration versioned

Summary

After completing this workshop you should be able to:

Deploy ICN using native binaries or Docker
Verify health and component status
Access and understand Prometheus metrics
Configure and filter logging
Troubleshoot common operational issues

Key Takeaways

Concept	Key Point
Health Endpoints	`/v1/health` (basic), `/v1/health/detailed` (component status)
Metrics Port	9100 by default; Prometheus format
Gateway Port	8080 by default; REST + WebSocket
Network Port	4001 by default; QUIC/UDP
Log Levels	error < warn < info < debug < trace
Log Filtering	`RUST_LOG=icn_gossip=debug,icn_ledger=trace`

Try It Yourself

Challenge 1: Create a Grafana dashboard

Set up Prometheus + Grafana (as shown in Part 6)
Create a dashboard with:
- Connected peers gauge
- Request rate graph
- Gossip message throughput
- Ledger entry count

Challenge 2: Alert configuration Create a Prometheus alerting rule for:

Health check failure
No connected peers for 5 minutes
Gossip message rate drops to zero

Challenge 3: Log analysis

Run with RUST_LOG=debug for 10 minutes
Analyze the logs to answer:
- How many gossip messages were processed?
- Were there any warnings or errors?
- What was the most chatty component?

Challenge 4: Multi-node deployment Use Docker Compose to deploy 3 ICN nodes that form a network:

Nodes discover each other via mDNS
Subscribe to a shared topic
Verify message propagation across all nodes

Cleanup

# Stop containers
docker stop icn-node prometheus grafana jaeger 2>/dev/null
docker rm icn-node prometheus grafana jaeger 2>/dev/null

# Remove temp data
rm -rf "$ICN_DATA"

Troubleshooting

Port already in use

Stop any existing processes using the ports or change the configuration.

Docker image build fails

Ensure you're in the repository root and Docker daemon is running.

Metrics not appearing in Prometheus

Check that the scrape target is accessible and the job is configured correctly.

Next steps

Proceed to Workshop 10: Contributor Workflow