S4QuLa/sps2
Project Overview

sps2 is a modern package manager written in Rust, designed exclusively for macOS ARM64 (Apple Silicon). It is built from the ground up with atomic operations, rollback capabilities, and a clean async architecture. This is a completely new package manager with its own package format, build system, and distribution infrastructure.

Installation

Prerequisites

  • macOS with Apple Silicon (M1/M2/M3)
  • Rust 1.86.0 or later
  • SQLite 3.x
  • sudo access for /opt/pm directory

Setup

# Clone the repository
git clone https://github.com/yourusername/sps2.git
cd sps2

# Build the project
cargo build --release

# Run setup script (requires sudo)
sudo ./setup.sh

# Add to PATH in your shell config
echo 'export PATH="/opt/pm/live/bin:$PATH"' >> ~/.zshrc
source ~/.zshrc

# Verify installation
sps2 --version

SQLx Setup (for development)

The state crate uses SQLx's compile-time checked queries. For development:

# Install sqlx-cli
cargo install sqlx-cli --no-default-features --features sqlite

# Set database URL
export DATABASE_URL="sqlite:///opt/pm/state.sqlite"

# Prepare offline queries (run in crates/state/)
cd crates/state
cargo sqlx prepare

General Development Rules

Rust Standards

  • Edition: Rust 2021
  • Resolver: Version 3 (set in workspace Cargo.toml)
  • MSRV: 1.86.0 (latest stable)
  • Target: aarch64-apple-darwin only

Code Quality Requirements

  1. All code must pass cargo fmt - No exceptions
  2. All code must pass cargo clippy - With pedantic lints enabled
  3. No warnings allowed - Use #![deny(warnings)] in lib.rs or enforce via CI with cargo clippy -- -D warnings
  4. Deny unsafe code by default - Use #![deny(clippy::pedantic, unsafe_code)] in every lib.rs
  5. Unsafe code requires justification - If needed, use #[allow(unsafe_code)] with detailed safety comment
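Taken together, a crate's lib.rs preamble under these rules might look like the following (a sketch; the exact lint set can vary per crate):

```rust
// lib.rs: crate-level lints enforcing the quality rules above.
#![deny(clippy::pedantic, unsafe_code)]
#![deny(warnings)]

// An exceptional unsafe block opts back in explicitly and documents why:
// #[allow(unsafe_code)]
// // SAFETY: <detailed justification goes here>
```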

Best Practices

  • Prefer &str over String for function parameters
  • Use Arc<str> instead of String for shared immutable strings
  • Return Result<T, Error> for all fallible operations
  • Use thiserror for error types, not manual implementations
  • Prefer iterators over manual loops
  • Use tokio for all async operations
  • Leverage RAII - resources should clean themselves up
  • Version constraints should be parsed into structured types, not passed as strings internally
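Two of these guidelines in miniature (illustrative only; the function names are made up for this example):

```rust
use std::sync::Arc;

// Prefer &str parameters: callers can pass String or &str without cloning.
fn greeting(name: &str) -> String {
    format!("hello, {name}")
}

// Arc<str> for shared immutable strings: clones copy a pointer, not the bytes.
fn intern(label: &str) -> Arc<str> {
    Arc::from(label)
}
```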

Cross-Cutting Conventions

Aspect | Decision | Justification
Async runtime | tokio everywhere | Shared reactor, zero thread explosion
Database | sqlx for SQLite | Async-first, compile-time checked queries
HTTP client | reqwest with tokio | Async HTTP with connection pooling
Error model | thiserror per crate + fine-grained types in errors crate | Type-safe error handling, Clone when possible
Version specs | Python-style constraints (==, >=, ~=, etc.) | Flexible and familiar syntax for developers
Version parsing | semver crate with custom constraint parser | Battle-tested semver implementation
Logging | Events only - NO info!, warn!, error! | All output via event channel; JSON logs from subscriber
Progress | Broadcast Event enum via channels | Decouples core from UI details
SBOM | SPDX 3.0 JSON (primary), CycloneDX 1.6 (optional) | Built into every package via Syft
Crypto | Minisign signatures; BLAKE3 for content hashing | Small trust root + fast hashing
Linting | #![deny(clippy::pedantic, unsafe_code)] | Forces deliberate unsafe usage
CI | cargo deny plus cargo audit | Catches transitive vulnerabilities

Note: We avoid sys-info due to GPL-2.0 license. Load average detection uses num_cpus only.

Project Structure

.
├── apps
│   └── sps2
│       ├── Cargo.toml
│       └── src
│           ├── cli.rs
│           ├── display.rs
│           ├── error.rs
│           ├── events.rs
│           ├── main.rs
│           └── setup.rs
├── Cargo.lock
├── Cargo.toml
├── crates
│   ├── audit
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── lib.rs
│   │       ├── sbom_parser.rs
│   │       ├── scanner.rs
│   │       ├── types.rs
│   │       └── vulndb
│   │           ├── cache.rs
│   │           ├── database.rs
│   │           ├── manager.rs
│   │           ├── mod.rs
│   │           ├── parser.rs
│   │           ├── schema.rs
│   │           ├── sources
│   │           │   ├── github.rs
│   │           │   ├── mod.rs
│   │           │   ├── nvd.rs
│   │           │   └── osv.rs
│   │           ├── statistics.rs
│   │           └── updater.rs
│   ├── builder
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── api.rs
│   │       ├── archive.rs
│   │       ├── build_systems
│   │       │   ├── autotools.rs
│   │       │   ├── cargo.rs
│   │       │   ├── cmake.rs
│   │       │   ├── core.rs
│   │       │   ├── go.rs
│   │       │   ├── meson.rs
│   │       │   ├── mod.rs
│   │       │   ├── nodejs.rs
│   │       │   └── python.rs
│   │       ├── builder.rs
│   │       ├── cache
│   │       │   └── mod.rs
│   │       ├── compression.rs
│   │       ├── config.rs
│   │       ├── cross.rs
│   │       ├── dependencies
│   │       │   └── mod.rs
│   │       ├── environment
│   │       │   ├── core.rs
│   │       │   ├── dependencies.rs
│   │       │   ├── directories.rs
│   │       │   ├── execution.rs
│   │       │   ├── hermetic.rs
│   │       │   ├── isolation.rs
│   │       │   ├── mod.rs
│   │       │   ├── sandbox.rs
│   │       │   ├── types.rs
│   │       │   └── variables.rs
│   │       ├── error_handling
│   │       │   └── mod.rs
│   │       ├── events.rs
│   │       ├── fileops.rs
│   │       ├── format.rs
│   │       ├── lib.rs
│   │       ├── manifest.rs
│   │       ├── monitoring
│   │       │   ├── aggregator.rs
│   │       │   ├── config.rs
│   │       │   ├── metrics.rs
│   │       │   ├── mod.rs
│   │       │   ├── pipeline.rs
│   │       │   ├── telemetry.rs
│   │       │   └── tracing.rs
│   │       ├── orchestration
│   │       │   └── mod.rs
│   │       ├── packaging.rs
│   │       ├── quality_assurance
│   │       │   ├── config.rs
│   │       │   ├── linters
│   │       │   │   ├── cargo.rs
│   │       │   │   ├── clang.rs
│   │       │   │   ├── eslint.rs
│   │       │   │   ├── generic.rs
│   │       │   │   ├── mod.rs
│   │       │   │   └── python.rs
│   │       │   ├── mod.rs
│   │       │   ├── pipeline.rs
│   │       │   ├── policy
│   │       │   │   ├── license.rs
│   │       │   │   ├── mod.rs
│   │       │   │   ├── permissions.rs
│   │       │   │   └── size.rs
│   │       │   ├── reports.rs
│   │       │   ├── scanners
│   │       │   │   ├── cargo_audit.rs
│   │       │   │   ├── mod.rs
│   │       │   │   ├── npm_audit.rs
│   │       │   │   ├── python_scanner.rs
│   │       │   │   └── trivy.rs
│   │       │   └── types.rs
│   │       ├── quality.rs
│   │       ├── recipe.rs
│   │       ├── sbom.rs
│   │       ├── signing.rs
│   │       ├── starlark_bridge.rs
│   │       ├── timeout_utils.rs
│   │       └── workflow.rs
│   ├── config
│   │   ├── Cargo.toml
│   │   └── src
│   │       └── lib.rs
│   ├── errors
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── audit.rs
│   │       ├── build.rs
│   │       ├── config.rs
│   │       ├── install.rs
│   │       ├── lib.rs
│   │       ├── network.rs
│   │       ├── ops.rs
│   │       ├── package.rs
│   │       ├── state.rs
│   │       ├── storage.rs
│   │       └── version.rs
│   ├── events
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── lib.rs
│   │       └── progress
│   │           ├── algorithms.rs
│   │           ├── config.rs
│   │           ├── manager.rs
│   │           ├── mod.rs
│   │           ├── speed.rs
│   │           ├── tracker.rs
│   │           └── update.rs
│   ├── hash
│   │   ├── Cargo.toml
│   │   └── src
│   │       └── lib.rs
│   ├── index
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── cache.rs
│   │       ├── lib.rs
│   │       └── models.rs
│   ├── install
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── atomic
│   │       │   ├── filesystem.rs
│   │       │   ├── installer.rs
│   │       │   ├── linking.rs
│   │       │   ├── mod.rs
│   │       │   ├── rollback.rs
│   │       │   └── transition.rs
│   │       ├── installer.rs
│   │       ├── lib.rs
│   │       ├── operations.rs
│   │       ├── parallel.rs
│   │       ├── pipeline
│   │       │   ├── batch.rs
│   │       │   ├── config.rs
│   │       │   ├── decompress.rs
│   │       │   ├── download.rs
│   │       │   ├── mod.rs
│   │       │   ├── operation.rs
│   │       │   ├── resource.rs
│   │       │   └── staging.rs
│   │       ├── python.rs
│   │       ├── staging
│   │       │   ├── directory.rs
│   │       │   ├── guard.rs
│   │       │   ├── manager.rs
│   │       │   ├── mod.rs
│   │       │   ├── utils.rs
│   │       │   └── validation.rs
│   │       └── validation
│   │           ├── content
│   │           │   ├── limits.rs
│   │           │   ├── manifest.rs
│   │           │   ├── mod.rs
│   │           │   ├── tar.rs
│   │           │   └── zstd.rs
│   │           ├── format
│   │           │   ├── detection.rs
│   │           │   ├── extension.rs
│   │           │   ├── mod.rs
│   │           │   └── size_limits.rs
│   │           ├── mod.rs
│   │           ├── pipeline
│   │           │   ├── context.rs
│   │           │   ├── mod.rs
│   │           │   ├── orchestrator.rs
│   │           │   └── recovery.rs
│   │           ├── security
│   │           │   ├── mod.rs
│   │           │   ├── paths.rs
│   │           │   ├── permissions.rs
│   │           │   ├── policies.rs
│   │           │   └── symlinks.rs
│   │           └── types.rs
│   ├── manifest
│   │   ├── Cargo.toml
│   │   └── src
│   │       └── lib.rs
│   ├── net
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── client.rs
│   │       ├── download
│   │       │   ├── config.rs
│   │       │   ├── core.rs
│   │       │   ├── mod.rs
│   │       │   ├── resume.rs
│   │       │   ├── retry.rs
│   │       │   ├── stream.rs
│   │       │   └── validation.rs
│   │       └── lib.rs
│   ├── ops
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── build.rs
│   │       ├── context.rs
│   │       ├── health.rs
│   │       ├── install.rs
│   │       ├── keys.rs
│   │       ├── lib.rs
│   │       ├── maintenance.rs
│   │       ├── query.rs
│   │       ├── repository.rs
│   │       ├── security.rs
│   │       ├── self_update.rs
│   │       ├── small_ops.rs
│   │       ├── types.rs
│   │       ├── uninstall.rs
│   │       ├── update.rs
│   │       └── upgrade.rs
│   ├── package
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── error_helpers.rs
│   │       ├── lib.rs
│   │       ├── recipe.rs
│   │       ├── sandbox.rs
│   │       └── starlark
│   │           ├── build_systems.rs
│   │           ├── context.rs
│   │           ├── cross.rs
│   │           ├── features.rs
│   │           ├── mod.rs
│   │           └── parallel.rs
│   ├── progress
│   │   ├── Cargo.toml
│   │   └── src
│   │       └── lib.rs
│   ├── resolver
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── execution.rs
│   │       ├── graph.rs
│   │       ├── lib.rs
│   │       ├── resolver.rs
│   │       └── sat
│   │           ├── clause.rs
│   │           ├── conflict_analysis.rs
│   │           ├── mod.rs
│   │           ├── solver.rs
│   │           ├── types.rs
│   │           └── variable_map.rs
│   ├── root
│   │   ├── Cargo.toml
│   │   └── src
│   │       └── lib.rs
│   ├── state
│   │   ├── build.rs
│   │   ├── Cargo.toml
│   │   ├── migrations
│   │   │   ├── 0001_initial_schema.sql
│   │   │   ├── 0002_add_build_deps.sql
│   │   │   ├── 0003_add_package_files.sql
│   │   │   ├── 0004_add_venv_tracking.sql
│   │   │   └── 0005_add_package_map.sql
│   │   └── src
│   │       ├── lib.rs
│   │       ├── manager.rs
│   │       ├── models.rs
│   │       ├── queries_runtime.rs
│   │       └── queries.rs
│   ├── store
│   │   ├── Cargo.toml
│   │   └── src
│   │       ├── archive.rs
│   │       ├── archive.rs.backup
│   │       ├── format_detection.rs
│   │       ├── lib.rs
│   │       └── package.rs
│   └── types
│       ├── Cargo.toml
│       └── src
│           ├── format.rs
│           ├── lib.rs
│           ├── package.rs
│           ├── reports.rs
│           ├── state.rs
│           └── version.rs
├── LICENSE.md
├── README.md
├── setup.sh
├── STARLARK_API_DOCUMENTATION.md
├── test_build
│   ├── autotools
│   │   ├── configure.ac
│   │   ├── hello.c
│   │   ├── Makefile.am
│   │   └── recipe.star
│   ├── cargo
│   │   ├── Cargo.toml
│   │   ├── recipe.star
│   │   └── src
│   │       └── main.rs
│   ├── cmake
│   │   ├── CMakeLists.txt
│   │   ├── hello.c
│   │   └── recipe.star
│   ├── go
│   │   ├── go.mod
│   │   ├── main.go
│   │   └── recipe.star
│   ├── make
│   │   ├── hello.c
│   │   ├── Makefile
│   │   └── test-make.star
│   ├── meson
│   │   ├── hello.c
│   │   ├── meson.build
│   │   └── recipe.star
│   ├── nodejs-npm
│   │   ├── hello.js
│   │   ├── package.json
│   │   └── recipe.star
│   ├── nodejs-pnpm
│   │   ├── hello.js
│   │   └── package.json
│   ├── nodejs-yarn
│   │   ├── hello.js
│   │   └── package.json
│   ├── packages
│   │   ├── hello-autotools-1.0.0
│   │   │   └── bin
│   │   │       └── hello-autotools
│   │   ├── hello-autotools-1.0.0-1.arm64.sp
│   │   ├── hello-cargo-1.0.0
│   │   │   └── bin
│   │   │       └── hello-cargo
│   │   ├── hello-cargo-1.0.0-1.arm64.sp
│   │   ├── hello-cmake-1.0.0
│   │   │   └── bin
│   │   │       └── hello-cmake
│   │   ├── hello-cmake-1.0.0-1.arm64.sp
│   │   ├── hello-go-1.0.0
│   │   │   └── bin
│   │   │       └── hello-go
│   │   ├── hello-go-1.0.0-1.arm64.sp
│   │   ├── hello-meson-1.0.0
│   │   │   └── bin
│   │   │       └── hello-meson
│   │   ├── hello-meson-1.0.0-1.arm64.sp
│   │   ├── manifest.toml
│   │   ├── manual
│   │   │   ├── hello-autotools-1.0.0
│   │   │   │   └── bin
│   │   │   │       └── hello-autotools
│   │   │   ├── hello-autotools-1.0.0-1.arm64.sp
│   │   │   ├── hello-cargo-1.0.0
│   │   │   │   └── bin
│   │   │   │       └── hello-cargo
│   │   │   ├── hello-cargo-1.0.0-1.arm64.sp
│   │   │   ├── manifest.toml
│   │   │   └── sbom.spdx.json
│   │   ├── sbom.spdx.json
│   │   ├── test-make-1.0.0
│   │   │   └── bin
│   │   │       └── hello
│   │   └── test-make-1.0.0-1.arm64.sp
│   ├── python-pyproject
│   │   ├── hello.py
│   │   ├── pyproject.toml
│   │   └── recipe.star
│   └── python-setuppy
│       ├── hello.py
│       └── setup.py
└── VENV_CLEANUP_IMPLEMENTATION.md

Architecture Overview

Crate Dependencies

  • Foundation layer: errors and types depend on nothing except std/serde/thiserror
  • Base services: events, config, hash depend only on foundation crates
  • Core services: Can depend on foundation + base + other core services as needed
  • Higher services: Can depend on any lower layer
  • Orchestration: ops can depend on everything, but nothing depends on it
  • Maintain acyclic dependencies - no circular imports
  • Keep crates focused on single responsibilities

Dependency Structure

Foundation Layer (no dependencies):

  • errors - Error type definitions
  • types - Core data structures, version parsing

Base Services (depend on foundation):

  • events - Event definitions (depends on: errors, types)
  • config - Configuration structures (depends on: errors, types)
  • hash - BLAKE3 hashing (depends on: errors, types)

Core Services:

  • net - Network operations (depends on: errors, types, events)
  • manifest - Package manifests (depends on: errors, types, config, hash)
  • package - Starlark package definitions (depends on: errors, types, hash)
  • root - Filesystem operations (depends on: errors, types)
  • index - Package index (depends on: errors, types, manifest)
  • store - Content-addressed storage (depends on: errors, types, hash, root)

Higher Services:

  • resolver - Dependency resolution (depends on: errors, types, index, manifest)
  • builder - Package building (depends on: errors, types, events, package, manifest, hash, resolver)
  • state - State management (depends on: errors, types, events, store, root)
  • audit - Future CVE scanning (depends on: errors, types, manifest)

Integration Layer:

  • install - Binary package installation (depends on: errors, types, events, net, resolver, state, store, audit + external: dashmap, crossbeam)

Orchestration Layer:

  • ops - Command orchestration (depends on: all service crates)
  • sps2 - CLI application (depends on: ops, events)

Key principles:

  • Acyclic dependencies - no circular imports allowed
  • ops can depend on everything, but nothing depends on ops
  • All crates can use errors and types as they're the foundation

Example crate dependencies:

  • install needs: state (transitions), store (storage), resolver (deps), net (downloads)
  • builder needs: package (Starlark), manifest (metadata), hash (checksums), resolver (build deps), SBOM generation
  • state needs: store (linking), root (filesystem ops)
  • resolver needs: index (available packages), manifest (dependencies)

Version Resolution: The resolver uses Python-style version specifiers for maximum flexibility:

  • Uses semver crate for version parsing and comparison
  • Finds the highest version that satisfies all constraints
  • Supports complex version ranges with multiple constraints
  • Detects conflicts when constraints cannot be satisfied
  • Handles both runtime dependencies (for install) and build dependencies (for build)
  • Provides parallel execution plan for maximum performance

Dependency resolution behavior:

  • During sps2 install: Downloads and installs binary packages with runtime dependencies
  • During sps2 build: Resolves and installs build dependencies to temporary environment
  • Build dependencies are installed to a temporary build environment
  • Runtime dependencies of build deps are also installed (transitive)
  • Circular dependencies are detected separately for runtime and build graphs
  • Build environment is isolated from user's installed packages
  • Only runtime dependencies are included in the final .sp package

Dependency Resolution Architecture

The resolver uses a state-of-the-art SAT solver implementing DPLL with Conflict-Driven Clause Learning (CDCL) for deterministic, optimal dependency resolution:

SAT Solver Implementation

The resolver translates dependency resolution into a Boolean Satisfiability Problem and solves it using advanced algorithms:

Core SAT Components:

  • DPLL Algorithm: Davis-Putnam-Logemann-Loveland with modern optimizations
  • CDCL: Conflict-Driven Clause Learning for improved performance
  • Two-Watched Literals: Efficient unit propagation scheme
  • VSIDS Heuristic: Variable State Independent Decaying Sum for decision making
  • First UIP Analysis: Unique Implication Point cut for optimal learned clauses

SAT Problem Construction:

  1. Each package version becomes a boolean variable
  2. Dependencies become implication clauses (A → B₁ ∨ B₂ ∨ ...)
  3. Version constraints become CNF clauses
  4. At-most-one constraints ensure single version selection
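A toy illustration of that encoding (hypothetical clause builders; the real solver types live in crates/resolver/src/sat/). Suppose package A depends on B, and B has two candidate versions b1 and b2. Variables are positive integers; a negative literal means "not selected":

```rust
// A literal: +v means variable v (a package version) is selected, -v means not.
type Clause = Vec<i32>;

// Dependency A -> B becomes the implication clause (not-A OR b1 OR b2).
fn dependency_clause(pkg: i32, candidates: &[i32]) -> Clause {
    let mut c = vec![-pkg];
    c.extend_from_slice(candidates);
    c
}

// At-most-one version of B: a pairwise clause (not-b1 OR not-b2) per pair.
fn at_most_one(candidates: &[i32]) -> Vec<Clause> {
    let mut clauses = Vec::new();
    for i in 0..candidates.len() {
        for j in (i + 1)..candidates.len() {
            clauses.push(vec![-candidates[i], -candidates[j]]);
        }
    }
    clauses
}
```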

Key Features:

  • Version Preference: Biases toward newer versions via VSIDS initialization
  • Conflict Learning: Learns from conflicts to prune search space
  • Non-chronological Backtracking: Jumps to relevant decision levels
  • Restart Strategy: Periodic restarts every 100 conflicts
  • Human-readable Explanations: Generates clear conflict messages

Core Types

#[derive(Clone, Debug)]
pub enum DepKind { Build, Runtime }

#[derive(Clone, Debug)]
pub struct DepEdge {
    pub name: String,
    pub spec: VersionSpec,      // e.g. ">=1.2.0,<2.0.0"
    pub kind: DepKind,
}

#[derive(Clone, Debug)]
pub struct ResolvedNode {
    pub name: String,
    pub version: Version,
    pub deps: Vec<DepEdge>,
}

Resolution Algorithm

  1. Translate to SAT: Convert dependency problem to boolean satisfiability
  2. DPLL Search: Use unit propagation and intelligent branching
  3. Conflict Analysis: Learn clauses from conflicts using first UIP
  4. Solution Extraction: Map satisfying assignment back to packages
  5. Topological Sort: Order packages respecting dependencies
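Step 5 can be sketched with Kahn's algorithm (a simplified stand-in: package ids are plain strings and each map entry lists a package's dependencies; the real implementation works over the resolved graph types):

```rust
use std::collections::{HashMap, VecDeque};

// Returns packages in install order: dependencies before dependents.
fn install_order(deps: &HashMap<&str, Vec<&str>>) -> Vec<String> {
    // in_degree counts unresolved dependencies per package.
    let mut in_degree: HashMap<&str, usize> =
        deps.iter().map(|(k, v)| (*k, v.len())).collect();
    // dependents[d] lists the packages waiting on d.
    let mut dependents: HashMap<&str, Vec<&str>> = HashMap::new();
    for (pkg, ds) in deps {
        for d in ds {
            dependents.entry(*d).or_default().push(*pkg);
        }
    }
    let mut ready: VecDeque<&str> = in_degree
        .iter()
        .filter(|(_, d)| **d == 0)
        .map(|(k, _)| *k)
        .collect();
    let mut order = Vec::new();
    while let Some(pkg) = ready.pop_front() {
        order.push(pkg.to_string());
        if let Some(parents) = dependents.get(pkg) {
            for parent in parents {
                let n = in_degree.get_mut(parent).unwrap();
                *n -= 1;
                if *n == 0 {
                    ready.push_back(*parent);
                }
            }
        }
    }
    // If order.len() < deps.len(), the graph had a cycle (detected upstream).
    order
}
```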

Parallel Execution

// Concurrent download/install with dependency ordering
struct NodeMeta {
    action: NodeAction,
    in_degree: AtomicUsize,      // Remaining dependencies
    parents: Vec<PackageId>,     // For decrement notification
}

// Key data structures
let graph: HashMap<PackageId, Arc<NodeMeta>>;
let inflight: DashMap<PackageId, JoinHandle<Result<()>>>;  // Deduplication (dashmap crate)
let ready_queue: SegQueue<PackageId>;                      // Lock-free queue (crossbeam)
let semaphore: Arc<Semaphore>;                            // Concurrency limit (tokio)

// Execution flow:
// 1. Push all nodes with in_degree == 0 to ready_queue
// 2. Workers pull from the queue, then download/extract the package
// 3. On completion, decrement each parent's in_degree
// 4. Push newly ready parents to the queue
// 5. Continue until the queue is empty

Dependencies: The install crate needs dashmap for concurrent deduplication and crossbeam for lock-free queues.

Install vs Build Behavior

During sps2 install:

  • Resolves runtime dependencies only
  • Downloads binary packages in parallel
  • Installs to main system state (/opt/pm/live/)
  • No symlink management (single-prefix design)
  • User must ensure /opt/pm/live/bin is in PATH

During sps2 build:

  • Resolves build dependencies from recipe
  • Downloads and installs build deps to /opt/pm/build/<pkg>/deps/
  • Sets up isolated environment (PATH, PKG_CONFIG_PATH, etc.)
  • Build deps are binary packages from repository
  • After build completes, deps directory is cleaned up
  • Only runtime deps are recorded in output .sp manifest

Performance Characteristics

  • Parallelism: Limited by graph width (number of packages with no pending deps)
  • Deduplication: Shared dependencies downloaded/installed only once
  • Early start: Packages begin installing the moment their deps are ready
  • Network utilization: Downloads overlap with extraction/linking
  • Typical speedup: 3-5x over serial installation on fast networks

Example Resolution

Installing: jq (depends on oniguruma)
            curl (depends on openssl, zlib)

Execution order:
1. [Parallel] Download oniguruma, openssl, zlib
2. [Parallel] Install oniguruma, openssl, zlib
3. [Parallel] Download jq, curl
4. [Parallel] Install jq, curl

For builds, same logic applies but to temporary build environment.

Error Handling Architecture

The errors crate provides fine-grained error types organized by domain:

crates/errors/src/
├── lib.rs       # Re-exports all error types
├── network.rs   # NetworkError (connection, timeout, etc.)
├── storage.rs   # StorageError (disk full, permissions, etc.)
├── state.rs     # StateError (invalid transitions, conflicts)
├── package.rs   # PackageError (corrupt, missing deps, etc.)
└── ...

Each error type:

  • Implements Clone where possible (avoid storing non-clonable types)
  • Uses #[derive(thiserror::Error)] for automatic Display/Error impl
  • Provides context via #[error("...")] attributes
  • Can be converted to a generic error for cross-crate boundaries

Example:

// In errors/src/network.rs
#[derive(Debug, Clone, thiserror::Error)]
pub enum NetworkError {
    #[error("connection timeout to {url}")]
    Timeout { url: String },

    #[error("download failed: {0}")]
    DownloadFailed(String),
}

Type Definitions Architecture

The types crate provides core data structures including version specifications:

// types/src/version.rs
#[derive(Debug, Clone)]
pub enum VersionSpec {
    Exact(Version),           // ==1.2.3
    GreaterEqual(Version),    // >=1.2.0
    LessEqual(Version),       // <=2.0.0
    Greater(Version),         // >1.0.0
    Less(Version),            // <2.0.0
    Compatible(Version),      // ~=1.2.0
    NotEqual(Version),        // !=1.5.0
    And(Box<VersionSpec>, Box<VersionSpec>), // Multiple constraints
}

impl FromStr for VersionSpec {
    // Parse "package>=1.2.0,<2.0.0,!=1.5.0" into constraint tree
}

impl VersionSpec {
    pub fn matches(&self, version: &Version) -> bool { ... }
}

Common types for events and operations:

// types/src/lib.rs
#[derive(Debug, Clone)]
pub struct PackageInfo {
    pub name: String,
    pub version: Version,
    pub description: Option<String>,
    pub installed: bool,
    pub size: u64,
}

#[derive(Debug, Clone)]
pub struct SearchResult {
    pub name: String,
    pub version: Version,
    pub description: String,
    pub homepage: Option<String>,
}

Event System and Async Architecture

Async Runtime

  • Full async/await from the ground up
  • Use tokio runtime with multi-threaded scheduler
  • All I/O operations must be async (tokio::fs, sqlx, etc.)
  • Use spawn_blocking sparingly for CPU-intensive work
  • Channels for cross-crate communication (via events crate)

Important: Use modern async crates when tokio doesn't provide the functionality:

  • Database: Use sqlx for SQLite operations (NOT rusqlite or blocking alternatives)
  • HTTP: Use reqwest with tokio runtime (NOT blocking HTTP crates)
  • Process spawning: Use tokio::process (NOT std::process)
  • File watching: Use notify with tokio integration
  • Only use sync/blocking crates when absolutely no async alternative exists

Event Communication

  • Use tokio::sync::mpsc for all async channels
  • Prefer UnboundedSender/UnboundedReceiver for event passing
  • The events crate should export type aliases:
    pub type EventSender = tokio::sync::mpsc::UnboundedSender<Event>;
    pub type EventReceiver = tokio::sync::mpsc::UnboundedReceiver<Event>;
  • All crates take an EventSender in their public APIs
  • Only the CLI should own the EventReceiver
  • Use tokio::select! for handling multiple channels
  • Consider tokio::sync::broadcast if multiple consumers needed
  • NO direct logging - No println!, eprintln!, info!, warn!, error! outside CLI
  • All output goes through events - the CLI decides how to display
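The ownership pattern above (many senders, one CLI-owned receiver) follows the standard mpsc shape. A stdlib sketch of the same one-way flow, with a stand-in Event enum (the real code uses tokio::sync::mpsc::unbounded_channel and awaits on the receiver):

```rust
use std::sync::mpsc;

// Stand-in for the Event enum; the real one lives in the events crate.
#[derive(Debug)]
enum Event {
    OperationStarted { operation: String },
    OperationCompleted { operation: String, success: bool },
}

// Worker crates only ever send; they never print.
fn run_operation(tx: &mpsc::Sender<Event>) {
    tx.send(Event::OperationStarted { operation: "list".into() }).unwrap();
    tx.send(Event::OperationCompleted { operation: "list".into(), success: true }).unwrap();
}

// Only the CLI owns the receiver and decides how to render each event.
fn drain(rx: mpsc::Receiver<Event>) -> Vec<String> {
    rx.into_iter().map(|e| format!("{e:?}")).collect()
}
```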

Core Event enum variants:

use crate::types::{PackageInfo, SearchResult, Version};

#[derive(Debug, Clone)]
pub enum Event {
    // Download events
    DownloadStarted { url: String, size: Option<u64> },
    DownloadProgress { url: String, bytes_downloaded: u64, total_bytes: u64 },
    DownloadCompleted { url: String },

    // Build events
    BuildStepStarted { package: String, step: String },
    BuildStepOutput { package: String, line: String },
    BuildStepCompleted { package: String, step: String },

    // State management
    StateTransition { from: Uuid, to: Uuid, operation: String },
    StateRollback { from: Uuid, to: Uuid },

    // Package operations
    PackageInstalling { name: String, version: Version },
    PackageInstalled { name: String, version: Version },
    PackageRemoving { name: String, version: Version },
    PackageRemoved { name: String, version: Version },

    // Resolution
    ResolvingDependencies { package: String },
    DependencyResolved { package: String, version: Version },

    // Command completion
    ListComplete { packages: Vec<PackageInfo> },
    SearchComplete { results: Vec<SearchResult> },

    // Errors and warnings
    Warning { message: String, context: Option<String> },
    Error { message: String, details: Option<String> },

    // Debug logging (when --debug enabled)
    DebugLog { message: String, context: HashMap<String, String> },

    // General progress
    OperationStarted { operation: String },
    OperationCompleted { operation: String, success: bool },
}

Crates that emit events (take EventSender):

  • net - Download progress, connection status
  • install - Installation steps, file operations
  • state - State transitions, rollback operations
  • builder - Build progress, compilation status
  • resolver - Dependency resolution progress
  • audit - CVE scan results
  • ops - High-level operation status

Execution Flow

Entry Point

The sps2 CLI application is the sole entry point and manages all user interaction:

  • Parses command-line arguments
  • Initializes the tokio runtime
  • Creates event channels for async communication
  • Owns the EventReceiver and handles all display/output
  • Delegates operations to the ops crate

Command Flow Architecture

Flow sequence:

  1. User invokes command
  2. sps2 CLI parses arguments
  3. CLI creates event channel
  4. CLI calls ops crate with EventSender
  5. ops executes or delegates to specialized crates
  6. Crates emit events through EventSender
  7. CLI receives events via EventReceiver
  8. CLI displays output to user

Communication pattern:

  • One-way event flow: crates → EventSender → EventReceiver → CLI
  • No direct output from crates (no println/logging)
  • All user feedback goes through event channel

Operations Hierarchy

The ops crate serves as the orchestration layer with a key architectural distinction:

  • Small operations (list, info, search, etc.): Implementation logic lives IN the ops crate
  • Large operations (install, build, etc.): ops just delegates to specialized crates

This keeps complex workflows isolated in their dedicated crates while simple operations don't need entire crates.

Operations Context

pub struct OpsCtx<'a> {
    pub store: &'a Store,
    pub state: &'a StateManager,
    pub index: &'a Index,
    pub net:   &'a NetClient,
    pub resolver: &'a Resolver,
    pub builder: &'a Builder,
    pub tx: EventSender,
}

Command Implementations

Important Architecture Rule:

  • Small operations: Logic lives in ops crate, which calls into service crates for specific functionality
  • Large operations: ops merely delegates to specialized crates that contain the full implementation

Command | Type | Implementation | Calls into crates
reposync | Small | Logic in ops | net (download), index (update)
list | Small | Logic in ops | state (query packages)
info | Small | Logic in ops | index (details), state (status)
search | Small | Logic in ops | index (search)
cleanup | Small | Logic in ops | state (find orphans), store (GC)
install | Large | Delegates to install crate | resolver (runtime deps), net (downloads)
update | Large | Delegates to install crate | resolver (constraints), net (downloads)
upgrade | Large | Delegates to install crate | resolver (latest versions), net (downloads)
uninstall | Large | Delegates to install crate | state (removes package and orphaned deps)
build | Large | Delegates to builder crate | resolver (build deps), install (dep setup)
rollback | Small | Logic in ops | state (restore previous state)
history | Small | Logic in ops | state (list all states)
check-health | Small | Logic in ops | state (verify integrity), store (check refs)

check-health command specification:

  • Input: No arguments required
  • Operation: Verifies system integrity by checking:
    • Database consistency (all referenced packages exist in store)
    • Store integrity (all store entries have valid manifests)
    • State directory structure (no orphaned staging dirs)
    • Permissions on critical paths
  • Output: Table showing component status, or JSON with --json flag
  • Exit code: 0 if healthy, 1 if issues found
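The exit-code rule reduces to a one-liner (hypothetical ComponentStatus type for illustration; the real checks live in the state and store crates):

```rust
// One row of the health report table.
struct ComponentStatus {
    component: &'static str,
    healthy: bool,
}

// Exit 0 only if every checked component is healthy, otherwise 1.
fn exit_code(report: &[ComponentStatus]) -> i32 {
    if report.iter().all(|c| c.healthy) { 0 } else { 1 }
}
```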

Example of small operation (in ops):

// ops/src/list.rs
use crate::types::PackageInfo;

pub async fn list(ctx: &OpsCtx) -> Result<Vec<PackageInfo>> {
    // Logic lives here in ops
    let installed = ctx.state.get_installed_packages().await?;
    let enriched = installed.into_iter()
        .map(|p| enrich_with_metadata(p, ctx))
        .collect::<Result<Vec<_>>>()?;
    ctx.tx.send(Event::ListComplete { packages: enriched.clone() })?;
    Ok(enriched)
}

Example of large operation (in ops):

// ops/src/install.rs
pub async fn install(ctx: &OpsCtx, package_specs: &[String]) -> Result<OpReport> {
    // Determine if specs are local files or package names
    let install_requests = package_specs.iter()
        .map(|s| {
            if s.ends_with(".sp") && std::path::Path::new(s).exists() {
                Ok(InstallRequest::LocalFile(s.to_string()))
            } else {
                // Parse version constraints for remote packages
                let spec = PackageSpec::parse(s)?;
                Ok(InstallRequest::Remote(spec))
            }
        })
        .collect::<Result<Vec<_>>>()?;

    // Delegate to specialized crate (binary install only)
    crate::install::execute_install(
        install_requests,
        ctx.resolver,
        ctx.net,
        ctx.state,
        ctx.store,
        ctx.tx.clone()
    ).await
}

Update vs Upgrade Distinction

  • update: Only bumps compatible versions (respects ~= semantics)
    • Package with foo~=1.2.0 can update to 1.2.9 but not 1.3.0
    • Package with foo>=1.0,<2.0 stays within those bounds
  • upgrade: Allows major version jumps (ignores upper bounds)
    • Package with foo~=1.2.0 can upgrade to 2.0.0 or higher
    • Still respects explicit != exclusions for known bad versions
  • Both return an OpReport that can be rendered as table, JSON, or plain text
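The distinction above can be modeled as a single candidate filter; `Constraint` and its fields are illustrative simplifications, not the real resolver types:

```rust
// Hypothetical sketch of the update/upgrade candidate filter. A constraint is
// reduced to: lower bound (inclusive), optional upper bound (exclusive), and
// explicit `!=` exclusions.
type Version = (u32, u32, u32);

struct Constraint {
    lower: Version,          // >= lower
    upper: Option<Version>,  // < upper; ignored by `upgrade`
    excluded: Vec<Version>,  // != entries; honored by both update and upgrade
}

fn candidate_ok(v: Version, c: &Constraint, upgrade: bool) -> bool {
    v >= c.lower
        && (upgrade || c.upper.map_or(true, |u| v < u))
        && !c.excluded.contains(&v)
}
```

For `foo~=1.2.0` the constraint would be lower `(1,2,0)`, upper `(1,3,0)`; `update` passes `upgrade = false`, `upgrade` passes `true`.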

Event Flow Pattern

  1. User invokes command:
    • sps2 install package - Install from repository
    • sps2 install "package>=1.2.0,<2.0.0" - Install with version constraints
    • sps2 install ./package-1.2.0-1.arm64.sp - Install local .sp file
  2. CLI creates event channel and passes EventSender to ops
  3. ops::install() called with package spec and event sender
  4. ops determines if local file or remote package
  5. For remote: delegates to install::install() with parsed version constraints
  6. For local: delegates to install::install_local() with file path
  7. Each crate sends progress/status events:
    sender.send(Event::DownloadProgress {
        url: download_url,
        bytes_downloaded: 1024000,
        total_bytes: 5242880
    })?;
    sender.send(Event::StateTransition { from, to })?;
  8. CLI receives events and updates display accordingly
  9. Final success/error event sent back to CLI

Note: Install is for binary packages only. To build from source, use sps2 build recipe.star which produces a .sp file.

CLI Display Responsibilities

  • Progress bars for downloads
  • Status messages for operations
  • Error formatting with helpful context
  • Confirmation prompts when needed
  • NO direct println!/eprintln! outside of CLI
  • Machine-readable output modes (--json flag)
  • Parse and validate version constraints before passing to ops
  • PATH reminder: Show hint to add /opt/pm/live/bin to PATH after first install

CLI usage examples:

  • sps2 install jq - Install latest binary package from repository
  • sps2 install "jq==1.7" - Install exact version from repository
  • sps2 install "jq>=1.6,<2.0" - Install with constraints from repository
  • sps2 install ./jq-1.7-1.arm64.sp - Install from local .sp file
  • sps2 build jq.star - Build package from recipe (produces .sp file)
  • sps2 build --network jq.star - Build with network access enabled
  • sps2 update - Update all packages respecting constraints
  • sps2 upgrade jq - Upgrade to latest, ignoring upper bounds
  • sps2 rollback - Revert to previous state
  • sps2 rollback <state-id> - Revert to specific state
  • sps2 history - List all states with timestamps
  • sps2 check-health - Verify system integrity

Note: Ensure /opt/pm/live/bin is in your PATH after installation.

Global CLI flags:

  • --json - Output in JSON format (for all commands)
  • --debug - Enable debug logging to /opt/pm/logs/
  • --color <auto|always|never> - Override color output
  • --config <path> - Use alternate config file

Command-specific flags:

  • sps2 build --network - Allow network access during build
  • sps2 build --jobs <n> - Override parallel build jobs (0=auto)

Operation Lifecycle

  1. Validation Phase - Check permissions, validate arguments
  2. Planning Phase - Resolve dependencies, check conflicts
  3. Execution Phase - Perform actual operations
  4. Commit Phase - Atomic state transitions
  5. Cleanup Phase - Remove temporary files, update caches

Each phase emits appropriate events for CLI feedback.
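The phase sequence and its event emission might be sketched as follows; the `Phase` and `Event` names are assumptions, not the actual events crate:

```rust
use std::sync::mpsc;

// Hypothetical sketch: each lifecycle phase emits a start event so the CLI
// can render progress. `Event::PhaseStarted` is an assumed variant name.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Phase { Validation, Planning, Execution, Commit, Cleanup }

#[derive(Debug, PartialEq)]
enum Event { PhaseStarted(Phase) }

fn run_lifecycle(tx: &mpsc::Sender<Event>) {
    for phase in [Phase::Validation, Phase::Planning, Phase::Execution, Phase::Commit, Phase::Cleanup] {
        tx.send(Event::PhaseStarted(phase)).expect("event channel closed");
        // ... phase work happens here ...
    }
}
```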

Core Systems

Configuration Management

Configuration File

  • Location: ~/.config/sps2/config.toml (follows XDG Base Directory spec)
  • Format: TOML for consistency with Rust ecosystem
  • Precedence: CLI flags > Environment variables > Config file > Defaults
  • Defaults location: Hard-coded in config crate via impl Default

Default values (in code):

impl Default for Config {
    fn default() -> Self {
        Self {
            general: GeneralConfig {
                default_output: OutputFormat::Tty,
                color: ColorChoice::Auto,
                parallel_downloads: 4,
            },
            build: BuildConfig {
                build_jobs: 0,  // 0 = num_cpus::get()
                network_access: false,
                compression_level: "balanced".to_string(),
            },
            security: SecurityConfig {
                verify_signatures: true,
                allow_unsigned: false,
                index_max_age_days: 7,
            },
            state: StateConfig {
                retention_count: 10,  // Keep last 10 states
                retention_days: 30,   // Or newer than 30 days, whichever keeps more
            },
        }
    }
}

Example config.toml:

[general]
default_output = "tty"  # Options: plain, tty, json
color = "auto"          # Options: always, auto, never
parallel_downloads = 4

[build]
build_jobs = 0          # 0 = auto-detect from CPU count
network_access = false  # Allow network during builds
compression_level = "balanced"  # Options: "fast", "balanced", "maximum", or "1"-"22"

[security]
verify_signatures = true
allow_unsigned = false
index_max_age_days = 7

[state]
retention_count = 10    # Keep last N states
retention_days = 30     # Keep states newer than N days

[paths]
# Override default paths if needed (usually not recommended)
# store_path = "/opt/pm/store"
# state_path = "/opt/pm/states"

Environment Variables

  • SPS2_OUTPUT - Override output format
  • SPS2_COLOR - Override color setting
  • SPS2_DEBUG - Enable debug logging
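The precedence chain (CLI flags > environment variables > config file > defaults) can be sketched for a single setting; the function name and plumbing are illustrative:

```rust
use std::env;

// Hypothetical sketch of the precedence chain for one setting:
// CLI flag > SPS2_OUTPUT env var > config file value > built-in default.
fn resolve_output(cli_flag: Option<&str>, file_value: Option<&str>) -> String {
    cli_flag
        .map(str::to_string)
        .or_else(|| env::var("SPS2_OUTPUT").ok())
        .or_else(|| file_value.map(str::to_string))
        .unwrap_or_else(|| "tty".to_string())
}
```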

Atomic Update System

Filesystem Layout

/opt/pm/:
store/:                 # (A) Package Store - content-addressed storage
store/<hash>/:          # Immutable package contents
store/<hash>/files/     # Actual files
store/<hash>/blobs/     # Binary artifacts
store/<hash>/manifest   # Package metadata

states/:                # State directories
states/<uuid>/:         # (B) Archived states (previous roots)
states/staging-<uuid>/: # (C) Staging state (APFS clone)

live/:                  # (D) Active prefix (current root)
live/bin/:              # All installed binaries (add to PATH)

state.sqlite            # (F) State database with WAL
state.sqlite-wal        # SQLite write-ahead log
state.sqlite-shm        # SQLite shared memory

logs/:                  # Debug logs (when --debug is used)
logs/sps2-<timestamp>.jsonl  # Structured JSON logs
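The store layout above can be expressed as a small path helper; the function and its digest argument are illustrative:

```rust
use std::path::PathBuf;

// Hypothetical sketch: derive the store layout for a package from its BLAKE3
// digest (hex string). Paths mirror the layout described above.
fn store_paths(hash_hex: &str) -> (PathBuf, PathBuf, PathBuf) {
    let root = PathBuf::from("/opt/pm/store").join(hash_hex);
    (root.join("files"), root.join("blobs"), root.join("manifest"))
}
```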

State Management Architecture

Components:

  1. Package Store (A)

    • Content-addressed storage using BLAKE3 hashes
    • Immutable files - never modified after creation
    • Hard-linked into state directories
    • Garbage collected based on reference counting
  2. State Directories (B, C, D)

    • Each state is a complete root filesystem
    • Contains hard links to package store
    • Archived states kept for rollback
    • Staging state is APFS clone of current state
  3. SQLite State Database (F)

    • Path: /opt/pm/state.sqlite (not in user's $HOME)
    • WAL mode for consistency
    • Tracks package references
    • Stores active state pointer
    • Records state transition history
    • Must use sqlx for all database operations (async-first)
    • Migrations: Embedded using sqlx migrate with versioned SQL files
    • Schema version tracked in database for safe upgrades
  4. Repo Manifest Cache (G)

    • Immutable binary blobs
    • Read-only lookups during resolution
    • Updated via reposync operation

Atomic Update Process

Installation Flow:

// 1. Create staging directory as APFS clone
let staging_id = Uuid::new_v4();
let staging_path = format!("/opt/pm/states/staging-{}", staging_id);
apfs_clonefile("/opt/pm/live", &staging_path)?;

// 2. Modify staging directory
// - Add new package hard links from store
// - Remove old package hard links
// - Update metadata files

// 3. Begin database transaction
let mut tx = state_db.begin().await?;

// 4. Record new state in database
sqlx::query("INSERT INTO states (id, parent, timestamp) VALUES (?, ?, ?)")
    .bind(&staging_id)
    .bind(&current_state_id)
    .bind(&now)
    .execute(&mut tx).await?;

// 5. Update package references
sqlx::query("INSERT INTO package_refs (state_id, package_hash) VALUES (?, ?)")
    .bind(&staging_id)
    .bind(&package_hash)
    .execute(&mut tx).await?;

// 6. Atomic filesystem swap
rename_swap(&staging_path, "/opt/pm/live")?;

// 7. Update active state pointer
sqlx::query("UPDATE active_state SET id = ?")
    .bind(&staging_id)
    .execute(&mut tx).await?;

// 8. Commit transaction
tx.commit().await?;

// 9. Archive previous state
rename(&old_live_path, &format!("/opt/pm/states/{}", old_state_id))?;

Rollback Process:

// 1. Find target state
let target_state = sqlx::query_as::<_, (String,)>(
    "SELECT path FROM states WHERE id = ?"
)
.bind(&rollback_id)
.fetch_one(&state_db).await?;

// 2. Atomic swap
rename_swap(&target_state.0, "/opt/pm/live")?;

// 3. Update database
sqlx::query("UPDATE active_state SET id = ?")
    .bind(&rollback_id)
    .execute(&state_db).await?;

Key Safety Properties

  1. Atomicity: All updates use the macOS renamex_np syscall with the RENAME_SWAP flag
  2. Consistency: WAL-mode SQLite ensures database consistency
  3. Isolation: Staging directory invisible until swap
  4. Durability: Previous states preserved for rollback

APFS-Specific Optimizations

  • Use clonefile() for instant, space-efficient copies
  • Hard links for deduplication within states
  • Set compression flags on /opt/pm/store/
  • Leverage APFS snapshots for system-wide backups

Garbage Collection

  • Reference counting in SQLite for store objects
  • Configurable retention policy for old states
  • Default retention: Keep last 10 states AND states from last 30 days (whichever is more)
  • Never delete currently referenced packages
  • Clean orphaned staging directories on startup
  • GC schedule:
    • Runs automatically after install/uninstall/rollback operations
    • Also runs on sps2 startup if last GC was >7 days ago
    • Startup GC runs after state DB is opened but before any operation planning
    • Manual trigger via sps2 cleanup
    • No background daemon - GC is always user-initiated or operation-triggered
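The retention rule could be sketched as follows; state records are simplified to `(id, age_in_days)` pairs sorted newest first, which is an illustrative stand-in for the real database rows:

```rust
// Hypothetical sketch of the retention rule: a state survives GC if it is one
// of the `retention_count` most recent states OR is newer than
// `retention_days` — i.e. only states failing both tests are deleted.
fn states_to_delete(states: &[(u64, u64)], retention_count: usize, retention_days: u64) -> Vec<u64> {
    states
        .iter()
        .enumerate()
        .filter(|(i, (_, age))| *i >= retention_count && *age > retention_days)
        .map(|(_, (id, _))| *id)
        .collect()
}
```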

Build System

The builder crate provides a production-ready, enterprise-grade build system with comprehensive features for secure, reproducible software packaging.

Supported Build Systems

The builder supports eight major build systems; the last three are implemented but not yet exposed in the Starlark API:

  1. Autotools - ctx.autotools()

    • Configure/make/make install workflow
    • Cross-compilation support
    • Out-of-source builds
    • VPATH builds
  2. CMake - ctx.cmake()

    • CMake 3.x with Ninja/Make generators
    • Toolchain file generation
    • CTest integration
    • Cache variable management
  3. Meson - ctx.meson()

    • Full Meson/Ninja workflow
    • Cross file generation
    • Subproject support
    • Wrap dependency management
  4. Cargo - ctx.cargo()

    • Release/debug builds
    • Feature flag management
    • Vendored dependencies
    • sccache integration
  5. Make - ctx.make()

    • Parallel builds with -j
    • Custom targets
    • Environment variable control
  6. Go - Implemented but not yet exposed in Starlark API

    • Go modules support
    • Vendoring for offline builds
    • Cross-compilation with GOOS/GOARCH
  7. Python - Implemented but not yet exposed in Starlark API

    • PEP 517/518 compliance
    • Multiple backends (setuptools, poetry, flit, hatch, pdm, maturin)
    • Virtual environment isolation
  8. Node.js - Implemented but not yet exposed in Starlark API

    • npm, yarn, pnpm support
    • Offline builds with vendored deps
    • Build script execution

Production Features

Environment Isolation & Hermetic Builds:

  • Complete environment variable whitelisting
  • Private /tmp directory per build
  • Network isolation via proxy settings
  • Clean PATH management
  • Locale normalization (C.UTF-8)
  • macOS sandbox-exec integration

Quality Assurance System:

  • Linters: clippy, clang-tidy, ESLint, pylint, shellcheck
  • Security Scanners: cargo-audit, npm audit, Trivy
  • Policy Enforcement: License compliance, size limits, permissions
  • Configurable severity levels with CI/CD integration

Advanced Features:

  • SBOM Generation: SPDX and CycloneDX formats via Syft
  • Cross-compilation: Full toolchain management and sysroot support
  • Build Caching: ccache/sccache integration, incremental builds
  • Monitoring: Real-time metrics, OpenTelemetry tracing
  • Package Signing: Minisign integration with detached signatures

Build Architecture

Build pipeline flow:

  1. sps2 build recipe.star command invoked
  2. Sandboxed Starlark VM loads and validates recipe
  3. Recipe calls Builder API methods (fetch, cmake, etc.)
  4. Builder executes in hermetic environment with:
    • Build dependencies in isolated prefix
    • Resource limits and sandboxing
    • Quality checks and SBOM generation
  5. Package created and saved as .sp file
  6. Output: <name>-<version>-<revision>.<arch>.sp

Important: sps2 build only produces packages, it does NOT install them. This follows Unix package manager conventions where building and installing are separate operations.

Starlark Recipe Format

Build recipes are written in Starlark (Python-like) with a sandboxed, deterministic API:

# Example 1: Simple C program with autotools
def metadata():
    """Return package metadata as a dictionary."""
    return {
        "name": "curl",
        "version": "8.5.0",
        "description": "Command line tool for transferring data with URLs",
        "homepage": "https://curl.se",
        "license": "MIT",
        "depends": ["openssl>=3.0.0", "zlib~=1.2.11"],
        "build_depends": ["pkg-config>=0.29", "autoconf>=2.71", "automake>=1.16"]
    }

def build(ctx):
    """Build curl using autotools."""
    # Fetch source archive
    fetch(ctx, "https://curl.se/download/curl-8.5.0.tar.gz")
    
    # Configure with autotools
    configure(ctx, [
        "--prefix=" + ctx.PREFIX,
        "--with-openssl",
        "--with-zlib",
        "--enable-http",
        "--enable-ftp",
        "--disable-ldap"
    ])
    
    # Build with parallel jobs
    make(ctx, ["-j" + str(ctx.JOBS)])
    
    # Run tests
    make(ctx, ["test"])
    
    # Install to staging
    make(ctx, ["install", "DESTDIR=stage"])

# Example 2: CMake-based C++ project
def metadata():
    return {
        "name": "fmt",
        "version": "10.2.1",
        "description": "Fast and safe formatting library",
        "homepage": "https://fmt.dev",
        "license": "MIT",
        "depends": [],
        "build_depends": ["cmake>=3.16", "ninja>=1.10"]
    }

def build(ctx):
    """Build fmt library using CMake."""
    fetch(ctx, "https://github.com/fmtlib/fmt/archive/10.2.1.tar.gz")
    
    # Configure with CMake
    cmake(ctx, [
        "-DCMAKE_BUILD_TYPE=Release",
        "-DCMAKE_INSTALL_PREFIX=" + ctx.PREFIX,
        "-DFMT_TEST=ON",
        "-DBUILD_SHARED_LIBS=ON",
        "-G", "Ninja"
    ])
    
    # Build and test
    command(ctx, "ninja -j" + str(ctx.JOBS))
    command(ctx, "ninja test")
    command(ctx, "DESTDIR=stage ninja install")

# Example 3: Rust project with cargo
def metadata():
    return {
        "name": "ripgrep",
        "version": "14.1.0",
        "description": "Recursively search directories for a regex pattern",
        "homepage": "https://github.com/BurntSushi/ripgrep",
        "license": "MIT",
        "depends": [],
        "build_depends": ["rust>=1.72.0"]
    }

def build(ctx):
    """Build ripgrep using cargo."""
    fetch(ctx, "https://github.com/BurntSushi/ripgrep/archive/14.1.0.tar.gz")
    
    # Build with cargo in release mode
    cargo(ctx, ["build", "--release", "--locked"])
    
    # Run tests
    cargo(ctx, ["test", "--release"])
    
    # Install manually since cargo install rebuilds
    command(ctx, "mkdir -p stage" + ctx.PREFIX + "/bin")
    command(ctx, "cp target/release/rg stage" + ctx.PREFIX + "/bin/")
    
    # Install completions
    command(ctx, "mkdir -p stage" + ctx.PREFIX + "/share/bash-completion/completions")
    command(ctx, "cp complete/rg.bash stage" + ctx.PREFIX + "/share/bash-completion/completions/rg")

# Example 4: Python package (when API is exposed)
def metadata():
    return {
        "name": "black",
        "version": "24.1.0",
        "description": "The uncompromising Python code formatter",
        "homepage": "https://black.readthedocs.io",
        "license": "MIT",
        "depends": ["python3>=3.8"],
        "build_depends": ["python3>=3.8", "pip>=21.0"]
    }

def build(ctx):
    """Build black using Python build system."""
    fetch(ctx, "https://github.com/psf/black/archive/24.1.0.tar.gz")
    
    # For now, use command until python build system is exposed
    command(ctx, "python3 -m venv venv")
    command(ctx, "source venv/bin/activate && pip install --upgrade pip wheel")
    command(ctx, "source venv/bin/activate && pip install .")
    command(ctx, "source venv/bin/activate && python -m pytest tests/")
    
    # Install to staging
    command(ctx, "mkdir -p stage" + ctx.PREFIX)
    command(ctx, "source venv/bin/activate && pip install --prefix=stage" + ctx.PREFIX + " .")

Version specifiers (Python-style):

  • ==1.2.3 - Exact version match
  • >=1.2.0 - Minimum version (inclusive)
  • <=2.0.0 - Maximum version (inclusive)
  • >1.0.0 - Greater than (exclusive)
  • <2.0.0 - Less than (exclusive)
  • ~=1.2.0 - Compatible release (>=1.2.0, <1.3.0)
  • ~=1.2 - Compatible release (>=1.2.0, <2.0.0)
  • !=1.5.0 - Exclude specific version
  • >=1.2,<2.0,!=1.5.0 - Multiple constraints (comma-separated)

Compatible release (~=) explanation:

  • ~=1.2.3 means >=1.2.3, <1.3.0 (patch updates only)
  • ~=1.2 means >=1.2.0, <2.0.0 (minor updates allowed)
  • ~=1 means >=1.0.0, <2.0.0 (major version pinned)
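The `~=` expansion above can be sketched as a small helper (illustrative, not the actual resolver code); a specifier is modeled as the list of components that actually appeared:

```rust
// Sketch of the `~=` expansion: the lower bound is the specifier itself; for
// the exclusive upper bound, drop the last given component and bump the one
// before it (or bump the sole component if only one was given).
fn compatible_release(spec: &[u32]) -> (Vec<u32>, Vec<u32>) {
    let lower = spec.to_vec();
    let cut = spec.len().saturating_sub(1).max(1);
    let mut upper = spec[..cut].to_vec();
    *upper.last_mut().unwrap() += 1;
    (lower, upper) // satisfied versions: >= lower, < upper
}
```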

Dependency handling:

  • depends (metadata key): Runtime dependencies that must be installed with the package
  • build_depends (metadata key): Build-time only dependencies, available in the build environment
  • Build deps are automatically set up in PATH/PKG_CONFIG_PATH during build
  • Only runtime deps are recorded in the final package manifest
  • Build deps are never installed on end-user systems
  • Dependencies are specified as strings with optional version constraints

Sandboxing controls:

  • Max operations: 50,000,000 (prevent infinite loops)
  • Max memory: 64 MiB
  • No filesystem access except through Builder API
  • No network access except through fetch()
  • No environment variables or exec()

Complete Starlark API Reference

Context Attributes:

  • ctx.NAME - Package name from metadata (read-only)
  • ctx.VERSION - Package version from metadata (read-only)
  • ctx.PREFIX - Installation prefix, e.g. /opt/pm/live (read-only)
  • ctx.JOBS - Number of parallel build jobs (read-only)

Build System Functions:

Function Description Example
fetch(ctx, url, hash?) Download & extract source archive fetch(ctx, "https://example.com/pkg-1.0.tar.gz")
configure(ctx, args) Run configure script configure(ctx, ["--prefix=" + ctx.PREFIX, "--with-ssl"])
make(ctx, args) Run make with arguments make(ctx, ["-j" + str(ctx.JOBS), "test"])
autotools(ctx, args) Configure + make + make install autotools(ctx, ["--enable-shared"])
cmake(ctx, args) CMake configuration cmake(ctx, ["-DCMAKE_BUILD_TYPE=Release", "-GNinja"])
meson(ctx, args) Meson build setup meson(ctx, ["--buildtype=release"])
cargo(ctx, args) Rust cargo commands cargo(ctx, ["build", "--release"])

Utility Functions:

Function Description Example
apply_patch(ctx, path) Apply a patch file apply_patch(ctx, "fix-build.patch")
command(ctx, cmd) Run arbitrary shell command command(ctx, "mkdir -p " + ctx.PREFIX + "/share")
install(ctx) Finalize package creation install(ctx)

Advanced Features:

Function Description Example
detect_build_system(ctx) Auto-detect build system detect_build_system(ctx)
set_build_system(ctx, name) Override detected system set_build_system(ctx, "cmake")
enable_feature(ctx, name) Enable build feature enable_feature(ctx, "ssl")
disable_feature(ctx, name) Disable build feature disable_feature(ctx, "tests")
set_parallelism(ctx, n) Override job count set_parallelism(ctx, 4)
set_target(ctx, triple) Cross-compilation target set_target(ctx, "aarch64-linux-gnu")
with_features(ctx, features, fn) Conditional execution with_features(ctx, ["ssl"], lambda: configure(ctx, ["--with-ssl"]))
parallel_steps(ctx, fn) Parallel execution parallel_steps(ctx, lambda: [make(ctx, ["docs"]), make(ctx, ["tests"])])

Build environment setup:

  • Build dependencies are automatically installed before build() runs
  • Build deps are downloaded as binary packages from the repository
  • PATH includes all build deps' bin directories
  • PKG_CONFIG_PATH set up for all build deps
  • CFLAGS/LDFLAGS configured for proper linking
  • Build deps are NOT included in final package

Build Isolation

  • Build prefix: /opt/pm/build/<pkg>/<ver>/
  • Build deps prefix: /opt/pm/build/<pkg>/<ver>/deps/
  • Staging directory: /opt/pm/build/<pkg>/<ver>/stage/
  • Final installation: Content-addressed in /opt/pm/store/<hash>/
  • Build dependencies installed to isolated deps/ directory
  • Environment variables set to use build deps (PATH, PKG_CONFIG_PATH, etc.)
  • Sandbox model: $PREFIX isolation only (no chroot/container)
  • Network policy: Disabled by default during builds (configurable via config.toml)
  • Build environment:
    • Clean environment variables (minimal passthrough)
    • No access to user's home directory
    • Restricted to build prefix only
    • Network blocked unless explicitly enabled in recipe
  • Relocatability scan to detect any hardcoded absolute paths
  • Build failures if absolute paths found (ensures portability)
  • Build deps are cleared after successful build (not included in package)
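The relocatability scan might be sketched as a plain byte search over the staging tree; a production scan would also inspect Mach-O load commands, and the names here are illustrative:

```rust
use std::fs;
use std::path::{Path, PathBuf};

// Hypothetical sketch of the relocatability scan: walk the staging directory
// and flag any file whose bytes contain the build prefix. This only does a
// byte search; it does not parse binaries.
fn find_hardcoded_paths(stage: &Path, build_prefix: &[u8]) -> std::io::Result<Vec<PathBuf>> {
    let mut offenders = Vec::new();
    for entry in fs::read_dir(stage)? {
        let path = entry?.path();
        if path.is_dir() {
            offenders.extend(find_hardcoded_paths(&path, build_prefix)?);
        } else {
            let bytes = fs::read(&path)?;
            if bytes.windows(build_prefix.len()).any(|w| w == build_prefix) {
                offenders.push(path);
            }
        }
    }
    Ok(offenders)
}
```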

Integration with Atomic Updates

  1. Build Phase: Package built in isolated /opt/pm/build/ prefix with build deps, produces .sp file
  2. Distribution: .sp file uploaded to CDN/GitHub Releases
  3. Install Phase: User downloads .sp file (or provides local path)
  4. Store Phase: Package contents extracted to content-addressed store
  5. Link Phase: Store contents hard-linked into state directories with runtime deps
  6. Activation: Atomic rename makes new state live

Key point: Building and installing are completely separate operations. Users typically only install pre-built binary packages. Building from source is only needed for package maintainers or custom packages.

Package Format

.sp File Structure

Component Format Purpose
Payload deterministic tar (sorted entries, fixed mtimes) | zstd -19 Reproducible compression
manifest.toml TOML in archive root Name, version, deps, hashes
sbom.spdx.json SPDX 3.0 JSON Primary SBOM format
sbom.cdx.json CycloneDX 1.6 JSON (optional) Secondary SBOM for compatibility
Signature Detached .minisig Minisign signature over all files
Filename <name>-<ver>-<rev>.<arch>.sp Unique identification

manifest.toml structure:

[package]
name = "jq"
version = "1.7"
revision = 1
arch = "arm64"

[dependencies]
# Runtime dependencies - required to run the package
runtime = [
    "oniguruma==6.9.8",
    "libc++~=16.0.0"
]
# Build dependencies - only needed during compilation
build = [
    "autoconf>=2.71",
    "automake~=1.16.0",
    "libtool==2.4.7",
    "pkg-config>=0.29.2"
]

[sbom]
spdx = "blake3:4fa5..."
cyclonedx = "blake3:31d2..."  # optional

SBOM Generation (Built from Day 1)

  • Generator: Syft ≥ 1.4 (deterministic, supports both formats)
  • When: After install() completes, before packaging
  • Verification: Re-run to ensure deterministic output
  • Coverage: All files in staging directory
  • Exclusions: Debug symbols (*.dSYM), configurable per recipe
  • Dependency tracking: SBOMs include both runtime and build dependencies with clear labeling

Builder API addition:

ctx.auto_sbom(True)  # Enable SBOM generation (default: True)
ctx.sbom_excludes(["*.pdb", "*.dSYM", "*.a", "*.la"])  # Exclude patterns (static libs added)

Repository Index Format

{
  "version": 1,
  "minimum_client": "0.1.0",
  "timestamp": "2025-05-29T12:00:00Z",
  "packages": {
    "jq": {
      "versions": {
        "1.7": {
          "revision": 1,
          "arch": "arm64",
          "blake3": "...",
          "download_url": "https://...",
          "minisig_url": "https://...",
          "dependencies": {
            "runtime": ["oniguruma==6.9.8", "libc++~=16.0.0"],
            "build": ["autoconf>=2.71", "automake~=1.16.0"]
          },
          "sbom": {
            "spdx": {
              "url": "https://.../jq-1.7-1.arm64.sbom.spdx.json",
              "blake3": "4fa5..."
            },
            "cyclonedx": {
              "url": "https://.../jq-1.7-1.arm64.sbom.cdx.json",
              "blake3": "31d2..."
            }
          }
        }
      }
    }
  }
}

Index version policy:

  • If index.version > client_supported_version: Hard fail with clear error message
  • Users must upgrade sps2 to use newer index formats
  • If client_version < minimum_client: Warn but continue (soft deprecation)
  • Cache last known good index locally for offline use
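The policy above can be sketched as follows; version parsing is simplified to dotted triples and the enum names are assumptions:

```rust
// Hypothetical sketch of the index compatibility rules.
fn parse_semver(s: &str) -> Option<(u32, u32, u32)> {
    let mut it = s.split('.').map(|p| p.parse::<u32>().ok());
    Some((it.next()??, it.next()??, it.next()??))
}

#[derive(Debug, PartialEq)]
enum IndexCheck { Ok, WarnOutdatedClient, FailUnsupportedFormat }

fn check_index(index_version: u32, supported: u32, client: &str, minimum_client: &str) -> IndexCheck {
    if index_version > supported {
        IndexCheck::FailUnsupportedFormat // hard fail: user must upgrade sps2
    } else if parse_semver(client) < parse_semver(minimum_client) {
        IndexCheck::WarnOutdatedClient // soft deprecation: warn but continue
    } else {
        IndexCheck::Ok
    }
}
```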

Dependency types:

  • runtime: Required for the package to function after installation
  • build: Only needed during package compilation (not installed with package)
  • Build deps are automatically available in build environment but not linked to final package
  • Both runtime and build deps are satisfied by binary packages from the repository

Security Model

  • Minisign for package signatures (small attack surface)
  • BLAKE3 for content verification and hashing
  • SBOM for supply chain transparency
  • Codesigning for macOS Gatekeeper
  • Deterministic builds for reproducibility

Key Distribution & Trust Root

  • Bootstrap key: Embedded in CLI binary at compile time
  • Key storage: /opt/pm/keys/ directory with trusted public keys
  • Key format: Minisign public key files (.pub)
  • Rotation process:
    1. New key signed by old key (creates trust chain)
    2. Rotation announcement published as keys.json at repository root
    3. Both keys valid during transition period (30 days default)
    4. Old key expires after grace period
  • Rotation file location: https://cdn.sps.io/keys.json (repository root, next to index.json)
  • Rotation file format (keys.json):
    {
      "current": {
        "id": "RWRzQJ6...",
        "pubkey": "untrusted comment: ...\nRWRzQJ6...",
        "valid_from": "2025-01-01T00:00:00Z"
      },
      "rotations": [{
        "new_key": "RWRnew...",
        "signature": "minisign signature of new key by old key",
        "valid_from": "2025-06-01T00:00:00Z",
        "old_key_expires": "2025-07-01T00:00:00Z"
      }]
    }
  • Index protection:
    • index.json includes timestamp and is signed
    • Signature stored as adjacent index.json.minisig file
    • Clients reject indices older than 7 days (configurable)
    • Prevents CDN from serving stale but valid indices
  • Mirror verification: All downloads verified against hashes in signed index

CI/CD Pipeline

Step Implementation Purpose
Source cache GitHub Actions cache by URL+SHA Avoid re-downloading
Build matrix arch=[arm64] macos=[14] Platform coverage (15 when available)
Codesigning codesign --options=runtime --entitlements entitlements.plist Hardened runtime for notarization
Upload GitHub Releases + CDN Redundant distribution
Index Static index.json with ETag Efficient updates
MSRV check See below Ensure minimum Rust version
Warnings cargo clippy -- -D warnings Enforce zero warnings

MSRV Enforcement CI Job:

msrv:
  runs-on: macos-14
  steps:
    - uses: actions/checkout@v4
    - name: Install minimum supported Rust version
      run: |
        rustup toolchain install 1.86.0 --profile minimal
        rustup override set 1.86.0
    - name: Check with MSRV
      run: cargo +1.86.0 check --workspace --all-features
    - name: Test with MSRV
      run: cargo +1.86.0 test --workspace

Code-signing entitlements (entitlements.plist):

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
    <key>com.apple.security.cs.allow-unsigned-executable-memory</key>
    <true/>
    <key>com.apple.security.cs.disable-library-validation</key>
    <true/>
</dict>
</plist>

Entitlements justification:

  • allow-unsigned-executable-memory: Future-proofing for WASM/JIT plugins (Starlark uses bytecode interpreter)
  • disable-library-validation: Needed to load packages that contain dylibs from /opt/pm/live/lib
  • These are standard for package managers and development tools
  • Alternative would break core functionality (no future JIT support, no dynamic libraries)

Package Repository Strategy

  • Start with essential developer tools (git, curl, openssl, etc.)
  • Use AI-assisted recipe generation from source URLs
  • Recipes must specify both runtime and build dependencies with appropriate version constraints
  • Prefer ~= for compatible releases, >= for minimum versions
  • Use exact versions (==) only when compatibility requires it
  • CI/CD builds all packages from recipes to produce binary .sp files
  • Binary packages (.sp files) hosted on GitHub Releases + CDN
  • Users install pre-built binary packages only
  • No compilation happens on end-user systems
  • No external package manager dependencies

Release & Distribution

Versioning Strategy

  • CLI version: Semantic versioning (e.g., 0.1.0, 0.2.0, 1.0.0)
  • Index format version: Integer increment (currently: 1)
  • Compatibility: CLI checks index version and minimum_client field
  • Release channels:
    • stable: Production-ready releases
    • testing: Pre-release testing (opt-in via config)

Bootstrap Installation

#!/bin/bash
# Bootstrap installer for sps2
set -euo pipefail  # Abort on any failed download or verification step
SPS2_VERSION="0.1.0"
SPS2_URL="https://github.com/org/sps2/releases/download/v${SPS2_VERSION}/sps2-darwin-arm64"
SPS2_MINISIG="https://github.com/org/sps2/releases/download/v${SPS2_VERSION}/sps2-darwin-arm64.minisig"

# Download and verify
curl -L -o /tmp/sps2 "$SPS2_URL"
curl -L -o /tmp/sps2.minisig "$SPS2_MINISIG"

# Embedded public key for bootstrap trust
PUBKEY="RWRzQJ6...bootstrap-key..."
echo "$PUBKEY" | minisign -V -p /dev/stdin -m /tmp/sps2

# Install
sudo mkdir -p /opt/pm/live/bin
sudo mv /tmp/sps2 /opt/pm/live/bin/
sudo chmod +x /opt/pm/live/bin/sps2

# Setup PATH
echo 'export PATH="/opt/pm/live/bin:$PATH"' >> ~/.zshrc
echo "sps2 installed! Restart your shell or run: export PATH=\"/opt/pm/live/bin:\$PATH\""

PATH Policy

  • No symlinks: We don't create any symlinks in /usr/local/bin or elsewhere
  • Single prefix: All binaries live in /opt/pm/live/bin/
  • User responsibility: Users must add /opt/pm/live/bin to their PATH
  • Shell integration: Bootstrap script adds PATH export to shell rc file
  • Documentation: README prominently shows PATH setup instructions

Update Mechanism

  • sps2 self-update: Updates sps2 itself
  • Downloads new version to temporary location
  • Verifies signature before replacing
  • Atomic replacement of binary
  • Preserves configuration and state

Performance Considerations

Async I/O

  • Use tokio::fs for all file operations
  • Use sqlx for all database operations (no blocking DB calls)
  • Use reqwest for HTTP requests with connection pooling
  • Concurrent downloads with connection pooling
  • Parallel hash verification during installs
  • Batch database operations where possible

APFS Optimizations

  • clonefile() for instant staging directory creation
  • Hard links to avoid data duplication
  • Compression flags on /opt/pm/store/
  • Avoid unnecessary stat() calls

Caching Strategy

  • Repository index cached with ETag validation
  • Package store is the cache (content-addressed)
  • Build artifacts cached by source hash
  • Build dependencies cached and reused across builds
  • Runtime dependencies cached in package store
  • Starlark recipes parsed and cached

Concurrency Limits

  • Download pool: 4 concurrent connections (configurable)
  • Hash verification: num_cpus threads
  • Build jobs: Algorithm below
  • Database connections: SQLx pool with 5 max connections (1 writer, 4 readers)

Build concurrency algorithm:

fn calculate_build_jobs(config_value: usize) -> usize {
    if config_value > 0 {
        config_value  // User override
    } else {
        // Auto-detect based on CPU count
        let cpus = num_cpus::get();

        // Use 75% of CPUs for builds, minimum 1
        // This leaves headroom for system responsiveness
        (cpus * 3 / 4).max(1)
    }
}

Event channel notes:

  • Using unbounded channels for simplicity
  • In practice, memory usage limited by operation scope
  • Long builds with verbose output may buffer significant events
  • BuildStepOutput truncation: Lines longer than 4KB are truncated with "..." suffix
  • Build log overflow: After 10MB of output per step, emit warning and drop subsequent lines
  • Future optimization: Consider bounded channels with back-pressure if needed

Future: CVE Audit System (Low Priority)

Note: This functionality will be implemented after the core package manager is complete and stable.

Architecture Overview

The audit crate will provide offline CVE scanning using embedded SBOMs:

sps2 audit [--all|--package <name>] [--fail-on critical]
         │
         ├─> Load SBOM from installed packages
         ├─> Query local vulnerability database
         └─> Report findings (table/json)

Vulnerability Database Design

  • Format: SQLite databases for offline queries (accessed via sqlx)
  • Sources: NVD, OSV, GitHub Security Advisories
  • Updates: Daily sync via sps2 vulndb update
  • Storage: /opt/pm/vulndb/ with versioned schemas

Audit Workflow

  1. Parse SBOM (SPDX/CycloneDX) from installed packages
  2. Extract component identifiers (PURL, CPE)
  3. Query local SQLite databases for matches
  4. Filter by severity thresholds
  5. Present results with remediation advice

Implementation Plan (Future)

  1. vulndb crate for database management
  2. SBOM parser integration (reuse from builder)
  3. CVE matching logic with semver awareness
  4. CLI command (sps2 audit) with output formats
  5. Post-install hooks for automatic scanning
  6. Database update mechanism and CDN distribution

Why This Design

  • Offline-first: No privacy concerns from phoning home
  • Fast: Local SQLite queries < 50ms per package
  • Accurate: SBOM-based matching reduces false positives
  • Integrated: Reuses existing SBOM infrastructure

This audit system will provide enterprise-grade supply chain security without compromising user privacy or adding network dependencies to the core package management operations.
