First step of the model-agnostic image-generation framework. Lands the
plumbing other components (skill, ComfyUI/Replicate adapters, agents)
will plug into:
- internal/backend: Backend interface (Request/Result), thread-safe
  Registry with init-time Register, plus a Mock reference adapter that
  emits a deterministic gradient PNG for smoke tests.
- internal/config: YAML loader for ~/.config/imagen.yaml. Framework owns
  default_backend + output settings + a per-backend block; each adapter
  owns the schema below its own block via BackendSpec.Raw.
- internal/output: filename templating ({date}/{time}/{slug}/{seed}/
  {backend}/{ext}), JSON metadata sidecar, --output override path.
- internal/prompt: embedded styles.yaml, style-preset suffix application.
- internal/server: 501 stub — HTTP surface lands in a follow-up issue.
- cmd/imagen: generate / backends / config (init|validate|path) / serve
  / version subcommands. Stdlib-only flag parsing with a small helper to
  honour positional prompt args ahead of flags (matches the issue spec).
- Tests for output (slug, naming template, sidecar), backend (mock PNG
  validity + determinism, registry build + duplicate panic), config
  (round-trip + validation), prompt (style apply + unknown-style error).
- CLAUDE.md, README.md, docs/architecture.md, docs/usage.md, Makefile.
Acceptance criteria from #211:
1. go build ./... — clean
2. imagen backends — lists registered backends, exits 0
3. imagen generate "test prompt" --backend mock --output /tmp/x.png —
writes a 1024x1024 PNG plus an x.png.json sidecar
4. imagen config init | imagen config validate — round-trips cleanly
5. CLAUDE.md "Adding a new adapter" — six-step recipe
38 lines
1.0 KiB
Go
// Package backend defines the model-agnostic contract every image-generation
// adapter must satisfy. The framework speaks only through Backend; concrete
// adapters (ComfyUI, Replicate, OpenAI, …) translate Request into whatever
// the upstream API expects and return a Result.
package backend

import (
	"context"
	"io"
)

// Request is the cross-backend request shape. Adapters translate it
// to whatever their target API expects. Zero values mean "use backend default"
// unless documented otherwise.
type Request struct {
	Prompt         string
	NegativePrompt string
	Width, Height  int
	Steps          int
	Seed           int64
	Style          string
	BackendOpts    map[string]any
}

// Result is what the backend produces. The caller is responsible for closing
// ImageReader.
type Result struct {
	ImageReader io.ReadCloser
	MimeType    string
	Metadata    map[string]any
}

// Backend is the interface every adapter satisfies.
type Backend interface {
	Name() string
	Generate(ctx context.Context, req Request) (*Result, error)
}