Skylark Software LLC

Janua is a new fork of Apache Guacamole with comprehensive support for modern Wayland-based systems.

It is built for GNOME Remote Desktop and KDE KRdp compatibility:

FreeRDP 3 support
RDPSND version 8
H.264/AVC codecs for RDPGFX (required by KDE KRdp)


A visual node-based programming environment for AI workflows, featuring a decentralized message bus architecture with a PostgreSQL data plane.

Visual drag-and-drop module interface
Decentralized message bus architecture
Multi-provider LLM support (Anthropic, OpenAI, Mistral, Ollama)
Agent team management with hierarchical roles


A management dashboard for LLM inference infrastructure, providing control over models, instance pools, GPU allocation, and distributed storage.

Multi-model management (Mistral, DeepSeek, Qwen, Kimi, LLaMA)
Speculative decoding with draft model support
Instance pool orchestration with per-GPU assignment
Speedloader for hot/cold model storage tiers
RDMA-enabled distributed block storage with RAID aggregation
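
For readers unfamiliar with the "draft model" idea behind speculative decoding, here is a minimal sketch of the greedy variant: a cheap draft model proposes a few tokens, the target model checks them, and the longest agreeing prefix is kept. This illustrates the general technique only, not the dashboard's or any inference engine's actual implementation. The `Model` type, `speculative_step`, and `generate` are hypothetical names, and the models are stand-in functions rather than real LLMs.

```python
from typing import Callable, List

# Hypothetical stand-in: a "model" maps a token prefix to its greedy next token.
Model = Callable[[List[int]], int]

def speculative_step(target: Model, draft: Model, prefix: List[int], k: int) -> List[int]:
    """Return the tokens accepted in one draft-then-verify round."""
    # The draft model proposes k tokens autoregressively.
    proposed: List[int] = []
    for _ in range(k):
        proposed.append(draft(prefix + proposed))

    # The target model checks each proposal. A real engine would do this
    # in a single batched forward pass; here it is a plain loop.
    accepted: List[int] = []
    for tok in proposed:
        expected = target(prefix + accepted)
        if tok != expected:
            accepted.append(expected)  # replace the first mismatch and stop
            return accepted
        accepted.append(tok)

    # All k proposals matched, so the target contributes one extra token.
    accepted.append(target(prefix + accepted))
    return accepted

def generate(target: Model, draft: Model, prompt: List[int], n_tokens: int, k: int = 4) -> List[int]:
    """Generate n_tokens after the prompt using repeated speculative steps."""
    out = list(prompt)
    while len(out) - len(prompt) < n_tokens:
        out.extend(speculative_step(target, draft, out, k))
    return out[: len(prompt) + n_tokens]
```

With greedy verification the output is identical to decoding with the target model alone. The draft model only affects how many target calls are needed per accepted token.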


EagleBranch

A llama.cpp fork with TurboQuant 3-bit KV cache compression and speculative decoding (EAGLE v1/v2/v3, NextN/MTP) for current and legacy NVIDIA GPUs and for CPU inference. It includes multiple bug fixes that enable inference on legacy NVIDIA hardware, and a fused rotated-domain matvec kernel ("Lane B") that closes most of the throughput gap to IQ4_NL on MLA models.

TurboQuant 3-bit KV cache compression — 5× smaller than f16, +2.27% PPL on Mistral Small 24B
EAGLE v1/v2/v3 speculative decoding for Mistral Large 3 (675B)
NextN/MTP native speculative decoding for DeepSeek R1/V3 (671B)
Pascal SM61 through Hopper SM90; CPU inference path on x86
Full MLA (Multi-head Latent Attention) and MoE support
MLA quantized-K bug fixes available under MIT on the public branch
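
As a rough sanity check on the "5× smaller than f16" figure: f16 stores 16 bits per cached value, so a 5× reduction implies about 3.2 bits per value, that is, 3 bits of payload plus roughly 0.2 bits of amortized metadata. The sketch below does that arithmetic. The 0.2-bit overhead is inferred from the stated ratio and is an assumption, not a documented detail of the TurboQuant layout. The function names are hypothetical.

```python
F16_BITS = 16.0  # bits per cached value in an f16 KV cache

def effective_bits(ratio: float) -> float:
    """Bits per value implied by a compression ratio relative to f16."""
    return F16_BITS / ratio

def compression_ratio(payload_bits: float, overhead_bits: float = 0.0) -> float:
    """Compression ratio versus f16 for a given per-value bit budget."""
    return F16_BITS / (payload_bits + overhead_bits)

# 5x smaller than f16 implies this many bits per value.
print(round(effective_bits(5.0), 3))

# Pure 3-bit storage with no metadata would be slightly better than 5x.
print(round(compression_ratio(3.0), 3))

# Assumed 0.2 bits/value of metadata (scales etc.) lands on the stated 5x.
print(round(compression_ratio(3.0, 0.2), 3))
```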


Life management tools. A sovereign-cloud alternative to Nextcloud, either self-hosted or cloud-hosted, with zero footprint on your devices.

Self-hosted with HTTPS reverse proxy and Tailscale — your data never leaves your infrastructure
Web client works on any device including mobile — delete the bookmark and nothing exists on your phone
18 modules: tasks, email, calendar, recipes, finance, health, weather, knowledge base, and more
Native Qt6/C++ cross-platform desktop client — Swift for Apple coming soon

Coming Soon

Distributed multi-room audio routing in Rust. A software AV receiver that connects speaker nodes across a network with AI control via MCP.

Route TTS, music, or system audio to any room, group, or broadcast everywhere
Per-room volume and mute with live web dashboard
STT microphone routing — select which room's mic feeds speech recognition
AI control via MCP — Claude can manage routing, volume, and STT sources
Rust server + Python nodes with system tray integration
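
The room, group, and broadcast targets described above can be pictured as a small routing table. The following is a minimal sketch under stated assumptions: the `Router` class, the `"*"` broadcast marker, and the room and source names are all hypothetical and are not taken from the project's actual API.

```python
from dataclasses import dataclass, field
from typing import Dict, Set

@dataclass
class Router:
    rooms: Set[str]                                              # known speaker nodes
    groups: Dict[str, Set[str]] = field(default_factory=dict)    # group name -> rooms
    routes: Dict[str, Set[str]] = field(default_factory=dict)    # source -> rooms

    def resolve(self, target: str) -> Set[str]:
        """Expand a target (room, group, or "*" for broadcast) into rooms."""
        if target == "*":
            return set(self.rooms)
        if target in self.groups:
            # Ignore group members that are not registered rooms.
            return set(self.groups[target]) & self.rooms
        if target in self.rooms:
            return {target}
        raise KeyError(f"unknown target: {target}")

    def route(self, source: str, target: str) -> Set[str]:
        """Point an audio source (e.g. "tts", "music") at a target."""
        self.routes[source] = self.resolve(target)
        return self.routes[source]

router = Router(
    rooms={"kitchen", "office", "den"},
    groups={"downstairs": {"kitchen", "den"}},
)
router.route("music", "downstairs")  # music plays in kitchen and den
router.route("tts", "*")             # announcements go everywhere
```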


Fast hybrid RAM/storage management for LLMs, written in Rust. Keeps frequently used models in RAM for instant access while storing others on cold storage.

Multi-tier RAM storage — local DDR5, remote NVMe-oF over RDMA
One-click promote/demote with automatic tier selection
Fastest-fit placement with failover across pools
Remote pool lifecycle — connect/disconnect with progress feedback
Model pinning and async background transfers
Support for Ollama, llama.cpp, vLLM, and HuggingFace
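
One plausible reading of "fastest-fit placement with failover" is: try the fastest pool that is online and has room, and fall through to slower pools otherwise. The sketch below illustrates that reading only. The `Pool` fields, the bandwidth metric, and the `place` function are assumptions made for illustration, not the tool's actual policy or API (the tool itself is written in Rust).

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Pool:
    name: str
    bandwidth_gbps: float  # hypothetical speed metric used for ranking
    free_gb: float
    online: bool = True

def place(model_gb: float, pools: List[Pool]) -> Optional[Pool]:
    """Reserve space in the fastest online pool that fits the model.

    Returns None when no pool can take the model.
    """
    for pool in sorted(pools, key=lambda p: p.bandwidth_gbps, reverse=True):
        if pool.online and pool.free_gb >= model_gb:
            pool.free_gb -= model_gb
            return pool
    return None

pools = [
    Pool("local-ddr5", bandwidth_gbps=400, free_gb=64),
    Pool("remote-nvmeof", bandwidth_gbps=100, free_gb=512),
]
first = place(40, pools)   # fits in the fast local pool
second = place(40, pools)  # local pool is now too full, so this fails over
```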
