gguf
Here are 71 public repositories matching this topic...
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.
Updated Jun 17, 2026 - Rust
PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal acceleration.
Updated Jun 5, 2026 - Rust
Camelid: a Rust-native local inference backend with evidence-gated model compatibility.
Updated Jun 17, 2026 - Rust
The Natural Language Shell integrates OpenAI's GPTs, Anthropic's Claude, or local GGUF-formatted LLMs directly into the terminal experience, allowing operators to describe their tasks in either POSIX commands or fluent human language
Updated Mar 25, 2024 - Rust
A utility to inspect, validate, sign and verify machine learning model files.
Updated Feb 5, 2025 - Rust
AI inference library for mobile devices
Updated May 11, 2026 - Rust
Inspect an LLM's logprobs and perplexity over a piece of text, or compare two LLMs (like a git diff)
Updated Mar 23, 2026 - Rust
Apple Neural Engine (ANE) LLM inference engine — reverse-engineered private APIs, Metal GPU shaders, hybrid ANE+GPU+CPU on Apple Silicon. 32 tok/s matching llama.cpp, 3.6 TFLOPS fused ANE mega-kernels.
Updated Mar 5, 2026 - Rust
Pure Rust tokenizer for GGUF models - llama.cpp compatible
Updated Jan 15, 2026 - Rust
The Private Agent OS — search files, run AI agents, connect to 10,000+ tools via the complete protocol stack (MCP, AG-UI, A2UI, A2A). Zero cloud. Zero telemetry. On-device inference.
Updated Jun 15, 2026 - Rust