gguf

Here are 71 public repositories matching this topic...

AlexsJones / llmfit

Hundreds of models & providers. One command to find what runs on your hardware.

skill mlx llm localai gguf unsloth

Updated Jun 17, 2026
Rust

Michael-A-Kuykendall / shimmy

⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.

Updated Jun 17, 2026
Rust

Epistates / pmetal

PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal acceleration.

Updated Jun 5, 2026
Rust

ShelbyJenkins / llm_client

The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes

rust ai candle llm gguf

Updated Aug 6, 2025
Rust

timtoole02 / Camelid

Camelid: a Rust-native local inference backend with evidence-gated model compatibility.

rust metal inference llama quantization local-first apple-silicon llm gguf openai-compatible

Updated Jun 17, 2026
Rust

mikecvet / nl-sh

The Natural Language Shell integrates OpenAI's GPTs, Anthropic's Claude, or local GGUF-formatted LLMs directly into the terminal experience, allowing operators to describe their tasks in either POSIX commands or fluent human language

shell rust terminal rust-lang shell-script gpt-4 llm llama2 gguf

Updated Mar 25, 2024
Rust

dreadnode / tensor-man

A utility to inspect, validate, sign and verify machine learning model files.

pytorch onnx safetensors gguf

Updated Feb 5, 2025
Rust

llamastash / llamastash

A fast terminal native app (TUI) and CLI with init wizard for launching local LLMs with zero overhead

ai llm llamacpp local-ai ollama gguf lmstudio

Updated Jun 17, 2026
Rust

ShelbyJenkins / llm_utils

llm_utils: Basic LLM tools, best practices, and minimal abstraction.

nlp rust tokenizer segmentation llm gguf

Updated Feb 18, 2025
Rust

iBz-04 / quaynor

AI inference library for mobile devices

rust swift-library react-native flutter inference-engine python-ai llamacpp llm-inference local-ai ollama gguf swift-onnxruntime

Updated May 11, 2026
Rust

jimexist / gguf

A small utility library for parsing GGUF file info

parser ai model nom ggml gguf

Updated Jan 27, 2025
Rust

cluaiz / cluaize

An open-source, high-performance local AI inference engine.

ai llama gemma ai-agents onnx bitnet own-your-data llm local-ai qwen gguf deepseek ai-skills cluaiz

Updated Jun 15, 2026
Rust

zackshen / gguf

a GGUF file parser

ai model llama gpt llm ggml gguf

Updated Jun 15, 2026
Rust

Belluxx / Perplex

Inspect LLM's logprobs and perplexity over a piece of text, or compare two LLMs (like a git diff)

text-analysis text-processing textanalysis llamacpp local-llm gguf llamacpp-python gguf-models

Updated Mar 23, 2026
Rust

thebasedcapital / ane-infer

Apple Neural Engine (ANE) LLM inference engine — reverse-engineered private APIs, Metal GPU shaders, hybrid ANE+GPU+CPU on Apple Silicon. 32 tok/s matching llama.cpp, 3.6 TFLOPS fused ANE mega-kernels.

macos rust reverse-engineering quantization ane npu edge-ai deltanet on-device-ai neural-engine apple-silicon apple-neural-engine llm-inference qwen gguf metal-gpu

Updated Mar 5, 2026
Rust

Michael-A-Kuykendall / shimmytok

Pure Rust tokenizer for GGUF models - llama.cpp compatible

rust machine-learning tokenizer llama bpe sentencepiece llm gguf

Updated Jan 15, 2026
Rust

ghostapp-ai / ghost

The Private Agent OS — search files, run AI agents, connect to 10,000+ tools via the complete protocol stack (MCP, AG-UI, A2UI, A2A). Zero cloud. Zero telemetry. On-device inference.

Updated Jun 15, 2026
Rust

InfiniTensor / gguf

handle gguf files

ai gguf

Updated Aug 14, 2025
Rust

dirmacs / ares

Agentic AI server in Rust. Multi-provider LLM routing, tool calling, RAG, MCP, multi-tenant workflows.

rust ai server mcp chatbot openai agents rag llm llamacpp llama-cpp ggml agentic ollama gguf agentic-workflow tool-calling deep-research

Updated Jun 16, 2026
Rust

maeddesg / vulkanforge

LLM inference engine for AMD RDNA4 — Rust + Vulkan compute shaders, gguf & native FP8.

rust machine-learning amd vulkan inference mesa llm fp8 gguf rdna4 gfx1201 gemma4

Updated Jun 17, 2026
Rust
