llm-inference
Here are 11 public repositories matching this topic...
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
Updated May 30, 2025 - C
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Updated Dec 2, 2024 - C
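The core idea behind retrieval-based speculative decoding is to draft candidate tokens from a datastore of stored continuations (instead of a smaller draft model), then verify them in one pass against the target model and accept the longest matching prefix. The following is a minimal toy sketch of that accept/verify loop; the deterministic `target_next_token` stand-in, the datastore layout, and all function names are illustrative assumptions, not REST's actual implementation or API.

```python
# Toy sketch of retrieval-based speculative decoding (the idea behind REST).
# Everything here is illustrative: the "model" is a deterministic toy rule,
# and the datastore is a hand-built list of (prefix, continuation) pairs.

def target_next_token(context):
    """Stand-in for an expensive LLM forward pass (deterministic toy rule)."""
    return (sum(context) * 31 + len(context)) % 50

def retrieve_draft(context, datastore, k=4):
    """Fetch up to k draft tokens whose stored prefix matches the context suffix."""
    for prefix, continuation in datastore:
        if tuple(context[-len(prefix):]) == prefix:
            return list(continuation[:k])
    return []

def speculative_step(context, datastore):
    """Verify retrieved drafts against the target model; accept the matching
    prefix, then append one guaranteed-correct token from the target model."""
    draft = retrieve_draft(context, datastore)
    accepted = []
    for tok in draft:
        if tok == target_next_token(context + accepted):
            accepted.append(tok)      # draft token matches the target model
        else:
            break                     # first mismatch stops acceptance
    accepted.append(target_next_token(context + accepted))  # bonus token
    return accepted

context = [1, 2, 3]
# Seed the datastore with a continuation the toy target model would actually
# produce, so the retrieved draft verifies successfully.
good, c = [], list(context)
for _ in range(3):
    t = target_next_token(c)
    good.append(t)
    c.append(t)
datastore = [((2, 3), tuple(good))]
out = speculative_step(context, datastore)
print(out)
```

When the drafts verify, one `speculative_step` emits several tokens for roughly the cost of one sequential target-model call per accepted token saved, which is where the speedup comes from.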
电子鹦鹉 (Electronic Parrot) / Toy Language Model
Updated Jun 12, 2025 - C
V-lang API wrapper for the llm-inference engine chatllm.cpp
Updated Nov 20, 2024 - C
Nim API wrapper for the llm-inference engine chatllm.cpp
Updated Nov 20, 2024 - C
Kotlin API wrapper for the llm-inference engine chatllm.cpp
Updated Nov 26, 2024 - C
Python bindings for Transformer models implemented in C/C++ using the GGML library.
Updated Sep 17, 2023 - C
Aquila: iOS 6 untethered jailbreak for all devices. Aquila offers a straightforward solution for jailbreaking iOS 6 on various devices. Check out the installation guide at ios.cfw.guid
Updated Jun 21, 2025 - C