Tensor library for machine learning
-
Updated
Jun 21, 2025 - C++
Tensor library for machine learning
High-speed Large Language Model Serving for Local Deployment
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Fast, Flexible and Portable Structured Generation
Fast Multimodal LLM on Mobile Devices
TinyChatEngine: On-Device LLM Inference Library
Run generative AI models in sophgo BM1684X/BM1688
Tensor library & inference framework for machine learning
Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports
Tiny C++11 GPT-2 inference implementation from scratch
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation (KDD'24)
MineGPT 是一个基于Kotlin Multiplatform 开发的本地小型语言模型(SLM)对话应用; MineGPT is a lightweight local SLM (Small Language Model) chat application built with Kotlin Multiplatform. It aims to provide a cross-platform and user-friendly AI assistant experience.
Vulkan & GLSL implementation of FlashAttention-2
This is a special PyTorch For Poor Guys Who can't afford big GPU
LLM-driven 3D terrain generation using OpenGL and C++
deal.II Assistant (Large Language Model & RAG Application)
Add a description, image, and links to the large-language-models topic page so that developers can more easily learn about it.
To associate your repository with the large-language-models topic, visit your repo's landing page and select "manage topics."