llama.cpp

Port of Facebook's LLaMA model in C/C++

This is an exact mirror of the llama.cpp project, hosted at https://github.com/ggml-org/llama.cpp. SourceForge is not affiliated with llama.cpp.

1 Review

Downloads: 5,569 This Week

Last Update: 17 hours ago

Download

Get an email when there's a new version of llama.cpp

Linux Windows Mac BSD ChromeOS

The llama.cpp project enables the inference of Meta's LLaMA model (and other models) in pure C/C++ without requiring a Python runtime. It is designed for efficient and fast model execution, offering easy integration for applications needing LLM-based capabilities. The repository focuses on providing a highly optimized and portable implementation for running large language models directly within C/C++ environments.

Features

Pure C/C++ implementation for efficient LLM inference.
Supports LLaMA models and other variants.
Optimized for performance and portability.
No dependency on Python, ensuring a lightweight deployment.
Provides easy integration into C/C++-based applications.
Scalable for large language model execution.
Open-source, under the MIT license.
Lightweight setup with minimal requirements.
Active development and community contributions.

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow llama.cpp

llama.cpp Web Site

Other Useful Business Software

Managed Cloud Hosting Platform | Nexcess

For growing digital businesses and engineering teams that need reliable, fully managed cloud infrastructure to run high-performance applications.

The managed cloud solution engineered for simplicity, with built-in governance and risk-mitigation, plus a bill you can actually forecast.

Learn More

Rate This Project

User Ratings

5.0 out of 5 stars

★★★★★

★★★★

★★★

★★

★

ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

User Reviews

Filter Reviews:

All

justinj24 Posted 2023-04-04

Awesome. Democratizing AI for everyone. And it works great!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

C, C++

Related Categories

C++ Large Language Models (LLM), C++ Generative AI, C++ AI Models, C++ LLM Inference Tool, C Large Language Models (LLM), C Generative AI, C AI Models, C LLM Inference Tool

Registered

2023-03-23

Similar Business Software

Gemini Enterprise Agent Platform

Gemini Enterprise Agent Platform is a comprehensive solution from Google Cloud designed to help organizations build, scale, govern, and optimize AI agents. It represents the evolution of Vertex AI, combining advanced model development with new capabilities for agent orchestration and...

See Software
LM-Kit.NET

LM-Kit.NET is a complete local AI runtime for .NET that lets engineering teams ship AI-powered features without cloud dependencies, per-token costs, or data leaving the network. Most .NET AI integrations stop at inference. LM-Kit.NET covers the full range of capabilities production...

See Software
Viktor

Viktor is a persistent AI agent that operates directly within your Slack or Microsoft Teams workspace as an autonomous coworker. Unlike traditional chatbots, Viktor has its own cloud-based computer where it writes code, deploys apps, and executes tasks across more than 3,000 integrations. It...

See Software
RunPod

RunPod offers a cloud-based platform designed for running AI workloads, focusing on providing scalable, on-demand GPU resources to accelerate machine learning (ML) model training and inference. With its diverse selection of powerful GPUs like the NVIDIA A100, RTX 3090, and H100, RunPod supports...

See Software
Parasoft

"Parasoft delivers an AI‑powered software testing platform that helps organizations continuously release high‑quality software. Our solutions support embedded and enterprise teams by integrating code analysis, testing, virtualization, and coverage into the delivery pipeline to improve security,...

See Software
JAMS

JAMS is an automation orchestration and job scheduling solution that works across applications, APIs, and scripting languages. Run, monitor, and manage critical IT processes—from simple batch jobs to cross-platform workflows—from a single pane of glass. JAMS can automate jobs on any...

See Software