ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

This is an exact mirror of the ChatGLM-6B project, hosted at https://github.com/zai-org/ChatGLM-6B. SourceForge is not affiliated with ChatGLM-6B.

Downloads: 17 This Week

Last Update: 2025-09-26

Get an email when there's a new version of ChatGLM-6B

Windows Mac Linux BSD ChromeOS

ChatGLM-6B is an open bilingual (Chinese + English) conversational language model based on the GLM architecture, with approximately 6.2 billion parameters. The project provides inference code, demos (command line, web, API), quantization support for lower memory deployment, and tools for finetuning (e.g., via P-Tuning v2). It is optimized for dialogue and question answering with a balance between performance and deployability in consumer hardware settings. Support for quantized inference (INT4, INT8) to reduce GPU memory requirements. Automatic mode switching between precision/memory tradeoffs (full/quantized).

Features

Bilingual dialogue capability (Chinese and English)
Support for quantized inference (INT4, INT8) to reduce GPU memory requirements
Parameter-efficient finetuning (P-Tuning v2 method)
CLI, web demo, and API interfaces included
Automatic mode switching between precision / memory tradeoffs (full / quantized)
Integration with the Hugging Face / Transformers ecosystem (trust_remote_code, model loading, tokenization)

Project Samples

ChatGLM-6B Screenshot 1

ChatGLM-6B Screenshot 2

Project Activity

See All Activity >

{{ this.obj.activity_extras.summary }}

{{/each}}

Categories

Large Language Models (LLM), AI Models

License

Apache License V2.0

Follow ChatGLM-6B

ChatGLM-6B Web Site

Other Useful Business Software

Hybrid Bare Metal Cloud Infrastructure | Servers.com Icon

Hybrid Bare Metal Cloud Infrastructure | Servers.com

Scale, customize and manage your bare metal servers - all in one place.

Three bare metal hosting solutions on one global network. Spin up on demand to cover peaks, then optimize for cost when usage stabilizes.

Learn More

Rate This Project

Login To Rate This Project

User Reviews

Be the first to post a review of ChatGLM-6B!

Additional Project Details

Programming Language

Related Categories

Python Large Language Models (LLM), Python AI Models

Registered

2025-09-26

Similar Business Software

ChatGLM

ChatGLM-6B is an open-source, Chinese-English bilingual dialogue language model based on the General Language Model (GLM) architecture with 6.2 billion parameters. Combined with model quantization technology, users can deploy locally on consumer-grade graphics cards (only 6GB of video memory is...

See Software
Gemini Enterprise Agent Platform

Gemini Enterprise Agent Platform is a comprehensive solution from Google Cloud designed to help organizations build, scale, govern, and optimize AI agents. It represents the evolution of Vertex AI, combining advanced model development with new capabilities for agent orchestration and...

See Software
LM-Kit.NET

LM-Kit.NET is a complete local AI runtime for .NET that lets engineering teams ship AI-powered features without cloud dependencies, per-token costs, or data leaving the network. Most .NET AI integrations stop at inference. LM-Kit.NET covers the full range of capabilities production...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3.5. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
Kimi K2 Thinking

Kimi K2 Thinking is an advanced open source reasoning model developed by Moonshot AI, designed specifically for long-horizon, multi-step workflows where the system interleaves chain-of-thought processes with tool invocation across hundreds of sequential tasks. The model uses a mixture-of-experts...

See Software
MiMo-V2-Flash

MiMo-V2-Flash is an open weight large language model developed by Xiaomi based on a Mixture-of-Experts (MoE) architecture that blends high performance with inference efficiency. It has 309 billion total parameters but activates only 15 billion active parameters per inference, letting it balance...

See Software

Report inappropriate content

Hybrid Bare Metal Cloud Infrastructure | Servers.com

Scale, customize and manage your bare metal servers - all in one place.

Three bare metal hosting solutions on one global network. Spin up on demand to cover peaks, then optimize for cost when usage stabilizes.

Learn More

Recommended Projects

GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
BitNet
BitNet: Scaling 1-bit Transformers for Large Language Models
ChatGLM.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch