Audience

Users who need a large language model that can process long contexts

About LongLLaMA

This repository contains the research preview of LongLLaMA, a large language model capable of handling long contexts of 256k tokens or more. LongLLaMA is built upon the foundation of OpenLLaMA and fine-tuned using the Focused Transformer (FoT) method. LongLLaMA Code is built upon the foundation of Code Llama. We release a smaller 3B base variant (not instruction tuned) of the LongLLaMA model under a permissive license (Apache 2.0), along with inference code supporting longer contexts on Hugging Face. Our model weights can serve as a drop-in replacement for LLaMA in existing implementations (for short contexts of up to 2048 tokens). Additionally, we provide evaluation results and comparisons against the original OpenLLaMA models.
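The two loading paths described above (drop-in LLaMA use for short contexts, and the long-context inference code on Hugging Face) might be sketched as follows. This is a minimal illustration, not the project's documented procedure: the model id "syzymon/long_llama_3b" and the helper names are assumptions that do not appear in this listing and should be checked against the repository.

```python
# From the description: drop-in LLaMA use covers contexts of up to 2048 tokens.
SHORT_CONTEXT_LIMIT = 2048

# Assumption: Hugging Face Hub id of the 3B base checkpoint (not stated in this listing).
ASSUMED_MODEL_ID = "syzymon/long_llama_3b"


def needs_long_context_code(num_tokens: int, limit: int = SHORT_CONTEXT_LIMIT) -> bool:
    """Return True when a prompt is longer than the short-context limit."""
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    return num_tokens > limit


def load_model(model_id: str = ASSUMED_MODEL_ID, long_context: bool = True):
    """Hypothetical helper: load the tokenizer and model via transformers."""
    # Imported lazily so the pure helper above works without these packages.
    import torch
    from transformers import AutoModelForCausalLM, LlamaForCausalLM, LlamaTokenizer

    tokenizer = LlamaTokenizer.from_pretrained(model_id)
    if long_context:
        # trust_remote_code lets transformers run model code shipped with the checkpoint.
        model = AutoModelForCausalLM.from_pretrained(
            model_id, torch_dtype=torch.float32, trust_remote_code=True
        )
    else:
        # Drop-in path: the weights are loaded by the stock LLaMA implementation.
        model = LlamaForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)
    return tokenizer, model
```

Calling load_model() downloads several gigabytes of weights, so the sketch only defines it. A caller could use needs_long_context_code(len(token_ids)) to decide which path to take.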

Pricing

Starting Price:
Free
Free Version:
Free Version available.

Integrations

No integrations listed.

Company Information

LongLLaMA
github.com/CStanKonrad/long_llama

Product Details

Platforms Supported:
Cloud
On-Premises
Training:
Documentation
Support:
Online
