llm

Published: Sep 1, 2023 License: MIT Imports: 5 Imported by: 0

README

The llm-go package is a Go wrapper for llama.cpp that supports running large language models (specifically LLaMA models) in Go. It was derived from ollama's wrapper before their shift to embedding llama-server inside their own server. (If you need an easy-to-use local LLaMA server, please use ollama.ai.)

This package is not meant to be API compatible with ollama's (soon to be deprecated) wrapper, nor is its API stable yet. We are still in the middle of a refactor, and llama.cpp's shift from GGML to GGUF means more work remains.

Quickstart

Assuming you are on a Mac M1 (or M2), have Go and the Apple SDK, and a GGML model, the following should "just work":

wget https://huggingface.co/TheBloke/vicuna-7B-v1.5-GGML/resolve/main/vicuna-7b-v1.5.ggmlv3.q5_K_M.bin
go run github.com/swdunlop/llm-go/examples/io-worker vicuna-7b-v1.5.ggmlv3.q5_K_M.bin

From here, you can enter JSON prompts on stdin and get a stream of JSON predictions on stdout. Sample input:

// io-worker consumes JSONL, each object is processed serially and consists of a prompt for prediction.
{"prompt": "What is the softest breed of llama?"}

And sample output:

// io-worker emits JSONL, strings for incremental predictions, and a final JSON object with timing information
"\n"
" surely"
" not"
" the"
" one"
" that"
" s"
"ells"
" for"
" "
"1"
"."
"5"
" million"
" dollars"
// the final completion has the combined response and wall clock time.
{"response":"\n surely not the one that sells for 1.5 million dollars","seconds":0.942433625}

Supported Platforms

"Support" is a dirty word. This package is a wrapper around a C++ library that changes very fast. It works on Mac M1 using Metal acceleration. We would like it to work on Linux, but we also want to keep the build simple. (This is why we based llm-go on ollama's wrapper -- they got it working with just Go tools, no makefiles required.)

On macOS you will need the Apple SDK for the following frameworks:

  • Accelerate
  • MetalKit
  • MetalPerformanceShaders

If you use Nix on macOS, our flake should provide all the dependencies you need. (Keep in mind, if you use Nix to build, your binary will be linked against the Nix store, which means it will not run on other Macs.)

Updating GGML or LLaMA

Like the original ollama wrapper, llm-go currently uses a script to pull in the C++ code and headers from a llama.cpp checkout. This script also prepends Go build tags to control which features are built. (For example, if you don't have Metal acceleration, you can build without it.)

Using NATS Workers

The llm worker command will subscribe to a NATS subject and process prediction requests using the model specified in the llm worker environment. This can be combined with the llm client / llm predict command, or the ./nats package to request predictions over a NATS network.

This is particularly useful for running multiple instances of a model on other hosts.

Example Usage:

The following three Bash commands will start a NATS server, an llm worker that generates predictions for requests to llm.worker.default, and an llm predict client that connects to it to generate a prediction.

gnatsd &
llm_model=vicuna-7b-v1.5.ggmlv3.q5_K_M.bin llm worker &
echo "What is the airspeed of an unladen swallow?" | llm_type=nats go run ./cmd/llm predict

Documentation

Overview

Package llm describes a high level interface to large language models suitable for basic prediction tasks.

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func Env added in v0.2.0

func Env(prefix string) map[string]any

Env constructs a configuration map from OS environment variables with the provided prefix, like "llm_".

func Map added in v0.2.0

func Map(ref any, out map[string]any) error

Map will marshal the provided structure pointer into a map.

func Register

func Register(name string, fn func(map[string]any) (Interface, error), settings ...Option)

Register will register a named LLM implementation.

func Unmap added in v0.2.0

func Unmap(in map[string]any, ref any) error

Unmap will unmarshal the provided map into the provided structure pointer.

Types

type Interface

type Interface interface {
	Release() // Closes the model and releases any associated resources.

	// Predict calls the provided function with the language model's predicted continuation of the provided input
	// string.  Prediction will stop if the function returns an error, and will eventually stop after the provided
	// context is cancelled.
	Predict(
		ctx context.Context, settings map[string]any, content []string, fn func(Prediction) error,
	) (string, error)
}

Interface describes the common interface that large language models supported by this package provide.

func New

func New(implementation string, settings map[string]any) (Interface, error)

New uses the named implementation to create a new LLM instance.

type Option added in v0.2.1

type Option struct {
	// Name is the name of this setting.  This is used as the key in the settings map passed to the LLM implementation.
	Name string `json:"name"`

	// Value is the value of this setting.  This is either the default value or the current value, depending on the
	// context.
	Value any `json:"value"`

	// Use describes the purpose of this setting.
	Use string `json:"use"`

	// Init identifies options that are only applicable when creating a new instance and not when using its methods.
	Init bool `json:"init,omitempty"`
}

An Option describes a setting that can be used to configure an LLM implementation.

func Settings added in v0.2.1

func Settings(name string) []Option

Settings returns a list of settings that can be used to configure an LLM implementation.

type Prediction

type Prediction interface {
	// String will return the predicted continuation as a string.
	String() string
}

A Prediction provides a partial prediction of the input continuation from a Predictor.

Directories

Path           Synopsis
cmd/llm
examples
slog           Package slog uses either the experimental slog package or the standard slog package depending on the Go version.
nats           Package nats registers an llm implementation that uses NATS to communicate with a worker process.
nats/protocol  Package msg describes the protocol used between the NATS client and worker.
nats/worker    Package worker implements a NATS-based worker for large language models.
