generationutils

package

v0.2.1 Latest Latest Go to latest Published: Nov 8, 2023 License: BSD-2-Clause Imports: 10 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/nlpodyssey/cybertron

Links

Open Source Insights

README ¶

This package is a derivative work from Hugging Face Transformers generation utils .

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an " AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Documentation ¶

Overview ¶

Package generationutils implements a decoding search algorithm for conditional generation.

Index ¶

type BeamSearchDecoder
- func (b *BeamSearchDecoder) Decode(ctx context.Context) ([][]int, []float64)
type Config
type DecodingStrategyFunc
type PredictNextFunc
type ScoreProcessor
type ScoredToken
- func SelectNextMultinomial(tokensScores []mat.Matrix, resultSize int) []*ScoredToken
- func SelectNextTopK(tokensScores []mat.Matrix, resultSize int) []*ScoredToken

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type BeamSearchDecoder ¶

type BeamSearchDecoder struct {
	// Config is the configuration of the beam decoder.
	Config Config
	// PredictNext is a function that predicts the next tokens given the current tokens.
	PredictNext PredictNextFunc
	// SelectNext is a function that selects the next tokens given the current tokens.
	SelectNext DecodingStrategyFunc
}

BeamSearchDecoder is an implementations of a decoding search algorithm for conditional generation.

func (*BeamSearchDecoder) Decode ¶

func (b *BeamSearchDecoder) Decode(ctx context.Context) ([][]int, []float64)

Decode generates sequences for model with a language modeling head, using beam-search decoding.

type Config ¶

type Config struct {
	// NumBeams is the number of beams for decoding search.
	NumBeams int
	// MinLength is the minimum length of the sequence to be generated.
	MinLength int
	// MaxLength is the maximum length of the sequence to be generated.
	MaxLength int
	// IsEncoderDecoder reports whether the model is used as an encoder/decoder.
	IsEncoderDecoder bool
	// BOSTokenID is the ID of the Beginning-Of-Sequence token.
	BOSTokenID int
	// EOSTokenID is the ID of the End-Of-Sequence token.
	EOSTokenID int
	// PadTokenID is the id of the padding token.
	PadTokenID int
	// VocabSize is the size of the vocabulary.
	VocabSize int
	// DecoderStartTokenID is the ID of the start token for the decoder of an
	// encoder-decoder model.
	DecoderStartTokenID int
	// LengthPenalty is the exponential penalty to the length.
	// 1.0 means no penalty. Set to values < 1.0 in order to encourage the
	// model to generate shorter sequences, to a value > 1.0 in order to
	// encourage the model to produce longer sequences.
	LengthPenalty float64
	// EarlyStopping reports whether to stop the decoding search when at least
	// NumBeams sentences are finished per batch or not.
	EarlyStopping bool
	// BadWordsIDs is a list of token IDs that are not allowed to be generated.
	BadWordsIDs [][]int
	// When set to a positive value, generated n-grams of this size will
	// only occur once.
	NoRepeatNGramSize int
}

Config provides configuration options for the decoding search algorithm.

type DecodingStrategyFunc ¶

type DecodingStrategyFunc func(tokensScores []mat.Matrix, resultSize int) []*ScoredToken

DecodingStrategyFunc returns the next tokens to be generated.

type PredictNextFunc ¶

type PredictNextFunc func(decodingInputIDs [][]int, lastBeamIndices []int) []mat.Matrix

PredictNextFunc is a function that predicts the next token scores for a given input.

type ScoreProcessor ¶

type ScoreProcessor func(scores mat.Matrix) mat.Matrix

ScoreProcessor is a function that takes a matrix of scores and returns an altered matrix of scores.

func ProcessScores ¶

func ProcessScores(processors ...ScoreProcessor) ScoreProcessor

ProcessScores applies a list of score processors to a matrix of scores.

func TemperatureProcessor ¶

func TemperatureProcessor(temperature float64) ScoreProcessor

TemperatureProcessor applies a temperature to a matrix of scores.

func TopKProcessor ¶

func TopKProcessor(topK int, filterValue float64) ScoreProcessor

TopKProcessor applies a top-k filter to a matrix of scores.

func TopPProcessor ¶

func TopPProcessor[T float.DType](topP, filterValue T, minSize int) ScoreProcessor

TopPProcessor applies a top-p filter to a matrix of scores. Note that when using beam decoding (with beam > 1) then minSize must be at least 2.

type ScoredToken ¶

type ScoredToken struct {
	BeamIndex  int
	TokenIndex int
	Score      float64
}

ScoredToken associates a score to a token identified by its (beam-index, token-index) position.

func SelectNextMultinomial ¶

func SelectNextMultinomial(tokensScores []mat.Matrix, resultSize int) []*ScoredToken

SelectNextMultinomial returns the next tokens to be generated.

func SelectNextTopK ¶

func SelectNextTopK(tokensScores []mat.Matrix, resultSize int) []*ScoredToken

SelectNextTopK returns the next tokens to be generated.

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL