Documentation ¶
Index ¶
- Constants
- func IsTerminal(identifier string) bool
- type Grammar
- type GrammarBuilder
- type LeafToken
- type MatchedResult
- type NonLeafToken
- type Production
- func (p *Production) Copy() itff.Copier
- func (p *Production) Equals(other Productioner) bool
- func (p *Production) GetLhs() string
- func (p *Production) GetRhsAt(index int) (string, error)
- func (p *Production) GetSymbols() []string
- func (p *Production) IndexOfRhs(rhs string) int
- func (p *Production) Iterator() itf.Iterater[string]
- func (p *Production) Match(at int, b any) Tokener
- func (p *Production) ReverseIterator() itf.Iterater[string]
- func (p *Production) Size() int
- func (p *Production) String() string
- type Productioner
- type RegProduction
- func (r *RegProduction) Compile() error
- func (p *RegProduction) Copy() itff.Copier
- func (p *RegProduction) Equals(other Productioner) bool
- func (p *RegProduction) GetLhs() string
- func (p *RegProduction) GetSymbols() []string
- func (p *RegProduction) Match(at int, b any) Tokener
- func (r *RegProduction) String() string
- type Tokener
Constants ¶
const ( // LeftToRight is the direction of a production from left to right. LeftToRight string = "->" // StartSymbolID is the identifier of the start symbol in the grammar. StartSymbolID string = "source" // EndSymbolID is the identifier of the end symbol in the grammar. EndSymbolID string = "EOF" )
Variables ¶
This section is empty.
Functions ¶
func IsTerminal ¶ added in v0.2.18
IsTerminal checks if the given identifier is a terminal. Terminals are identifiers that start with an uppercase letter.
Parameters:
- identifier: The identifier to check.
Returns:
- bool: True if the identifier is a terminal, false otherwise.
Types ¶
type Grammar ¶
type Grammar struct { // Productions is a slice of Productions in the grammar. Productions []Productioner // LhsToSkip is a slice of productions to skip. LhsToSkip []string // Symbols is a slice of Symbols in the grammar. Symbols []string }
Grammar represents a context-free grammar.
A context-free grammar is a set of productions, each of which consists of a non-terminal symbol and a sequence of symbols.
The non-terminal symbol is the left-hand side of the production, and the sequence of symbols is the right-hand side of the production.
The grammar also contains a set of symbols, which are the non-terminal and terminal symbols in the grammar.
func (*Grammar) Match ¶ added in v0.2.18
func (g *Grammar) Match(at int, b any) []MatchedResult
Match returns a slice of MatchedResult that match the input token.
Parameters:
- at: The position in the input string.
- b: The input stream to match. Refers to Productioner.Match.
Returns:
- []MatchedResult: A slice of MatchedResult that match the input token.
type GrammarBuilder ¶
type GrammarBuilder struct {
// contains filtered or unexported fields
}
GrammarBuilder represents a builder for a grammar.
The default direction of the productions is LeftToRight.
func (*GrammarBuilder) AddProduction ¶
func (b *GrammarBuilder) AddProduction(p ...Productioner)
AddProduction is a method of GrammarBuilder that adds a production to the GrammarBuilder.
Parameters:
- p: The production to add to the GrammarBuilder.
func (*GrammarBuilder) Build ¶
func (b *GrammarBuilder) Build() (*Grammar, error)
Build is a method of GrammarBuilder that builds a Grammar from the GrammarBuilder.
Returns:
- *Grammar: A Grammar built from the GrammarBuilder.
func (*GrammarBuilder) Reset ¶
func (b *GrammarBuilder) Reset()
Reset is a method of GrammarBuilder that resets a GrammarBuilder.
func (*GrammarBuilder) SetToSkip ¶ added in v0.2.19
func (b *GrammarBuilder) SetToSkip(lhss ...string)
SetToSkip is a method of GrammarBuilder that sets the productions to skip in the GrammarBuilder.
Parameters:
- lhss: The left-hand sides of the productions to skip.
func (*GrammarBuilder) String ¶
func (b *GrammarBuilder) String() string
String is a method of GrammarBuilder that returns a string representation of a GrammarBuilder.
It should only be used for debugging and logging purposes.
Returns:
- string: A string representation of a GrammarBuilder.
type LeafToken ¶ added in v0.2.18
type LeafToken struct { // ID is the identifier of the token. ID string // Data is the data of the token. Data string // At is the position of the token in the input string. At int }
LeafToken represents a token that contains a single piece of data.
func NewLeafToken ¶ added in v0.2.18
NewLeafToken creates a new leaf token with the given identifier, data, and position.
Parameters:
- id: The identifier of the token.
- data: The data of the token.
- at: The position of the token in the input string.
Returns:
- LeafToken: The new leaf token.
func (*LeafToken) GetData ¶ added in v0.2.18
GetData returns the data of the token.
Returns:
- any: The data of the token.
func (*LeafToken) GetID ¶ added in v0.2.18
GetID returns the identifier of the token.
Returns:
- string: The identifier of the token.
type MatchedResult ¶ added in v0.2.18
type MatchedResult struct { // Matched is the matched token. Matched Tokener // RuleIndex is the index of the production that matched. RuleIndex int }
MatchedResult represents the result of a match operation.
func NewMatchResult ¶ added in v0.2.18
func NewMatchResult(matched Tokener, ruleIndex int) MatchedResult
NewMatchResult is a constructor of MatchedResult.
Parameters:
- matched: The matched token.
- ruleIndex: The index of the production that matched.
Returns:
- MatchedResult: A new MatchedResult.
type NonLeafToken ¶ added in v0.2.18
type NonLeafToken struct { // ID is the identifier of the token. ID string // Data is the data of the token. Data []Tokener // At is the position of the token in the input string. At int }
NonLeafToken represents a token that contains multiple pieces of data.
func NewNonLeafToken ¶ added in v0.2.18
func NewNonLeafToken(id string, at int, data ...Tokener) NonLeafToken
NewNonLeafToken creates a new non-leaf token with the given identifier, data, and position.
Parameters:
- id: The identifier of the token.
- data: The data of the token.
- at: The position of the token in the input string.
Returns:
- NonLeafToken: The new non-leaf token.
func (*NonLeafToken) GetData ¶ added in v0.2.18
func (t *NonLeafToken) GetData() any
GetData returns the data of the token.
Returns:
- any: The data of the token.
func (*NonLeafToken) GetID ¶ added in v0.2.18
func (t *NonLeafToken) GetID() string
GetID returns the identifier of the token.
Returns:
- string: The identifier of the token.
func (*NonLeafToken) GetPos ¶ added in v0.2.18
func (t *NonLeafToken) GetPos() int
GetPos returns the position of the token in the input string.
Returns:
- int: The position of the token in the input string.
func (*NonLeafToken) String ¶ added in v0.2.18
func (t *NonLeafToken) String() string
String is a method of fmt.Stringer interface.
It should only be used for debugging and logging purposes.
Returns:
- string: A string representation of the non-leaf token.
type Production ¶
type Production struct {
// contains filtered or unexported fields
}
Production represents a production in a grammar.
func NewProduction ¶
func NewProduction(lhs string, rhs ...string) *Production
NewProduction is a function that returns a new Production with the given left-hand side and right-hand side.
Parameters:
- lhs: The left-hand side of the production.
- rhs: The right-hand side of the production.
Returns:
- *Production: A new Production with the given left-hand side and right-hand side.
func (*Production) Copy ¶ added in v0.2.21
func (p *Production) Copy() itff.Copier
Copy is a method of Production that returns a copy of the production.
Returns:
- itff.Copier: A copy of the production.
func (*Production) Equals ¶ added in v0.2.18
func (p *Production) Equals(other Productioner) bool
Equals is a method of Production that returns whether the production is equal to another production. Two productions are equal if their left-hand sides are equal and their right-hand sides are equal.
Parameters:
- other: The other production to compare to.
Returns:
- bool: Whether the production is equal to the other production.
func (*Production) GetLhs ¶ added in v0.2.18
func (p *Production) GetLhs() string
GetLhs is a method of Production that returns the left-hand side of the production.
Returns:
- string: The left-hand side of the production.
func (*Production) GetRhsAt ¶
func (p *Production) GetRhsAt(index int) (string, error)
GetRhsAt is a method of Production that returns the symbol at the given index in the right-hand side of the production.
Parameters:
- index: The index of the symbol to get.
Returns:
- string: The symbol at the given index in the right-hand side of the production.
- error: An error of type *ErrInvalidParameter if the index is invalid.
func (*Production) GetSymbols ¶
func (p *Production) GetSymbols() []string
GetSymbols is a method of Production that returns a slice of symbols in the production. The slice contains the left-hand side of the production and the right-hand side of the production, with no duplicates.
Returns:
- []string: A slice of symbols in the production.
func (*Production) IndexOfRhs ¶ added in v0.2.18
func (p *Production) IndexOfRhs(rhs string) int
IndexOfRhs is a method of Production that returns the index of the given symbol in the right-hand side of the production.
Parameters:
- rhs: The symbol to find the index of.
Returns:
- int: The index of the symbol in the right-hand side of the production. Returns -1 if the symbol is not found.
func (*Production) Iterator ¶
func (p *Production) Iterator() itf.Iterater[string]
Iterator is a method of Production that returns an iterator for the production that iterates over the right-hand side of the production.
Returns:
- itf.Iterater[string]: An iterator for the production.
func (*Production) Match ¶ added in v0.2.18
func (p *Production) Match(at int, b any) Tokener
Match is a method of Production that returns a token that matches the production in the given stack. The token is a non-leaf token if the production is a non-terminal production, and a leaf token if the production is a terminal production.
Parameters:
- at: The current index in the input stack.
- b: The stack to match the production against.
Returns:
- Tokener: A token that matches the production in the stack.
Information:
- 'at' is the current index where the match is being attempted. It is used by the lexer to specify the position of the token in the input string. In parsers, however, it is not really used (at = 0). Despite that, it can be used to provide additional information to the parser for error reporting or debugging.
- as of now, only Stack.Stacker[Tokener] is supported as the type of 'b'.
func (*Production) ReverseIterator ¶ added in v0.2.18
func (p *Production) ReverseIterator() itf.Iterater[string]
ReverseIterator is a method of Production that returns a reverse iterator for the production that iterates over the right-hand side of the production in reverse.
Returns:
- itf.Iterater[string]: A reverse iterator for the production.
func (*Production) Size ¶
func (p *Production) Size() int
Size is a method of Production that returns the number of symbols in the right-hand side of the production.
Returns:
- int: The number of symbols in the right-hand side of the production.
func (*Production) String ¶
func (p *Production) String() string
String is a method of Production that returns a string representation of a Production.
Returns:
- string: A string representation of a Production.
type Productioner ¶ added in v0.2.18
type Productioner interface { // Equals returns whether the production is equal to another production. // Two productions are equal if their left-hand sides are equal and their // right-hand sides are equal. // // Parameters: // // - other: The other production to compare to. // // Returns: // // - bool: Whether the production is equal to the other production. Equals(other Productioner) bool // GetLhs returns the left-hand side of the production. // // Returns: // // - string: The left-hand side of the production. GetLhs() string // GetSymbols returns a slice of symbols in the production. The slice // contains the left-hand side of the production and the right-hand side // of the production, with no duplicates. // // Returns: // // - []string: A slice of symbols in the production. GetSymbols() []string // Match returns a token that matches the production in the given stack. // The token is a non-leaf token if the production is a non-terminal // production, and a leaf token if the production is a terminal production. // // Parameters: // // - at: The current index in the input stack. // - b: The input stream or stack to match the production against. // // Returns: // // - Tokener: A token that matches the production in the input stream or stack. // nil if there is no match. // // Information: // // - 'at' is the current index where the match is being attempted. // It is used by the lexer to specify the position of the token in the // input string. In parsers, however, it is not really used (at = 0). // Despite that, it can be used to provide additional information to // the parser for error reporting or debugging. Match(at int, b any) Tokener fmt.Stringer itff.Copier }
Productioner is an interface that defines methods for a production in a grammar.
type RegProduction ¶ added in v0.2.18
type RegProduction struct {
// contains filtered or unexported fields
}
RegProduction represents a production in a grammar that matches a regular expression.
func NewRegProduction ¶ added in v0.2.18
func NewRegProduction(lhs string, regex string) *RegProduction
NewRegProduction is a function that returns a new RegProduction with the given left-hand side and regular expression.
It adds the '^' character to the beginning of the regular expression to match the beginning of the input string.
Parameters:
- lhs: The left-hand side of the production.
- regex: The regular expression to match the right-hand side of the production.
Returns:
- *RegProduction: A new RegProduction with the given left-hand side and regular expression.
Information:
- Must call Compile() on the returned RegProduction to compile the regular expression.
func (*RegProduction) Compile ¶ added in v0.2.24
func (r *RegProduction) Compile() error
Compile is a method of RegProduction that compiles the regular expression of the production.
Returns:
- error: An error if the regular expression cannot be compiled.
func (*RegProduction) Copy ¶ added in v0.2.21
func (p *RegProduction) Copy() itff.Copier
Copy is a method of RegProduction that returns a copy of the production.
Returns:
- itff.Copier: A copy of the production.
func (*RegProduction) Equals ¶ added in v0.2.18
func (p *RegProduction) Equals(other Productioner) bool
Equals is a method of RegProduction that returns whether the production is equal to another production. Two productions are equal if their left-hand sides are equal and their right-hand sides are equal.
Parameters:
- other: The other production to compare to.
Returns:
- bool: Whether the production is equal to the other production.
func (*RegProduction) GetLhs ¶ added in v0.2.18
func (p *RegProduction) GetLhs() string
GetLhs is a method of RegProduction that returns the left-hand side of the production.
Returns:
- string: The left-hand side of the production.
func (*RegProduction) GetSymbols ¶ added in v0.2.18
func (p *RegProduction) GetSymbols() []string
GetSymbols is a method of RegProduction that returns a slice of symbols in the production. The slice contains the left-hand side of the production.
Returns:
- []string: A slice of symbols in the production.
func (*RegProduction) Match ¶ added in v0.2.18
func (p *RegProduction) Match(at int, b any) Tokener
Match is a method of RegProduction that returns a token that matches the production in the given stack. The token is a non-leaf token if the production is a non-terminal production, and a leaf token if the production is a terminal production.
Parameters:
- at: The current index in the input stack.
- b: The slice of bytes to match the production against.
Returns:
- Tokener: A token that matches the production in the stack. nil if there is no match.
func (*RegProduction) String ¶ added in v0.2.18
func (r *RegProduction) String() string
String is a method of fmt.Stringer that returns a string representation of a RegProduction.
It should only be used for debugging and logging purposes.
Returns:
- string: A string representation of a RegProduction.
type Tokener ¶ added in v0.2.18
type Tokener interface { // GetID returns the identifier of the token. // // Returns: // // - string: The identifier of the token. GetID() string // GetData returns the data of the token. // // Returns: // // - any: The data of the token. GetData() any // GetPos returns the position of the token in the input string. // // Returns: // // - int: The position of the token in the input string. GetPos() int fmt.Stringer }
Tokener is an interface that defines the methods that a token must implement.