textplain

package module

v0.2.9 (latest) | Published: Mar 8, 2024 | License: MIT | Imports: 8 | Imported by: 1

Repository

github.com/mailproto/textplain

README ¶

Textplain

This project began as a port of the html_to_plaintext logic from github.com/premailer/premailer. It applies the same basic set of rules to generate a text/plain copy of an email from its text/html version.

Usage

myHTML := `<html><body>Hello World</body></html>`
// Convert returns (string, error); check err before using myPlaintext.
myPlaintext, err := textplain.Convert(myHTML, textplain.DefaultLineLength)

By default, Convert applies a word-wrapping algorithm, which is also available as a standalone function:

wrapped := textplain.WordWrap("hello world, here is some text", 15)

Options

Two plaintexters are supplied:

converter := textplain.NewTreeConverter()

The tree converter uses the golang.org/x/net/html package to parse the supplied HTML into a tree and performs a single-pass conversion to plaintext. It is the best-performing option and is recommended for general use.

The library still includes the older converter option:

converter := textplain.NewRegexpConverter()

The regexp converter is the most "true to premailer" implementation and relies on regular expressions. This is largely problematic for two reasons: the regexps must be compiled, and regular expressions in Go use mutexes, which limit concurrency.
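The compilation cost mentioned above is commonly paid only once by compiling patterns at package initialization and sharing them across goroutines. The following is a minimal sketch of that general pattern, not the library's code: `tagPattern`, `stripTags`, and `stripAll` are hypothetical names, and the pattern itself is deliberately naive.

```go
package main

import (
	"fmt"
	"regexp"
	"sync"
)

// tagPattern is an illustrative pattern (an assumption, not taken from the
// library). It is compiled once, when the package is initialized.
var tagPattern = regexp.MustCompile(`<[^>]*>`)

// stripTags removes anything that looks like an HTML tag.
func stripTags(s string) string {
	return tagPattern.ReplaceAllString(s, "")
}

// stripAll processes each document in its own goroutine, sharing the single
// compiled pattern. A *regexp.Regexp is safe for concurrent use.
func stripAll(docs []string) []string {
	out := make([]string, len(docs))
	var wg sync.WaitGroup
	for i, d := range docs {
		wg.Add(1)
		go func(i int, d string) {
			defer wg.Done()
			out[i] = stripTags(d) // each goroutine writes to its own slot
		}(i, d)
	}
	wg.Wait()
	return out
}

func main() {
	fmt.Println(stripAll([]string{"<b>hi</b>", "<p>there</p>"}))
}
```

Whether the shared pattern becomes a contention point depends on the Go version and workload, so it is worth benchmarking both converters before choosing one.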

Documentation ¶

Index ¶

Constants
Variables
func Convert(document string, lineLength int) (string, error)
func MustConvert(document string, lineLength int) string
func WordWrap(txt string, lineLength int) string
type Converter
- func NewRegexpConverter() Converter
- func NewTreeConverter() Converter
type RegexpConverter
- func (t *RegexpConverter) Convert(document string, lineLength int) (string, error)
type TreeConverter
- func (t *TreeConverter) Convert(document string, lineLength int) (string, error)

Constants ¶

const (
	DefaultLineLength = 65
)

Defaults

Variables ¶

var (
	ErrBodyNotFound = errors.New("could not find a `body` element in your html document")
)

Well-defined errors
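An exported sentinel error like this is usually matched with errors.Is. The sketch below redeclares the error locally so that it compiles on its own; real code would compare against textplain.ErrBodyNotFound instead. `convertStub` and `describe` are hypothetical helpers, and the condition under which the stub returns the error is an assumption made for illustration, not the library's documented behavior.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// ErrBodyNotFound is a local copy of the sentinel, declared here only to
// keep the sketch self-contained.
var ErrBodyNotFound = errors.New("could not find a `body` element in your html document")

// convertStub is a hypothetical stand-in for a converter. It wraps the
// sentinel with %w so that errors.Is can still match it.
func convertStub(document string) (string, error) {
	if !strings.Contains(document, "<body") {
		return "", fmt.Errorf("convert: %w", ErrBodyNotFound)
	}
	return "ok", nil
}

// describe classifies the result of a conversion attempt.
func describe(document string) string {
	_, err := convertStub(document)
	switch {
	case errors.Is(err, ErrBodyNotFound):
		return "no body element"
	case err != nil:
		return "other error"
	default:
		return "converted"
	}
}

func main() {
	fmt.Println(describe("<p>hi</p>"))
	fmt.Println(describe("<html><body>hi</body></html>"))
}
```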

Functions ¶

func Convert ¶

func Convert(document string, lineLength int) (string, error)

Convert is a convenience function that lets the library be used without initializing a converter. Because this library relies heavily on regexp objects, it may act as a bottleneck to concurrency due to thread-safety mutexes in *regexp.Regexp internals.

func MustConvert ¶ added in v0.2.0

func MustConvert(document string, lineLength int) string
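MustConvert has no doc comment here. Its signature drops the error return, which matches the Go convention that Must-prefixed helpers panic instead of returning an error. The sketch below illustrates that convention under the assumption that MustConvert follows it; `convert`, `mustConvert`, and `panics` are hypothetical stand-ins, not the library's code.

```go
package main

import (
	"errors"
	"fmt"
)

// convert is a hypothetical stand-in with the same signature as Convert.
// It fails on an empty document purely for illustration.
func convert(document string, lineLength int) (string, error) {
	if document == "" {
		return "", errors.New("empty document")
	}
	return document, nil
}

// mustConvert shows the conventional Must pattern: panic on error.
func mustConvert(document string, lineLength int) string {
	out, err := convert(document, lineLength)
	if err != nil {
		panic(err)
	}
	return out
}

// panics runs fn and reports whether it panicked.
func panics(fn func()) (panicked bool) {
	defer func() {
		if recover() != nil {
			panicked = true
		}
	}()
	fn()
	return false
}

func main() {
	fmt.Println(mustConvert("Hello World", 65))
	fmt.Println(panics(func() { mustConvert("", 65) }))
}
```

A helper of this kind suits inputs known to be valid, such as templates fixed at build time. For untrusted HTML, the error-returning form is the safer choice.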

func WordWrap ¶

func WordWrap(txt string, lineLength int) string

WordWrap searches for logical breakpoints (whitespace) in each line and tries to trim each line to the specified length. Note: this diverges from the regex approach in premailer, which I found to be significantly slower in cases with long unbroken lines: https://github.com/premailer/premailer/blob/7c94e7a/lib/premailer/html_to_plain_text.rb#L116
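Breaking lines at whitespace can be sketched with a simple greedy pass. This is not the library's WordWrap and may differ from it in edge cases (for example, in how overlong words or repeated spaces are handled); `greedyWrap` is a hypothetical helper that measures length in bytes.

```go
package main

import (
	"fmt"
	"strings"
)

// greedyWrap splits each input line on whitespace and packs words into
// output lines of at most lineLength bytes. A single word longer than
// lineLength is kept whole on its own line.
func greedyWrap(txt string, lineLength int) string {
	var out []string
	for _, line := range strings.Split(txt, "\n") {
		words := strings.Fields(line)
		if len(words) == 0 {
			out = append(out, "") // preserve blank lines
			continue
		}
		cur := words[0]
		for _, w := range words[1:] {
			if len(cur)+1+len(w) > lineLength {
				out = append(out, cur) // current line is full, start a new one
				cur = w
				continue
			}
			cur += " " + w
		}
		out = append(out, cur)
	}
	return strings.Join(out, "\n")
}

func main() {
	fmt.Println(greedyWrap("hello world, here is some text", 15))
	// Prints:
	// hello world,
	// here is some
	// text
}
```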

Types ¶

type Converter ¶

type Converter interface {
	Convert(string, int) (string, error)
}
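Because Converter is a one-method interface, callers can wrap or substitute implementations freely. The sketch below redeclares the interface so that it compiles on its own and composes two converters into a fallback chain. `fallbackConverter`, `converterFunc`, `failing`, and `trimming` are hypothetical names for illustration; in real code the two fields might hold the values returned by NewTreeConverter and NewRegexpConverter.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Converter mirrors the interface above so the sketch is self-contained.
type Converter interface {
	Convert(string, int) (string, error)
}

// fallbackConverter tries primary first and uses secondary only on error.
type fallbackConverter struct {
	primary, secondary Converter
}

func (f fallbackConverter) Convert(document string, lineLength int) (string, error) {
	out, err := f.primary.Convert(document, lineLength)
	if err == nil {
		return out, nil
	}
	return f.secondary.Convert(document, lineLength)
}

// converterFunc adapts a plain function to the Converter interface.
type converterFunc func(string, int) (string, error)

func (fn converterFunc) Convert(d string, n int) (string, error) { return fn(d, n) }

// Stub converters used only to exercise the fallback logic.
var (
	failing Converter = converterFunc(func(string, int) (string, error) {
		return "", errors.New("primary failed")
	})
	trimming Converter = converterFunc(func(d string, _ int) (string, error) {
		return strings.TrimSpace(d), nil
	})
)

func main() {
	var c Converter = fallbackConverter{primary: failing, secondary: trimming}
	out, err := c.Convert("  Hello World  ", 65)
	fmt.Printf("%q %v\n", out, err)
}
```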

func NewRegexpConverter ¶ added in v0.2.0

func NewRegexpConverter() Converter

NewRegexpConverter returns a new regexp-based textplain converter.

func NewTreeConverter ¶ added in v0.2.0

func NewTreeConverter() Converter

type RegexpConverter ¶ added in v0.2.0

type RegexpConverter struct {
	// contains filtered or unexported fields
}

func (*RegexpConverter) Convert ¶ added in v0.2.0

func (t *RegexpConverter) Convert(document string, lineLength int) (string, error)

Convert returns a text-only version of the supplied document in UTF-8 format, with all HTML tags removed.

type TreeConverter ¶ added in v0.2.0

type TreeConverter struct{}

func (*TreeConverter) Convert ¶ added in v0.2.0

func (t *TreeConverter) Convert(document string, lineLength int) (string, error)
