htmlizer

package module

v0.0.0-...-003fcb5 Published: Feb 18, 2018 License: MIT Imports: 4 Imported by: 1

Repository

github.com/gpestana/htmlizer

README ¶

htmlizer

Extracts only the human-readable content from an HTML document.

Example

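package main

import (
  "fmt"
  "log"

  "github.com/gpestana/htmlizer"
)

func main() {
  html := `
    <html>
     <body>
       <h1>Heading H1</h1>
       <p>This is the first text</p>
       <h2>heading h2</h2>
       <p>This is the second text</p>
     </body>
     <script>console.log("scripts are discarded")</script>
   </html>`

  // Runes listed here are stripped from the extracted text (tabs, in this case).
  ignore := []rune{'\t'}

  // New returns (Htmlizer, error), so the error has to be checked.
  hizer, err := htmlizer.New(ignore)
  if err != nil {
    log.Fatal(err)
  }

  // Load also reports failure through its error return value.
  if err := hizer.Load(html); err != nil {
    log.Fatal(err)
  }

  fmt.Println(">> Struct:")
  fmt.Println(hizer)

  fmt.Println(">> Human readable content:")
  fmt.Println(hizer.HumanReadable())
}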

Output:

>> Struct:
{[Heading H1 heading h2], [This is the first text This is the second text]}
>> Human readable content:
Heading H1
This is the first text
heading h2
This is the second text
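
For readers curious how this kind of extraction can work, here is a minimal sketch that uses only the standard library. It is not htmlizer's implementation: `readableText` is a hypothetical helper, and the choice of `encoding/xml` in non-strict mode (with its HTML auto-close and entity tables) is an assumption made to keep the example dependency-free. It collects text nodes, skips the contents of `<script>` and `<style>`, and drops the runes listed in `ignore`.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"io"
	"strings"
)

// readableText is a hypothetical helper (not part of htmlizer). It returns the
// text nodes of doc, one per line, skipping <script> and <style> contents and
// removing every rune listed in ignore.
func readableText(doc string, ignore []rune) (string, error) {
	dec := xml.NewDecoder(strings.NewReader(doc))
	// Non-strict mode with the HTML tables lets the XML decoder cope with
	// simple, mostly well-formed HTML.
	dec.Strict = false
	dec.AutoClose = xml.HTMLAutoClose
	dec.Entity = xml.HTMLEntity

	// drop maps ignored runes to -1, which strings.Map removes.
	drop := func(r rune) rune {
		for _, ig := range ignore {
			if r == ig {
				return -1
			}
		}
		return r
	}
	isHidden := func(name string) bool {
		name = strings.ToLower(name)
		return name == "script" || name == "style"
	}

	var lines []string
	hidden := 0 // nesting depth inside <script>/<style>
	for {
		tok, err := dec.Token()
		if err == io.EOF {
			break
		}
		if err != nil {
			return "", err
		}
		switch t := tok.(type) {
		case xml.StartElement:
			if isHidden(t.Name.Local) {
				hidden++
			}
		case xml.EndElement:
			if isHidden(t.Name.Local) && hidden > 0 {
				hidden--
			}
		case xml.CharData:
			if hidden > 0 {
				continue
			}
			text := strings.TrimSpace(strings.Map(drop, string(t)))
			if text != "" {
				lines = append(lines, text)
			}
		}
	}
	return strings.Join(lines, "\n"), nil
}

func main() {
	doc := `
    <html>
     <body>
       <h1>Heading H1</h1>
       <p>This is the first text</p>
     </body>
     <script>console.log("scripts are discarded")</script>
   </html>`

	text, err := readableText(doc, []rune{'\t'})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(text)
}
```

A real HTML parser is more forgiving than this sketch; scripts containing `<` characters, for instance, would trip up the XML decoder.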

Contribute

Fork the repository and open a pull request. Use issues for bug reports, feature requests, and general comments.

gpestana © MIT

Documentation ¶

Index ¶

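type Htmlizer
- func New(ignore []rune) (Htmlizer, error)
- func (h *Htmlizer) GetValues(tagType string) ([]Tag, error)
- func (h *Htmlizer) HumanReadable() string
- func (h *Htmlizer) Load(s string) error
type Tag
- func (t *Tag) String() string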

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type Htmlizer ¶

type Htmlizer struct {
	Tags []Tag
	// contains filtered or unexported fields
}

func New ¶

func New(ignore []rune) (Htmlizer, error)

func (*Htmlizer) GetValues ¶

func (h *Htmlizer) GetValues(tagType string) ([]Tag, error)

GetValues returns all tags whose type matches `tagType`.
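
To illustrate the kind of lookup described here, the following is a minimal, self-contained sketch rather than the package's code. `Tag` is redeclared locally to mirror the documented struct, `filterByType` is a hypothetical stand-in for the filtering step, and the tag type strings (`"h1"`, `"p"`) are assumed values, since the documentation does not say how `Type` is populated or when `GetValues` returns an error.

```go
package main

import "fmt"

// Tag mirrors the documented htmlizer.Tag struct; it is redeclared here so the
// example compiles on its own.
type Tag struct {
	Type  string
	Value string
}

// filterByType is a hypothetical helper that keeps only the tags whose Type
// equals tagType, preserving their original order.
func filterByType(tags []Tag, tagType string) []Tag {
	var out []Tag
	for _, t := range tags {
		if t.Type == tagType {
			out = append(out, t)
		}
	}
	return out
}

func main() {
	// Assumed contents: the Type strings are illustrative only.
	tags := []Tag{
		{Type: "h1", Value: "Heading H1"},
		{Type: "p", Value: "This is the first text"},
		{Type: "h2", Value: "heading h2"},
		{Type: "p", Value: "This is the second text"},
	}

	for _, t := range filterByType(tags, "p") {
		fmt.Println(t.Value)
	}
}
```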

func (*Htmlizer) HumanReadable ¶

func (h *Htmlizer) HumanReadable() string

func (*Htmlizer) Load ¶

func (h *Htmlizer) Load(s string) error

type Tag ¶

type Tag struct {
	Type  string
	Value string
}

func (*Tag) String ¶

func (t *Tag) String() string
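
Because `String` is declared on `*Tag`, only a pointer to a `Tag` satisfies `fmt.Stringer`. The sketch below shows the practical effect with a locally redeclared `Tag`. The output format inside `String` is hypothetical, since the documentation does not specify what the real method returns.

```go
package main

import "fmt"

// Tag mirrors the documented htmlizer.Tag struct; it is redeclared here so the
// example compiles on its own.
type Tag struct {
	Type  string
	Value string
}

// String has a pointer receiver, like the documented method. The format it
// produces is an assumption made for this example.
func (t *Tag) String() string {
	return fmt.Sprintf("<%s> %s", t.Type, t.Value)
}

func main() {
	tag := Tag{Type: "h1", Value: "Heading H1"}

	// *Tag implements fmt.Stringer, so fmt calls String.
	fmt.Println(&tag)

	// A plain Tag value does not implement fmt.Stringer, so fmt falls back to
	// its default struct formatting.
	fmt.Println(tag)
}
```

The same rule applies to the elements of a `[]Tag`: printing the slice directly uses the default struct formatting for each element.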

Source Files ¶

htmlizer.go