crawler

package
v0.1.2
Warning: This package is not in the latest version of its module.
Published: Mar 19, 2017 | License: MIT | Imports: 20 | Imported by: 0

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type Crawler

type Crawler struct {
	BaseURL                    string
	KeywordsToExclude          []string
	KeywordsToInclude          []string
	MaxNumberWorkers           int
	MaxNumberConnections       int
	Verbose                    bool
	FilePrefix                 string
	Remote, Username, Password string // Parameters for BoltDB remote connection
	UserAgent                  string
	TimeIntervalToPrintStats   int
	// contains filtered or unexported fields
}

Crawler is a crawler instance. Its exported fields are the parameters that Crawl uses.
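The documentation does not say how KeywordsToInclude and KeywordsToExclude are applied. The sketch below shows one plausible reading, and it is an assumption rather than the package's implementation: a URL is skipped if it contains any excluded keyword, and when include keywords are set it must contain at least one of them. The filterConfig type and the shouldVisit helper are hypothetical stand-ins defined locally.

```go
package main

import (
	"fmt"
	"strings"
)

// filterConfig mirrors two documented Crawler fields. It is a local stand-in,
// not the package's type.
type filterConfig struct {
	KeywordsToExclude []string
	KeywordsToInclude []string
}

// shouldVisit is a hypothetical helper. Assumed semantics: reject a URL that
// contains any excluded keyword; if include keywords are set, require at
// least one of them to appear in the URL.
func shouldVisit(cfg filterConfig, url string) bool {
	for _, kw := range cfg.KeywordsToExclude {
		if strings.Contains(url, kw) {
			return false
		}
	}
	if len(cfg.KeywordsToInclude) == 0 {
		return true
	}
	for _, kw := range cfg.KeywordsToInclude {
		if strings.Contains(url, kw) {
			return true
		}
	}
	return false
}

func main() {
	cfg := filterConfig{
		KeywordsToExclude: []string{"/login"},
		KeywordsToInclude: []string{"/blog/"},
	}
	for _, u := range []string{
		"https://example.com/blog/post-1",
		"https://example.com/blog/login",
		"https://example.com/about",
	} {
		fmt.Println(u, shouldVisit(cfg, u))
	}
}
```

Checking exclusions first means an excluded keyword always wins over an included one. The real package may order these checks differently.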

func New

func New(url string, boltdbserver string, trace bool) (*Crawler, error)

New creates a new Crawler.

func (*Crawler) Crawl

func (c *Crawler) Crawl() error

Crawl runs the crawl using the parameters set on the Crawler.

func (*Crawler) Download

func (c *Crawler) Download(urls []string) error

Download downloads the pages specified in the todo file.

func (*Crawler) Dump

func (c *Crawler) Dump() error

func (*Crawler) GetLinks

func (c *Crawler) GetLinks() (links []string, err error)

func (*Crawler) Name

func (c *Crawler) Name() string
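The method signatures listed above can be captured in a small interface, which lets calling code be tested without a running BoltDB server. This is a sketch under assumptions: the linkCrawler interface, the fakeCrawler stand-in, and the crawlAndList helper are hypothetical names, and the behavior of Dump, GetLinks, and Name is not documented, so the fake only mimics their signatures.

```go
package main

import (
	"errors"
	"fmt"
)

// linkCrawler lists the methods documented for *Crawler.
type linkCrawler interface {
	Crawl() error
	Download(urls []string) error
	Dump() error
	GetLinks() (links []string, err error)
	Name() string
}

// fakeCrawler is a hypothetical in-memory stand-in used for testing callers.
type fakeCrawler struct {
	name  string
	links []string
	fail  bool
}

func (f *fakeCrawler) Crawl() error {
	if f.fail {
		return errors.New("crawl failed")
	}
	return nil
}

func (f *fakeCrawler) Download(urls []string) error {
	f.links = append(f.links, urls...)
	return nil
}

func (f *fakeCrawler) Dump() error { return nil }

func (f *fakeCrawler) GetLinks() ([]string, error) { return f.links, nil }

func (f *fakeCrawler) Name() string { return f.name }

// crawlAndList runs a crawl and then returns whatever GetLinks reports,
// wrapping any crawl error with the crawler's name.
func crawlAndList(c linkCrawler) ([]string, error) {
	if err := c.Crawl(); err != nil {
		return nil, fmt.Errorf("%s: %w", c.Name(), err)
	}
	return c.GetLinks()
}

func main() {
	c := &fakeCrawler{name: "demo", links: []string{"https://example.com/"}}
	links, err := crawlAndList(c)
	fmt.Println(links, err)
}
```

Since *Crawler has all five methods with these signatures, a value returned by New should satisfy the same interface.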
