crawler

package
v0.0.0-...-aa53c72 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Nov 12, 2021 License: GPL-3.0 Imports: 6 Imported by: 0

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func ExtractContent

func ExtractContent(
	webpageURL string,
	processor *processor,
)

func InitProcessor

func InitProcessor(params CrawlPsParams) *processor

Types

type CrawlPsParams

type CrawlPsParams struct {
	SiteToCrawl      string
	LaunchThreadsNum int
	VerboseMode      bool
	ReqThrottling    time.Duration
	TimeLimit        time.Duration
}

type CrawledUrlMap

type CrawledUrlMap map[string]int
type ProcessorLinks struct {
	Pending   int
	Requested int
	Processed int
}

type SpendTime

type SpendTime struct {
	Total      int64
	RequestAvg int64
	ParseAvg   int64
}

type StatisticsMap

type StatisticsMap struct {
	CrawledUrlMap  CrawledUrlMap
	ProcessorLinks ProcessorLinks
	SpendTime      SpendTime
}

Directories

Path Synopsis

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL