scraper

package
v0.9.2 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 25, 2024 License: Apache-2.0 Imports: 15 Imported by: 0

Documentation

Overview

Package scrapper implements the scrapper library for the Crowler and the Crowler search engine.

Package scraper implements the scraper library for the Crowler and the Crowler search engine.

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func ApplyPostProcessingStep

func ApplyPostProcessingStep(step *rs.PostProcessingStep, data *[]byte)

ApplyPostProcessingStep applies the provided post-processing step to the provided data.

func ApplyRule

func ApplyRule(rule *rs.ScrapingRule, webPage *selenium.WebDriver) map[string]interface{}

ApplyRule applies the provided scraping rule to the provided web page.

func ApplyRulesGroup

func ApplyRulesGroup(ruleGroup *rs.RuleGroup, url string, webPage *selenium.WebDriver) (map[string]interface{}, error)

ApplyRulesGroup extracts the data from the provided web page using the provided a rule group.

Types

type ScraperRuleEngine

type ScraperRuleEngine struct {
	*rs.RuleEngine // generic rule engine
}

ScraperRuleEngine extends RuleEngine from the ruleset package

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL