gogetter

Version: v0.0.0-...-8fe0bd0
Note: This package is not in the latest version of its module.
Published: Aug 6, 2015 | License: Apache-2.0 | Imports: 16 | Imported by: 0

README

gogetter

Fetches Open Graph data for URLs.

If you deploy this with Docker, remember to mount the host's certificates at /etc/ssl/certs in the container, so that TLS certificates can be verified when fetching HTTPS URLs.

Documentation

Constants

const DEFAULT_UA = "Gogetter (https://github.com/JustinTulloss/gogetter) (like GoogleBot and facebookexternalhit/1.1 and Twitterbot/1.0)"

Variables

This section is empty.

Functions

This section is empty.

Types

type Scraper

type Scraper struct {
	// contains filtered or unexported fields
}

A Scraper instance can be used to scrape webpages for metadata.

func NewScraper

func NewScraper(ua string, shouldCheckRobotsTxt bool) (*Scraper, error)

NewScraper creates a new Scraper. If ua is empty, DEFAULT_UA is used.
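The fallback to DEFAULT_UA can be illustrated with a small standard-library sketch. This is not the package's own code: the names defaultUA, userAgentOrDefault, and newRequest are hypothetical, and the sketch only assumes that an empty user agent string means "use the default" when building the outgoing request.

```go
package main

import (
	"fmt"
	"net/http"
)

// defaultUA mirrors the DEFAULT_UA constant documented above.
const defaultUA = "Gogetter (https://github.com/JustinTulloss/gogetter) (like GoogleBot and facebookexternalhit/1.1 and Twitterbot/1.0)"

// userAgentOrDefault is a hypothetical helper: it returns ua unless it is
// empty, in which case it falls back to defaultUA.
func userAgentOrDefault(ua string) string {
	if ua == "" {
		return defaultUA
	}
	return ua
}

// newRequest is a hypothetical helper that builds a GET request carrying
// the chosen User-Agent header. It does not send the request.
func newRequest(url, ua string) (*http.Request, error) {
	req, err := http.NewRequest(http.MethodGet, url, nil)
	if err != nil {
		return nil, err
	}
	req.Header.Set("User-Agent", userAgentOrDefault(ua))
	return req, nil
}

func main() {
	req, err := newRequest("https://example.com/", "")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(req.Header.Get("User-Agent"))
}
```

With the package itself, the corresponding call would presumably be NewScraper("", true), which per the signature above also enables the robots.txt check.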

func (*Scraper) ParseTags

func (s *Scraper) ParseTags(r io.Reader, webUrl string) (wildcard.Wildcard, error)

func (*Scraper) ScrapeTags

func (s *Scraper) ScrapeTags(url string) (interface{}, error)

Directories

Path Synopsis
Package implementing applink, documented here: http://applinks.org/documentation/
Structs for wildcard schema, documented here: http://www.trywildcard.com/docs/schema/
