Documentation
Index
- type Crawler
- func (c *Crawler) Crawl()
- func (c *Crawler) Get(ctx context.Context, url *url.URL) (*http.Response, error)
- func (c *Crawler) Head(ctx context.Context, url *url.URL) (*http.Response, error)
- func (c *Crawler) Index(ctx context.Context, url *url.URL) error
- func (c *Crawler) Schedule(url *url.URL)
- func (c *Crawler) ScheduleLinks(from *url.URL, node *html.Node)
- type Metadata
Constants
This section is empty.
Variables
This section is empty.
Functions
This section is empty.
Types
type Crawler
type Crawler struct {
	Client        *http.Client
	Domain        string
	DomainID      int
	Authoritative bool
	Exclude       []*regexp.Regexp
	Delay         time.Duration
	RetryAfter    time.Duration
	Robots        *robotstxt.Group
	UserAgent     string
	Start         time.Time
	// contains filtered or unexported fields
}
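The page does not document how the Domain and Exclude fields are consulted, so the following is only a minimal sketch of one plausible reading: a URL is accepted when its host equals the domain and its path matches none of the exclusion patterns. The crawlConfig type and the newConfig, allowed, and allowedString helpers are hypothetical stand-ins written for this illustration; they are not part of the package, and the real Crawler may behave differently.

```go
package main

import (
	"fmt"
	"net/url"
	"regexp"
)

// crawlConfig is a hypothetical stand-in that mirrors two exported
// Crawler fields (Domain and Exclude). It is not the package's type.
type crawlConfig struct {
	Domain  string
	Exclude []*regexp.Regexp
}

// newConfig compiles the given patterns and returns a config for domain.
// It returns an error if any pattern is not a valid regular expression.
func newConfig(domain string, patterns ...string) (*crawlConfig, error) {
	cfg := &crawlConfig{Domain: domain}
	for _, p := range patterns {
		re, err := regexp.Compile(p)
		if err != nil {
			return nil, fmt.Errorf("compile %q: %w", p, err)
		}
		cfg.Exclude = append(cfg.Exclude, re)
	}
	return cfg, nil
}

// allowed reports whether u is on the configured domain and its path
// matches none of the exclusion patterns. Matching against the path
// only (not the full URL) is an assumption of this sketch.
func (c *crawlConfig) allowed(u *url.URL) bool {
	if u.Hostname() != c.Domain {
		return false
	}
	for _, re := range c.Exclude {
		if re.MatchString(u.Path) {
			return false
		}
	}
	return true
}

// allowedString parses raw and applies allowed; unparsable URLs are rejected.
func (c *crawlConfig) allowedString(raw string) bool {
	u, err := url.Parse(raw)
	if err != nil {
		return false
	}
	return c.allowed(u)
}

func main() {
	cfg, err := newConfig("example.org", `^/private/`)
	if err != nil {
		fmt.Println("config error:", err)
		return
	}
	for _, raw := range []string{
		"https://example.org/docs",
		"https://example.org/private/notes",
		"https://other.example.net/docs",
	} {
		fmt.Println(raw, cfg.allowedString(raw))
	}
}
```

Compiling the patterns once up front, as the []*regexp.Regexp field type suggests, keeps the per-URL check to a series of cheap MatchString calls.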