crawl-cache

command module
v0.0.0-...-9c1906e Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Mar 24, 2017 License: Apache-2.0 Imports: 14 Imported by: 0

README

crawl-cache

Circle CI

NSQ crawl queue interceptor caching requests.

Ignores http://, https://, www. prefixes.

Usage

Example usage from command line:

# Install command line application for crawl scheduling
$ go install github.com/crackcomm/crawl/nsq/crawl-schedule
# It will consumer `google_search_cache` and produce `google_search`
$ crawl-cache --topic google_search_cache:google_search &
# Schedule crawl of google search results
$ crawl-schedule \
      --topic google_search_cache \
      --callback github.com/crackcomm/go-google-search/spider.Google \
      "https://www.google.com/search?q=Github"

Callbacks are currently ignored, only URLs are cached.

License

                             Apache License
                       Version 2.0, January 2004
                    http://www.apache.org/licenses/

Authors

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL