mirroring-web-crawler

command module
v0.0.0-...-37a03cf Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 23, 2023 License: Apache-2.0 Imports: 16 Imported by: 0

README

mirroring-web-crawler

A recursive, mirroring web crawler as a command-line tool that accepts a starting URL and a destination directory. The crawler will then download the page at the URL, save it in the destination directory, and then recursively proceed to any valid links in this page.

Dependencies

  • Go Version- go1.20
  • Go package- net/http
  • Go package- golang.org/x/sync v0.1.0

Features

  • Fetch the resource at a target url
  • Parse the retrieve target resource to extract all children <a> links
  • Writes downloaded resource into provided directory
  • Skip fetching and processing files where not necessary such as when they were already downloaded
  • Configurable concurrent processing
  • Support interruption and resume capabilities

Enhancements

  • Use a goroutine pool backed by a queue which provides a better concurrency model and usage of resources.
  • Add support for resuming partially downloaded files which can happen when the crawler is interrupted

Code

Compile:

go build

Usage:
Usage of ./mirroring-web-crawler:
  -concurrency int
        number of concurrent crawlers (default 10)
  -dir string
        target directory to save crawledState files
  -interval int
        the number of milliseconds to wait between crawls (default 1000)
  -overwrite
        overwrite download of files
  -url string
        target url to crawl

Example:

./mirroring-web-crawler -url https://www.scrapingbee.com -dir ./scrapping-bee -interval 10 -concurrency 100

Output:

 % ./mirroring-web-crawler -url https://www.scrapingbee.com -dir ./scrapping-bee -interval 10 -concurrency 100
starting crawler on: 'https://www.scrapingbee.com'
fetching 'https://www.scrapingbee.com'
writing to 'scrapping-bee/index.html'
extracted valid child url: 'https://www.scrapingbee.com/index.xml'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.png'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.svg'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/main.min.a09f1f7d5c32eba3a323bc3c39fca98dc62a83bad52faf6e0c62e7c5285cab6a.css'
extracted valid child url: 'https://www.scrapingbee.com/main.min.a09f1f7d5c32eba3a323bc3c39fca98dc62a83bad52faf6e0c62e7c5285cab6a.css'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/#pricing'
extracted valid child url: 'https://www.scrapingbee.com/#faq'
extracted valid child url: 'https://www.scrapingbee.com/blog/'
extracted valid child url: 'https://www.scrapingbee.com/features/screenshot/'
extracted valid child url: 'https://www.scrapingbee.com/features/google/'
extracted valid child url: 'https://www.scrapingbee.com/features/data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/features/javascript-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/features/make/'
extracted valid child url: 'https://www.scrapingbee.com/tutorials'
extracted valid child url: 'https://www.scrapingbee.com/documentation/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/'
extracted valid child url: 'https://www.scrapingbee.com/features/data-extraction'
extracted valid child url: 'https://www.scrapingbee.com/features/javascript-scenario'
extracted valid child url: 'https://www.scrapingbee.com/features/screenshot'
extracted valid child url: 'https://www.scrapingbee.com/features/google'
extracted valid child url: 'https://www.scrapingbee.com/features/make'
extracted valid child url: 'https://www.scrapingbee.com/documentation/#javascript-execution'
extracted valid child url: 'https://www.scrapingbee.com/documentation/data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/js-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/#Pricing'
extracted valid child url: 'https://www.scrapingbee.com/journey-to-one-million-arr/'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/#about-us'
extracted valid child url: 'https://www.scrapingbee.com/journey-to-one-million-arr/'
extracted valid child url: 'https://www.scrapingbee.com/blog/'
extracted valid child url: 'https://www.scrapingbee.com/rebranding/'
extracted valid child url: 'https://www.scrapingbee.com/affiliates/'
extracted valid child url: 'https://www.scrapingbee.com/curl-converter/'
extracted valid child url: 'https://www.scrapingbee.com/terms-and-conditions/'
extracted valid child url: 'https://www.scrapingbee.com/privacy-policy/'
extracted valid child url: 'https://www.scrapingbee.com/gdpr/'
extracted valid child url: 'https://www.scrapingbee.com/data-processing-agreement/'
extracted valid child url: 'https://www.scrapingbee.com/#features'
extracted valid child url: 'https://www.scrapingbee.com/#pricing'
extracted valid child url: 'https://www.scrapingbee.com/crawlera-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/luminati-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/smartproxy-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/netnut-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scraperapi-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scrapingbee-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-competitor-monitoring/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scrape-content-google-sheet/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-stock-price-slack/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-job-data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/webscraping-questions/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-without-getting-blocked/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-tools/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-free-proxy-list-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-mobile-4g-proxy-provider-webscraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scraping-vs-crawling/'
extracted valid child url: 'https://www.scrapingbee.com/blog/rotating-proxies/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-101-with-python/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-php/'
extracted valid child url: 'https://www.scrapingbee.com/blog/introduction-to-web-scraping-with-java/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-ruby/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-javascript/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-r/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-csharp/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-c++/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-elixir/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-perl/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-rust/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-go/'
extracted valid child url: 'https://www.scrapingbee.com/images/logo.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/hero_illustration.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_headless.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/mike.png'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_rendering.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/russel.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_proxies.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/dominic.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon01.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/data.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/js.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/picture-1.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/search.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/nocode.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/anton.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/maxime_2.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/andy.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/jean.jpg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/team/kevin.png'
extracted valid child url: 'https://www.scrapingbee.com/images/team/pierre.png'
extracted valid child url: 'https://www.scrapingbee.com/images/team/etienne.png'
extracted valid child url: 'https://www.scrapingbee.com/images/logo-white.svg'
extracted valid child url: 'https://www.scrapingbee.com/scripts/app-landing.js'
crawler has finished crawling: 'https://www.scrapingbee.com'
starting crawler on: 'https://www.scrapingbee.com/images/team/kevin.png'
fetching 'https://www.scrapingbee.com/images/team/kevin.png'
writing to 'scrapping-bee/images/team/kevin.png'
crawler has finished crawling: 'https://www.scrapingbee.com/images/team/kevin.png'
starting crawler on: 'https://www.scrapingbee.com/images/icons/nocode.svg'
fetching 'https://www.scrapingbee.com/images/icons/nocode.svg'
writing to 'scrapping-bee/images/icons/nocode.svg'
crawler has finished crawling: 'https://www.scrapingbee.com/images/icons/nocode.svg'
starting crawler on: 'https://www.scrapingbee.com/documentation/js-scenario/'
fetching 'https://www.scrapingbee.com/documentation/js-scenario/'
writing to 'scrapping-bee/documentation/js-scenario.html'
extracted valid child url: 'https://www.scrapingbee.com/index.xml'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.png'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.svg'
extracted valid child url: 'https://www.scrapingbee.com/documentation/js-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/main.min.a09f1f7d5c32eba3a323bc3c39fca98dc62a83bad52faf6e0c62e7c5285cab6a.css'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/google/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/proxy-mode/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/js-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/tutorials/'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/documentation'
extracted valid child url: 'https://www.scrapingbee.com/documentation#json_response'
extracted valid child url: 'https://www.scrapingbee.com/documentation/#json_response'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/#about-us'
extracted valid child url: 'https://www.scrapingbee.com/journey-to-one-million-arr/'
extracted valid child url: 'https://www.scrapingbee.com/blog/'
extracted valid child url: 'https://www.scrapingbee.com/rebranding/'
extracted valid child url: 'https://www.scrapingbee.com/affiliates/'
extracted valid child url: 'https://www.scrapingbee.com/curl-converter/'
extracted valid child url: 'https://www.scrapingbee.com/terms-and-conditions/'
extracted valid child url: 'https://www.scrapingbee.com/privacy-policy/'
extracted valid child url: 'https://www.scrapingbee.com/gdpr/'
extracted valid child url: 'https://www.scrapingbee.com/data-processing-agreement/'
extracted valid child url: 'https://www.scrapingbee.com/#features'
extracted valid child url: 'https://www.scrapingbee.com/#pricing'
extracted valid child url: 'https://www.scrapingbee.com/crawlera-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/luminati-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/smartproxy-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/netnut-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scraperapi-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scrapingbee-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-competitor-monitoring/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scrape-content-google-sheet/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-stock-price-slack/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-job-data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/webscraping-questions/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-without-getting-blocked/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-tools/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-free-proxy-list-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-mobile-4g-proxy-provider-webscraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scraping-vs-crawling/'
extracted valid child url: 'https://www.scrapingbee.com/blog/rotating-proxies/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-101-with-python/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-php/'
extracted valid child url: 'https://www.scrapingbee.com/blog/introduction-to-web-scraping-with-java/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-ruby/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-javascript/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-r/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-csharp/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-c++/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-elixir/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-perl/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-rust/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-go/'
extracted valid child url: 'https://www.scrapingbee.com/images/logo.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/logo-white.svg'
extracted valid child url: 'https://www.scrapingbee.com/scripts/hljs.min.js'
extracted valid child url: 'https://www.scrapingbee.com/scripts/app-doc.min.js'
crawler has finished crawling: 'https://www.scrapingbee.com/documentation/js-scenario/'
starting crawler on: 'https://www.scrapingbee.com/#features'
fetching 'https://www.scrapingbee.com/#features'
writing to 'scrapping-bee.html'
extracted valid child url: 'https://www.scrapingbee.com/index.xml'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.png'
extracted valid child url: 'https://www.scrapingbee.com/images/favico.svg'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/main.min.a09f1f7d5c32eba3a323bc3c39fca98dc62a83bad52faf6e0c62e7c5285cab6a.css'
extracted valid child url: 'https://www.scrapingbee.com/main.min.a09f1f7d5c32eba3a323bc3c39fca98dc62a83bad52faf6e0c62e7c5285cab6a.css'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/#pricing'
extracted valid child url: 'https://www.scrapingbee.com/#faq'
extracted valid child url: 'https://www.scrapingbee.com/blog/'
extracted valid child url: 'https://www.scrapingbee.com/features/screenshot/'
extracted valid child url: 'https://www.scrapingbee.com/features/google/'
extracted valid child url: 'https://www.scrapingbee.com/features/data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/features/javascript-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/features/make/'
extracted valid child url: 'https://www.scrapingbee.com/tutorials'
extracted valid child url: 'https://www.scrapingbee.com/documentation/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/'
extracted valid child url: 'https://www.scrapingbee.com/features/data-extraction'
extracted valid child url: 'https://www.scrapingbee.com/features/javascript-scenario'
extracted valid child url: 'https://www.scrapingbee.com/features/screenshot'
extracted valid child url: 'https://www.scrapingbee.com/features/google'
extracted valid child url: 'https://www.scrapingbee.com/features/make'
extracted valid child url: 'https://www.scrapingbee.com/documentation/#javascript-execution'
extracted valid child url: 'https://www.scrapingbee.com/documentation/data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/js-scenario/'
extracted valid child url: 'https://www.scrapingbee.com/documentation/#Pricing'
extracted valid child url: 'https://www.scrapingbee.com/journey-to-one-million-arr/'
extracted valid child url: 'https://www.scrapingbee.com/'
extracted valid child url: 'https://www.scrapingbee.com/#about-us'
extracted valid child url: 'https://www.scrapingbee.com/journey-to-one-million-arr/'
extracted valid child url: 'https://www.scrapingbee.com/blog/'
extracted valid child url: 'https://www.scrapingbee.com/rebranding/'
extracted valid child url: 'https://www.scrapingbee.com/affiliates/'
extracted valid child url: 'https://www.scrapingbee.com/curl-converter/'
extracted valid child url: 'https://www.scrapingbee.com/terms-and-conditions/'
extracted valid child url: 'https://www.scrapingbee.com/privacy-policy/'
extracted valid child url: 'https://www.scrapingbee.com/gdpr/'
extracted valid child url: 'https://www.scrapingbee.com/data-processing-agreement/'
extracted valid child url: 'https://www.scrapingbee.com/#features'
extracted valid child url: 'https://www.scrapingbee.com/#pricing'
extracted valid child url: 'https://www.scrapingbee.com/crawlera-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/luminati-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/smartproxy-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/netnut-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scraperapi-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/scrapingbee-alternative/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-competitor-monitoring/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scrape-content-google-sheet/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-stock-price-slack/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/nocode-amazon/'
extracted valid child url: 'https://www.scrapingbee.com/blog/no-code-job-data-extraction/'
extracted valid child url: 'https://www.scrapingbee.com/webscraping-questions/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-without-getting-blocked/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-tools/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-free-proxy-list-web-scraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/best-mobile-4g-proxy-provider-webscraping/'
extracted valid child url: 'https://www.scrapingbee.com/blog/scraping-vs-crawling/'
extracted valid child url: 'https://www.scrapingbee.com/blog/rotating-proxies/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-101-with-python/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-php/'
extracted valid child url: 'https://www.scrapingbee.com/blog/introduction-to-web-scraping-with-java/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-ruby/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-javascript/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-r/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-csharp/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-c++/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-elixir/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-perl/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-rust/'
extracted valid child url: 'https://www.scrapingbee.com/blog/web-scraping-go/'
extracted valid child url: 'https://www.scrapingbee.com/images/logo.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/hero_illustration.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_headless.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/mike.png'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_rendering.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/russel.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/landing/feature_proxies.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/dominic.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon01.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/data.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/js.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/picture-1.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/search.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/nocode.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/anton.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/maxime_2.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/andy.jpeg'
extracted valid child url: 'https://www.scrapingbee.com/images/testimonials/jean.jpg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/icons/icon-info-grey.svg'
extracted valid child url: 'https://www.scrapingbee.com/images/team/kevin.png'
extracted valid child url: 'https://www.scrapingbee.com/images/team/pierre.png'
extracted valid child url: 'https://www.scrapingbee.com/images/team/etienne.png'
extracted valid child url: 'https://www.scrapingbee.com/images/logo-white.svg'
extracted valid child url: 'https://www.scrapingbee.com/scripts/app-landing.js'
crawler has finished crawling: 'https://www.scrapingbee.com/#features'
starting crawler on: 'https://www.scrapingbee.com/images/icons/nocode.svg'
content for target url 'https://www.scrapingbee.com/images/icons/nocode.svg' found under file path: 'scrapping-bee/images/icons/nocode.svg'
starting crawler on: 'https://www.scrapingbee.com/#pricing'
content for target url 'https://www.scrapingbee.com/#pricing' found under file path: 'scrapping-bee.html'
^C
crawler has been interrupted

Tests:

go test

Documentation

The Go Gopher

There is no documentation for this package.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL