infoimprese-scraping-tool-go

command module
v0.0.0-...-ada829c
Published: Sep 28, 2019 License: MIT Imports: 8 Imported by: 0

README

InfoImprese Scraper Tool

This is a Go port of the original project, which is written in Python.

The original is available at: https://github.com/debba/infoimprese-scraping-tool

Config

Before using this tool for the first time, create a config.json file inside the conf folder. Use conf/config.example.json as a template, or simply copy it.

You can set up an Anti-Captcha API key to skip captcha checks. To generate a key, see https://anti-captcha.com

You can choose which fields to export. The complete list:

  • "Denominazione",
  • "Sede legale",
  • "Attività",
  • "Sede operativa",
  • "Indirizzo web",
  • "Posta elettronica",
  • "Commercio elettronico",
  • "Chi siamo",
  • "Cosa facciamo",
  • "Classe di fatturato",
  • "Canali di vendita",
  • "Marchi",
  • "Principali paesi di export",
  • "Certificazioni"
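
As an illustration of how a chosen subset of these fields could map to CSV columns, here is a minimal sketch. It is not taken from the tool's source: the selectFields helper, the record map, and the sample values are all hypothetical.

```go
package main

import (
	"encoding/csv"
	"log"
	"os"
)

// selectFields (hypothetical helper) returns the values of record for the
// requested fields, in order. Fields missing from the record become "".
func selectFields(record map[string]string, fields []string) []string {
	row := make([]string, len(fields))
	for i, f := range fields {
		row[i] = record[f]
	}
	return row
}

func main() {
	// A subset of the exportable fields listed above.
	fields := []string{"Denominazione", "Sede legale", "Indirizzo web"}

	// Placeholder record; a real run would fill this from scraped data.
	record := map[string]string{
		"Denominazione": "Example S.r.l.",
		"Indirizzo web": "www.example.com",
	}

	w := csv.NewWriter(os.Stdout)
	// Header row first, then one data row with the selected values.
	if err := w.Write(fields); err != nil {
		log.Fatal(err)
	}
	if err := w.Write(selectFields(record, fields)); err != nil {
		log.Fatal(err)
	}
	w.Flush()
	if err := w.Error(); err != nil {
		log.Fatal(err)
	}
}
```

Keeping the field list as an ordered slice means the column order in the CSV follows the order chosen in the configuration.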

You can also set a mode. The available modes are described in the next section.
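
A rough sketch of how a configuration holding these three settings could be parsed in Go. The JSON key names (anticaptcha_key, fields, mode) and the Config type are assumptions made for illustration; the real keys are the ones in conf/config.example.json.

```go
package main

import (
	"encoding/json"
	"fmt"
	"log"
)

// Config mirrors a hypothetical conf/config.json. The key names are
// assumptions; check conf/config.example.json for the real ones.
type Config struct {
	AntiCaptchaKey string   `json:"anticaptcha_key"`
	Fields         []string `json:"fields"`
	Mode           string   `json:"mode"`
}

// parseConfig decodes raw JSON bytes into a Config.
func parseConfig(data []byte) (Config, error) {
	var cfg Config
	err := json.Unmarshal(data, &cfg)
	return cfg, err
}

func main() {
	// Inline sample standing in for the contents of conf/config.json.
	sample := []byte(`{
		"anticaptcha_key": "YOUR_KEY",
		"fields": ["Denominazione", "Posta elettronica"],
		"mode": "search_by_name"
	}`)

	cfg, err := parseConfig(sample)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("mode=%s fields=%v\n", cfg.Mode, cfg.Fields)
}
```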

Modes

You can choose one of the following scraping modes:

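  • search_by_name: "Ricercando nel Nome" on the website (search in the company name)
  • search_by_desc: "Ricercando nella Descrizione attività" on the website (search in the business description)
  • with_dash: "con la Vetrina su infoimprese.it" on the website (companies with a showcase on infoimprese.it)
  • with_cert: "con certificazione di qualità" on the website (companies with a quality certification)
  • with_dash: "che praticano e-commerce" on the website (companies that practice e-commerce)
  • with_email: "che possiedono l'e-mail" on the website (companies that have an e-mail address)
  • with_website: "che hanno il sito internet" on the website (companies that have a website)
  • with_export: "che svolgono attività di export" on the website (companies that carry out export activities)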
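
A small sketch of how a mode value could be checked against the names listed above before a scrape starts. The validMode function and the modes set are hypothetical and not taken from the tool's source; with_dash appears twice in the list, so it is stored once here.

```go
package main

import "fmt"

// modes is the set of mode names from the list above.
var modes = map[string]bool{
	"search_by_name": true,
	"search_by_desc": true,
	"with_dash":      true,
	"with_cert":      true,
	"with_email":     true,
	"with_website":   true,
	"with_export":    true,
}

// validMode reports whether m is one of the listed mode names.
func validMode(m string) bool {
	return modes[m]
}

func main() {
	// "by_phone" is a made-up name used only to show a rejected value.
	for _, m := range []string{"with_email", "by_phone"} {
		fmt.Printf("%s valid: %t\n", m, validMode(m))
	}
}
```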

Compilation

make

This generates a binary named infoimprese.

For more information, see the Makefile.

Usage

usage: ./infoimprese [-h] -q QUERY [-m MODE] [-l LOCATION] [-o OUTPUT]

The arguments are:

  • query (-q): the keyword to search for (required)
  • location (-l): where you want to search
  • mode (-m): the scraping mode (see the Modes section)
  • output (-o): the CSV file in which to store the data
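
The same four options can be sketched with Go's standard flag package. This is an illustration only, not the tool's actual argument parser: the options type, the parseArgs function, the default values, and the sample query are assumptions.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
)

// options holds the four command-line arguments described above.
type options struct {
	Query, Mode, Location, Output string
}

// parseArgs parses -q, -m, -l and -o and rejects a missing query.
func parseArgs(args []string) (options, error) {
	var o options
	fs := flag.NewFlagSet("infoimprese", flag.ContinueOnError)
	fs.StringVar(&o.Query, "q", "", "keyword to search for (required)")
	fs.StringVar(&o.Mode, "m", "search_by_name", "scraping mode (default is an assumption)")
	fs.StringVar(&o.Location, "l", "", "where to search")
	fs.StringVar(&o.Output, "o", "output.csv", "CSV output file (default is an assumption)")
	if err := fs.Parse(args); err != nil {
		return o, err
	}
	if o.Query == "" {
		return o, errors.New("missing required argument: -q QUERY")
	}
	return o, nil
}

func main() {
	// Fixed sample arguments; a real program would pass os.Args[1:].
	o, err := parseArgs([]string{"-q", "pizzeria", "-l", "Roma"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("%+v\n", o)
}
```

Using a dedicated FlagSet with ContinueOnError returns parse errors to the caller instead of exiting, which makes the parsing step easy to test in isolation.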

Enjoy :)

Windows User?

You can use exec.bat to get a very basic GUI.

Credits

Disclaimer: Please note that this is a research project. I am by no means responsible for any use of this tool.
