go-web-scraping

command module
v0.0.0-...-df668e8 Latest
Published: May 8, 2023 License: MIT Imports: 7 Imported by: 0

Go Web Scraping

This is a simple web scraper written in Go that can also fetch information from websites rendered via JavaScript.

It runs in Docker, with Chromium installed inside the container. Chromium can be used to render the page; alternatively, the scraper can take the traditional approach and fetch the page's HTML with a simple request.

The information to search for on a site is passed in the body of a POST request.

It is also possible to return all of the HTML from a page by making a request to a specific path.

For more details, see the Makefile, which contains the commands needed to run the application and usage examples, including curl commands.

JavaScript Render

If the site you are fetching data from is rendered with JavaScript, set the javascript parameter in the request body to true.
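As a rough sketch of what such a request could look like from a Go client, the example below builds a POST request whose body sets javascript to true. Only the javascript parameter comes from this README. The JSON encoding, the url field, the endpoint address, and the names findRequest and buildFindRequest are assumptions for illustration; the Makefile's curl commands show the actual request shape.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// findRequest models the POST body. Only "javascript" is named in the
// README; the "url" field and the JSON encoding are assumptions.
type findRequest struct {
	URL        string `json:"url"`        // hypothetical field name
	JavaScript bool   `json:"javascript"` // true = render the page with Chromium
}

// buildFindRequest prepares (but does not send) the POST request and
// also returns the encoded body so it can be inspected.
// Hypothetical helper; the endpoint is supplied by the caller.
func buildFindRequest(endpoint, pageURL string, renderJS bool) (*http.Request, []byte, error) {
	payload, err := json.Marshal(findRequest{URL: pageURL, JavaScript: renderJS})
	if err != nil {
		return nil, nil, err
	}

	req, err := http.NewRequest(http.MethodPost, endpoint, bytes.NewReader(payload))
	if err != nil {
		return nil, nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, payload, nil
}

func main() {
	// The address and path below are placeholders, not the real endpoint.
	req, payload, err := buildFindRequest("http://localhost:8080/find", "https://example.com", true)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(req.Method, req.URL.String())
	fmt.Println(string(payload))
}
```

Passing false for renderJS would correspond to the traditional path, where the HTML is fetched with a simple request and Chromium is not involved.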

How to use

Run the application:

make run

Start the application:

make start

Stop the application:

make stop

View logs for Docker containers:

make logs

Example of how to fetch information within a web page using a URL as input:

make test-find

Example of how to fetch information within a web page using HTML as input:

make test-find-html

Example of how to get all the HTML from a web page using a URL as input:

make test-html
