webDownloader

command module
v0.0.0-...-2a98c17 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Feb 16, 2024 License: MIT Imports: 7 Imported by: 0

README

webDownloader v0.1

Golang implementation of a method of downloading websites - built for our Senior Project Peer to Peer Web Cache.

This project is ment for education purposes as a tool in order to develop a peer to peer web cache capability. The repository to that one is currently private, though this tool seems universally useful so I felt should be seperate and public.

Usage: webDownloader {Website URL}

Will contain more options in potential future releases

To-Do

  • Downloads HTML Webpages
  • Saves them in an organized matter
  • Identifies assets
  • Downloads Assets
  • Organizes and saves assets
  • Replaces source location with local path
  • Index's and saves table of downloaded webpages
  • Connects with the P2P Senior Project Peer To Peer Webcache
  • Handles robots.txt to avoid downloading unauthorized information
  • Can take in multiple options to handle either the full webpage, specific webpage, directory of webpages etc.

Documentation

Overview

Alex Kefer - 2024 - Built for P2P Web Cache Project Downloads webpages and stores them in a cache for later retrieval Usage: go run . <URL> Example: go run . https://www.google.com

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL