crawler

command
v0.1.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jan 10, 2022 License: CC0-1.0 Imports: 18 Imported by: 0

Documentation

Overview

This program crawls the LOC.gov API and identifies digitized items.

It proceeds in this way.

  1. It fetches all digital collections (with some filtering)
  2. It fetches the items in those digital collections (again with some filtering for full text items).

Everything gets stored in a database.

The task of fetching the full item metadata is handled by itemmd.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL