For easy retrieval of links inside href fields. Filter href fields using regex.
Usage
go install https://github.com/BehzadE/link-extractor@latest
link-extractor 'https://golang.org' '/doc/'
https://golang.org/doc/
https://golang.org/doc/copyright.html
https://golang.org/doc/tos.html
As a library
urls, err := linkext.Extract(
"https://dumps.wikimedia.org/enwiki/latest/",
"pages-articles-multistream(-index|)[0-9]+[.]",
)