HTML-Link-Parser

command module

v0.0.0-...-f597c94 Latest Latest Go to latest Published: Apr 9, 2024 License: MIT Imports: 3 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/siddhant-vij/HTML-Link-Parser

Links

Open Source Insights

README ¶

HTML Link Parser

Gophercises Exercise Details:

In this exercise your goal is create a package that makes it easy to parse an HTML file and extract all of the links (<a href="">...</a> tags). For each extracted link you should return a data structure that includes the href.

Links will be nested in different HTML elements, and it is very possible that you will have to deal with HTML similar to code below.

<a href="/dog">
  <span>Something in a span</span>
  Text not in a span
  <b>Bold text!</b>
</a>

In situations like these we want to get output that looks roughly like:

Link{
  Href: "/dog",
}

Once you have a working program, try to write some tests for it to practice using the testing package in go.

Technical Notes

Use the x/net/html package. Package html implements an HTML5-compliant tokenizer and parser.
Ignore nested links. Eg with following HTML:
```
<a href="#">
Something here <a href="/dog">nested dog link</a>
</a>
```
It is okay if your code returns only the outside link - for the purposes of this exercise.
Include the nested links as well in the output.
Test the code with example files included in the project repository. Improve your tests and edge-case coverage. Add Examples and Documentation for the code. Run the following in this order, using go tooling:
- tests
  - go test
- coverage
  - go test -cover
  - go test -coverprofile coverage.out
- coverage shown in web browser
  - go tool cover -html=coverage.out
- examples shown in documentation in a web browser
  - godoc -http=:8080

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

main.go

Directories ¶

Path	Synopsis
parser

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL