web-crawler/internal/util/testdata/GetURLFromHTML/my-simple-cooking-website.html
Dan Anglin 4519de764e
All checks were successful
Tests / test (pull_request) Successful in 13s
feat: add the web crawler
Add the source code for the web crawler. The web crawler is a simple Go
CLI application that traverses through a website and generates a report
of all the internal links found in the site.
2024-08-27 15:42:26 +01:00

37 lines
1.4 KiB
HTML

<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang="" xml:lang="">
<head>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes" />
<title>My simple cooking website</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
span.underline{text-decoration: underline;}
div.column{display: inline-block; vertical-align: top; width: 50%;}
div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;}
ul.task-list{list-style: none;}
</style>
</head>
<body>
<header id="title-block-header">
<h1 class="title">My simple cooking website</h1>
</header>
<p>Find my favourite recipes here.</p>
<h2 id="recipes">Recipes</h2>
<ul>
<li><a href="/recipes/sweet-n-sour-kung-pao-style-chicken">Sweet 'n' Sour Kung Pao-Style Chicken</a></li>
<li><a href="/recipes/beef-and-broccoli">Beef and Broccoli</a></li>
<li><a href="/recipes/asian-glazed-salmon">Asian Glazed Salmon</a></li>
<li><a href="/recipes/caesar-salad">Caesar Salad</a></li>
<li><a href="/recipes/simple-tuna-salad">Simple Tuna Salad</a></li>
<li><a href="/recipes/wholemeal-pizza">Wholemeal Pizza</a></li>
</ul>
<h2 id="links">Links</h2>
<ul>
<li><a href="/news">News</a></li>
<li><a href="/about/contact">Contact</a></li>
<li><a href="https://the-other-site.example.new/home">The other site</a></li>
</ul>
</body>
</html>