From 05a2282bd7ad227a37f7c4773834dbe747f3832c Mon Sep 17 00:00:00 2001 From: Rawiri Blundell Date: Sat, 15 Apr 2023 23:54:03 +1200 Subject: [PATCH] Update README --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index 7126380..8ab9035 100644 --- a/README.md +++ b/README.md @@ -6,7 +6,9 @@ This is targeting pages that have been captured by the Wayback Machine that spec See the incomplete script "archive_crawler" to see my working. - TODO: Markdown linting +- TODO: Markdown conversion from Dokuwiki "Markup" to GitHub "Markdown" using pandoc - TODO: Parse the already downloaded files for any missing links +- TODO: Rinse and repeat ## Extracting the markdown So the pages that have `'?do-edit'` on the end of their URL appear to have a reliable and predictable structure: