diff --git a/README.md b/README.md index a6aa23e..7126380 100644 --- a/README.md +++ b/README.md @@ -1,21 +1,44 @@ # wiki.bash-hackers.org Extraction of wiki.bash-hackers.org from the Wayback Machine -This is targeting pages that have been captured by the Wayback Machine that specifically have '?do=edit' on the end of their URL. This gives us the markdown source. +This is targeting pages that have been captured by the Wayback Machine that specifically have `'?do=edit'` on the end of their URL. This gives us the markdown source. See the incomplete script "archive_crawler" to see my working. -- TODO: Second crawl -- TODO: Filter out all the non-markdown garbage. It looks like everything up to `