diff --git a/README.md b/README.md index 69fa8d8..a6aa23e 100644 --- a/README.md +++ b/README.md @@ -5,8 +5,8 @@ This is targeting pages that have been captured by the Wayback Machine that spec See the incomplete script "archive_crawler" to see my working. -TODO: Second crawl -TODO: Filter out all the non-markdown garbage. +- TODO: Second crawl +- TODO: Filter out all the non-markdown garbage. It looks like everything up to `