* Common Crawl Foundation seeds
* clean mil list to just hostnames
* doc: add location of ccf repo that generated these files

---------

Co-authored-by: Greg Lindahl <greg@commomncrawl.org>