eot2024/seed-lists
2024-09-06 16:10:50 -05:00
..
117th_House_Seeds.xlsx Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00
117th_Senate_Seeds.xlsx Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00
118th_Senate.xlsx Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00
CRS_ReportsList.csv Nominated URLs for CRS Reports 2024-05-29 21:05:02 -05:00
FDLP_WEb_Archiveseed_list_20240212.csv Add seed lists from GPO 2024-02-16 14:13:29 -06:00
FOIA_Libraries_Dataset_Oct_3_2023_Final.xlsx Add spreadsheet for James Jacobs 2024-05-08 16:37:50 -05:00
govspeakeot080124.xlsx Add GovSpeak seeds 2024-08-01 11:59:00 -05:00
LOC-seeds-for-eot-20240712.xlsx Add Library of Congress bulk seed list 2024-08-01 09:50:06 -05:00
PURL_server_domains_20240214_non_gov_mil.csv Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00
PURL_server_domains_20240214.csv Add seed lists from GPO 2024-02-16 14:13:29 -06:00
README.md Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00

End of Term 2024 Seed Lists

Posted here are seed lists used in the 2024 End of Term Web Archive project. Provenance notes are included below. These lists will be uploaded into the End of Term Bulk Nomination Tool.

GPO seeds

Seeds supplied by Dorothy Bower of the U.S. Government Publishing Office:

  • FDLP_WEb_Archiveseed_list_20240212.csv - list of seeds from the FDLP Web Archive with one page only seeds deleted, that were mainly embedded youtube videos.
  • PURL_server_domains_20240214.csv - report of all target domains from the PURL server; some determined to be out of scope were not included in the Nomination Tool.
    • PURL_server_domains_20240214_non_gov_mil.csv - non .gov/.mil seeds from the PURL_server_domains_20240214.csv list that were determined to be in scope by Mark Phillips of UNT.

Internet Archive seeds

Seeds supplied by Antoine McGrath of Internet Archive:

  • CRS_ReportsList.csv - nominated URLs to government hosted CRS Reports from Daniel Schuman with the American Governance Institute.

Library of Congress seeds

  • LOC-seeds-for-eot-20240712.xlsx

National Archives and Records Administration seeds

Seeds supplied by Elizabeth England of the U.S. National Archives and Records Administration (NARA):

  • 117th_House_Seeds.xlsx - contains five sheets, one each for: House members, majority committees, minority committees, caucuses, and leadership/support/other.
  • 117th_Senate_Seeds.xlsx
  • 118th_Senate.xlsx

Stanford seeds

Seeds supplied by James Jacobs of Stanford University Libraries:

  • FOIA_Libraries_Dataset_Oct_3_2023_Final.xlsx - spreadsheet with seeds for all of the federal FOIA libraries. Lisa DeLuca, who collated the list, said it would be fine to use her spreadsheet from https://works.bepress.com/lisa_deluca/59/.

UC San Diego

Seeds supplied by Kelly L. Smith, Government Information Librarian and Librarian for Urban Studies & Planning / Environmental Studies at UC San Diego Library (via James Jacobs):