Commit Graph

40 Commits

Author SHA1 Message Date
James R. Jacobs
d851fb5e78
added MIT spreadsheet (#30)
* Update README.md

added MIT spreadsheet

* Add files via upload

added MIT spreadsheet

* Update README.md

new bulk list by EdTrust

* Add files via upload

new bulk list by EdTrust
2025-01-17 13:50:04 -06:00
James R. Jacobs
4a9c7e4b45
3 new bulk lists (#29)
* Update README.md

3 more bulk lists added to the readme

* Add files via upload

3 new bulk seed lists submitted to eot-info@archive.org
2025-01-14 13:15:01 -06:00
James R. Jacobs
f8b8df2490
new seed list submitted by Power To Decide (#28)
* Update README.md

Seed list submitted by Power To Decide

* Add files via upload

new seed list submitted by Power To Decide
2025-01-10 17:11:44 -06:00
James R. Jacobs
ed8e0e2a3e
7 new bulk lists from Infodocket and eot-info submissions. (#27)
* Update README.md

Several new bulk lists added from Infodocket and eot-info submissions.

* Add files via upload

7 new bulk lists from Infodocket and eot-info submissions.
2025-01-08 17:17:15 -06:00
Lauren Ko
c35eb3240e Add cdc-dataset-urls.csv to README 2025-01-06 10:07:15 -06:00
Lauren Ko
ee6cba5868 Update README for FDA and NIH sitemap seed lists 2025-01-02 12:32:17 -06:00
Lauren Ko
526cb84dd0 Add State Department FOIA PDF seed lists to README 2025-01-02 11:26:40 -06:00
Lauren Ko
3e95bc46a9 Tweak seed lists README 2025-01-02 11:02:08 -06:00
James R. Jacobs
59b04f37a8
Add 6 bulk lists (#23)
* Update README.md

2 new lists from EDGI and Hermann-Wu submitted via dot-info@archive.org

* Add files via upload

2 new lists from EDGI and Hermann-Wu submitted via dot-info@archive.org

* Update README.md

bulk list re USDA seeds submitted by AWI 20241222

* Add files via upload

bulk list re USDA seeds submitted by AWI 20241222

* Update README.md

3 bulk lists added: 1 from Gary Price and 2 from eot-info submissions: AWI-XL-4-20241224.xlsx,  AWI-USDA-FSIS-20241222.xlsx,  NSF-20241224.xlsx

* Add files via upload

3 bulk lists added: 1 from Gary Price and 2 from eot-info submissions: AWI-XL-4-20241224.xlsx,  AWI-USDA-FSIS-20241222.xlsx,  NSF-20241224.xlsx

* Update README.md

list submitted by Natl Indian Law Library Natl-Indian-Law-Library-bulk-seeds-20241224.xlsx

* Add files via upload

list submitted by Natl Indian Law Library Natl-Indian-Law-Library-bulk-seeds-20241224.xlsx
2025-01-02 10:56:39 -06:00
James R. Jacobs
bf7cf89659 Update README.md
added bulk list Sustainability-gov-Hermann-Wu-20241220.xlsx
2024-12-20 09:19:42 -06:00
Lauren Ko
86a3364700 Add Defenders of Wildlife seeds 2024-12-19 15:30:47 -06:00
James R. Jacobs
0f565c94e4
bulk list submitted 20241219 by Ailsa Hermann-Wu (#21)
* Update README.md

bulk list on performance.gov by Ailsa Hermann-Wu

* Add files via upload

Bulk list submitted by Ailsa Hermann-Wu re performance.gov 20241219
2024-12-19 15:00:38 -06:00
James R. Jacobs
f1694a635c
bulk seed list of GAO seeds by Ailsa Hermann-Wu (#20)
* Update README.md

bulk seed list of GAO seeds sent by Ailsa Hermann-Wu

* Add files via upload

bulk seed list of GAO seeds sent by Ailsa Hermann-Wu
2024-12-18 10:14:24 -06:00
James R. Jacobs
ef3bd7d5f9
3 new bulk lists submitted by Gary Price (#19)
* Update README.md

added another bulk list from Gary Price/Infodocket. File is USDA_FIS_ERS.xlsx

* Add files via upload

USDA_FIS_ERS.xlsx from Gary Price/infodocket. 1700 or so urls from the USDA. Specifically, the Food Inspection Service and Economic Research Service.

* Update README.md

3 more bulk lists from Gary Price sent on 12/14/2024

* Add files via upload

3 new bulk lists from Gary Price submitted 12/14/2024
2024-12-16 14:11:03 -06:00
James R. Jacobs
ed7cabab8e
another bulk seed list from Gary Price (USDA) (#18)
* Update README.md

added another bulk list from Gary Price/Infodocket. File is USDA_FIS_ERS.xlsx

* Add files via upload

USDA_FIS_ERS.xlsx from Gary Price/infodocket. 1700 or so urls from the USDA. Specifically, the Food Inspection Service and Economic Research Service.
2024-12-12 15:38:05 -06:00
Lauren Ko
97a727fc4e Update README for sitemaps.txt and sitemap-url-seeds directory 2024-12-11 13:21:44 -06:00
Lauren Ko
3b3bf304b9 Update README for CDC PDFs 2024-12-10 14:55:01 -06:00
James R. Jacobs
5a9195431e
bulk list of NPS seeds submitted by Hermann-Wu - Hermann-Wu-nps-20241209.txt (#16)
* Update README.md

added NPS seeds submitted by Hermann-Wu - Hermann-Wu-nps-20241209.txt

* Add files via upload

NPS seeds submitted by Hermann-Wu - Hermann-Wu-nps-20241209.txt

* Update README.md

edited the contact section.
2024-12-10 12:48:14 -06:00
Lauren Ko
ed4d0f0d8a
Add some Bureau of Land Management and EnergyFundsForAll.org seeds (#15)
* Update README.md

* Update README.md

* Add files via upload

* Update README.md

added bulk file from EnergyFundsForAll.org

* Bulk list from EnergyFundsForAll

* Remove extra whitespace

Signed-off-by: Lauren Ko <lauren.ko@unt.edu>

* Remove duplicate listing of infodocket-11-21-2024.xls

---------

Signed-off-by: Lauren Ko <lauren.ko@unt.edu>
Co-authored-by: James R. Jacobs <freegovinfo@gmail.com>
2024-12-09 14:28:43 -06:00
Lauren Ko
a6e38c7311 Add CDC .html seed list 2024-12-03 15:53:17 -06:00
James R. Jacobs
d633f6965c
uploaded new bulk seed files from Gary Price and Kelly Smith (#11)
* adding info docket bulk seed list

* Update README.md

* Update README.md

* Add files via upload

Bulk lists from Gary Price and Kelly Smith. Seed list readme updated with file names.
2024-12-02 12:11:12 -06:00
Lauren Ko
4519cb1ee8 Add NLM seed list 2024-11-22 13:18:59 -06:00
Lauren Ko
37b32203c5 Add irs.gov seeds from Gary Price 2024-11-21 13:24:36 -06:00
James R. Jacobs
58e14710e3
pull requests for info docket bulk list 11-21-2024 (#5)
* adding info docket bulk seed list

* Update README.md
2024-11-21 12:56:50 -06:00
Lauren Ko
7e3d04ed8c Update README for bsky_gov_urlverified.txt 2024-11-21 08:54:52 -06:00
Lauren Ko
3a14a8fb3f Add seed list from EDGI 2024-11-14 09:54:06 -06:00
Lauren Ko
8e8c22e358 Add updated govspeak list 2024-11-08 09:36:35 -06:00
Lauren Ko
a325cf3f79 Add two lists supplied by James Jacobs 2024-10-25 16:48:38 -05:00
Lauren Ko
99460625a9 Add usagov.csv seed list 2024-09-23 11:10:35 -05:00
Greg Lindahl
ba124bec62
Common Crawl seeds (#3)
* Common Crawl Foundation seeds

* clean mil list to just hostnames

* doc: add location of ccf repo that generated these files

---------

Co-authored-by: Greg Lindahl <greg@commomncrawl.org>
2024-09-16 09:33:58 -05:00
Lauren Ko
4392d90188 Add more files from web resources 2024-09-12 15:54:23 -05:00
Lauren Ko
b9dfb4f189 Add seeds from https://touchpoints.app.cloud.gov/registry 2024-09-12 12:19:09 -05:00
Lauren Ko
1b1b4736b4 Add NARA's 118th House Seeds 2024-09-09 16:24:56 -05:00
Lauren Ko
e49378d304 Adding seed lists from NARA and in-scope non gov/mil PURL target domain csv 2024-09-06 16:10:50 -05:00
Lauren Ko
a7cf90dd34 Add GovSpeak seeds 2024-08-01 11:59:00 -05:00
Lauren Ko
b79e23eac5 Add Library of Congress bulk seed list 2024-08-01 09:50:06 -05:00
Lauren Ko
5fe4a4136e
Add CRS reports seeds 2024-06-04 10:42:30 -05:00
Lauren Ko
7a9154ae73 Add spreadsheet for James Jacobs 2024-05-08 16:37:50 -05:00
Lauren Ko
a355cdf1f4 Add seed lists from GPO 2024-02-16 14:13:29 -06:00
Lauren Ko
3738c930be
Create location for seed-lists and provenance README.md 2024-02-09 13:38:48 -06:00