BigLAM, an initiative to create an open source, community resource of LAM datasets, seems like it would be a great tool for those working in NLP in libraries and archives. It is a collaboration between BigScience and Hugging Face and is ongoing until the end of October.
https://github.com/bigscience-workshop/lam
https://huggingface.co/biglam
------------------------------
Mary Aycock
Database and Metadata Coordinator
Texas State University
She/Her/Hers
------------------------------