Permissivly licensed datasets for LLM pre-training.
Martin Elstner PRO
dwablimol
AI & ML interests
None yet
Recent Activity
liked a dataset 1 day ago
mlfoundations-dev/organic_chemistry_pdf_word_search liked a dataset 7 days ago
bjoernp/tagesschau-2018-2023 liked a dataset 13 days ago
casperhansen/pmc-oa-markdown