Open Q&A dataset for the Swedish construction industry (byggbranschen). 503 bilingual (SV+EN) Q&As grounded in Swedish law. DOI: 10.5281/zenodo.19630803. By Zaragoza AB.
Command-line tool to split documents into chunks and automatically generate question–answer datasets, designed for preparing data to fine-tune large language models (LLMs).