Texts in the construction domain of the Danish DK-CLARIN LSP corpus come from Statens Byggeforskningsinstitut, Erhvervs- og Byggestyrelsen and Murerfagets Oplysningsråd. All texts are in TEI P5 XML (the TEIP5DKCLARIN format), with tokenisation, POS tagging, lemmatisation and termhood annotation stored externally to the running text, in separate span groups.
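
To illustrate what text-external (stand-off) annotation means in practice, the following is a minimal Python sketch that merges several annotation layers back onto their tokens. It assumes the layers are TEI `<spanGrp>` elements whose `<span>` children point at token ids through a `from` attribute. The sample document, the `type` values (`token`, `pos`, `lemma`, `term`), the tag values and the function name `merge_layers` are all invented for illustration; the actual TEIP5DKCLARIN layout may differ and should be checked against the corpus files.

```python
import xml.etree.ElementTree as ET

TEI = "{http://www.tei-c.org/ns/1.0}"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Invented toy document (assumption): the real corpus files may use
# different element nesting, attribute names and tag sets.
SAMPLE = """\
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body><p>Mursten holder varmen</p></body></text>
  <spanGrp type="token">
    <span xml:id="t1">Mursten</span>
    <span xml:id="t2">holder</span>
    <span xml:id="t3">varmen</span>
  </spanGrp>
  <spanGrp type="pos">
    <span from="#t1">N</span>
    <span from="#t2">V</span>
    <span from="#t3">N</span>
  </spanGrp>
  <spanGrp type="lemma">
    <span from="#t1">mursten</span>
    <span from="#t2">holde</span>
    <span from="#t3">varme</span>
  </spanGrp>
  <spanGrp type="term">
    <span from="#t1">term</span>
  </spanGrp>
</TEI>
"""


def merge_layers(xml_text):
    """Return one dict per token, combining all stand-off layers."""
    root = ET.fromstring(xml_text)
    # Index the span groups by their (hypothetical) type attribute.
    groups = {grp.get("type"): grp for grp in root.iter(TEI + "spanGrp")}

    # First pass: the token layer defines the ids the other layers refer to.
    tokens = {
        span.get(XML_ID): {"form": span.text}
        for span in groups["token"].iter(TEI + "span")
    }

    # Second pass: attach every other layer to the token it points at.
    for layer, grp in groups.items():
        if layer == "token":
            continue
        for span in grp.iter(TEI + "span"):
            target = span.get("from", "").lstrip("#")
            if target in tokens:
                tokens[target][layer] = span.text

    return list(tokens.values())


if __name__ == "__main__":
    for row in merge_layers(SAMPLE):
        print(row)
```

Keeping each layer in its own span group means that a layer can be added, replaced or ignored without touching the text itself; the cost is that a consumer has to resolve the pointers, as the second pass does here.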