The sample contains 125 Hungarian laws from 1999. It is used in the 4th chapter of the textbook (https://tankonyv.poltextlab.com/corpus-ch.html)

data_lawtext_1999

Format

It is a data.frame, with 125 observation, 2 variables:

doc_id

A unique document id, in this case the filename of the law text

text

The unprocessed text

Source

https://cap.tk.hu/en/dataoverview