The dataset contains a sample of 600 Hungarian law proposals from the 1994-2018 period. The data is used in Chapter 10 (https://tankonyv.poltextlab.com/similarity.html).

data_lawprop_sample

Format

A data.frame with 600 observations and 2 variables:

tvjav_id

Identifier of each law proposal, in the form election-cycle_proposalname.

tvjav_szoveg

The unprocessed text of each law proposal.
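
As a rough illustration of the ID syntax, the sketch below splits tvjav_id into its election-cycle and proposal-name parts. It runs on a small made-up data.frame with the same two columns, not on the real data. The IDs and texts are hypothetical, and the code assumes that the first underscore in an ID separates the two parts.

```r
# Toy stand-in for data_lawprop_sample; the IDs and texts below are made up
# and only mimic the documented election-cycle_proposalname pattern.
toy <- data.frame(
  tvjav_id = c("1994-1998_T0001", "2014-2018_T0002"),
  tvjav_szoveg = c("first proposal text", "second proposal text"),
  stringsAsFactors = FALSE
)

# Assumption: the first underscore separates the election cycle from the
# proposal name, so everything from that underscore onward is dropped here.
toy$cycle <- sub("_.*$", "", toy$tvjav_id)

# Everything up to and including the first underscore is dropped here.
toy$proposal <- sub("^[^_]*_", "", toy$tvjav_id)

# Length of each raw text in characters, as a quick sanity check.
toy$n_chars <- nchar(toy$tvjav_szoveg)

print(toy[, c("tvjav_id", "cycle", "proposal", "n_chars")])
```

The same three assignments should carry over to data_lawprop_sample once it is loaded, provided the real IDs follow the assumed pattern.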

Source

https://cap.tk.hu/en/dataoverview