data_magyar_nemzet_small.Rd
The dataset contains 2834 front-page articles from the Hungarian print daily Magyar Nemzet. It is a sample drawn from the data_magyar_nemzet_large dataset and is used in the sixth chapter of the textbook (https://tankonyv.poltextlab.com/sentiment.html).
data_magyar_nemzet_small
It is a data.frame with 2834 observations and 3 variables:
A unique document ID (the row number in this case)
The unprocessed article texts
Date of the article
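The structure described above can be checked with a short sketch like the one below. It is only an illustration: check_shape is a hypothetical helper, not part of the package, and the toy data frame uses made-up column names and contents so that the code runs without the dataset. The real column names may differ. The last step runs only if data_magyar_nemzet_small is already loaded in the session.

```r
# Hypothetical helper (not part of the package): checks that an object
# matches the documented shape, i.e. a data.frame with 3 variables and,
# by default, 2834 observations.
check_shape <- function(x, n_obs = 2834L, n_vars = 3L) {
  is.data.frame(x) && nrow(x) == n_obs && ncol(x) == n_vars
}

# Toy stand-in with made-up column names and contents, used only so the
# sketch runs on its own; the real column names may differ.
toy <- data.frame(
  doc_id = 1:2,
  text = c("first article text", "second article text"),
  date = as.Date(c("2000-01-03", "2000-01-04")),
  stringsAsFactors = FALSE
)
toy_ok <- check_shape(toy, n_obs = 2L)

# Applied to the real dataset only if it is already loaded in the session.
if (exists("data_magyar_nemzet_small")) {
  print(check_shape(data_magyar_nemzet_small))
  str(data_magyar_nemzet_small)
}
```

Checking the row and column counts rather than column names keeps the sketch independent of how the three variables are actually named.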
https://cap.tk.hu/en/dataoverview
Sebők, Miklós, and Zoltán Kacsuk (2021). The Multiclass Classification of Newspaper Articles with Machine Learning: The Hybrid Binary Snowball Approach. Political Analysis, 29(2): 236-249.