In this guided project, you will learn how to import textual data stored in raw text files into R, turn these files into a corpus (a collection of textual documents), and tokenize the text, all using the R package quanteda. You will then learn how to identify words with positive or negative sentiment in the text, and how to plot the proportion of these words over time, stratified by a third variable. You will also learn how to carry out a targeted sentiment analysis, which looks for words with positive or negative sentiment that are adjacent to relevant keywords or phrases, and how to compare its results with those of a generic analysis.
Your workspace is a cloud desktop right in your browser, with no download required.
In a split-screen video, your instructor guides you step by step.