Since I did the hard work of figuring out how to create a word cloud of the first presidential debate I decided to use my new found knowledge on the Vice Presidential debate. Once again I used the transcript from the Washington Post and I broke it into three documents, kaine.txt, pence.txt, and quijano.txt. This time I not only cleaned up the data for text mining but consolidated terms. As an example Senator and Senator Kaine became Kaine. I did not consolidate terms in the Quijano word cloud. You can see in the word cloud that she referred to Senator Kaine primarily as Senator and Govenor Pence as Govenor. The interesting tidbits that I see in the word clouds are:
- The dominant words were “american” and “people”.
- The word, “jobs”, did not appear in any cloud and the word, “economy”, just barely made the overall list.
- Governor Pence’s word cloud looks like it is a little more issue oriented than Senator Kaine’s word cloud. He talked a lot about “american” and “people”.