Big data collaboratory feedback results

Created by Ivan Lima on Mon Jun 7 2021 12:16:15 -0400

Word cloud plots (one for each question)

Topic modeling

Combine responses to all questions into one dataset.

Apply lemmatization to responses

Convert list of responses to bag-of-words matrix using tf-idf scaling

Extract 20 topics using Non-negative Matrix Factorization (NMF)

Topic number, 10 most frequent words in each topic and topic frequency

20 most frequent words in each topic

Word cloud plot of topics