• If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • Whenever you search in PBworks, Dokkio Sidebar (from the makers of PBworks) will run the same search in your Drive, Dropbox, OneDrive, Gmail, and Slack. Now you can find what you're looking for wherever it lives. Try Dokkio Sidebar for free.



Page history last edited by mike@mbowles.com 11 years, 2 months ago

In the last homework you combines the "acquisition" articles and the "crude" articles from the Reuter's document set and then tried to cluster then based on tfidf matrix.  Now take use the tfidf matrix to do two things. 

1.  put together a an LDA model with 10 topics.  have  a look at the topics and see if you can identify which ones correspond to the original classifications of the documents. 

2.  generate a supervised LDA model with 10 topics using +/- 1 labels corresponding to whether the article is from "crude" or "acquisitions".  How do the topics change from above.

Comments (0)

You don't have permission to comment on this page.