| 
  • If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • Buried in cloud files? We can help with Spring cleaning!

    Whether you use Dropbox, Drive, G-Suite, OneDrive, Gmail, Slack, Notion, or all of the above, Dokkio will organize your files for you. Try Dokkio (from the makers of PBworks) for free today.

  • Dokkio (from the makers of PBworks) was #2 on Product Hunt! Check out what people are saying by clicking here.

View
 

MLText-HW3

Page history last edited by mike@mbowles.com 10 years, 6 months ago

In the last homework you combines the "acquisition" articles and the "crude" articles from the Reuter's document set and then tried to cluster then based on tfidf matrix.  Now take use the tfidf matrix to do two things. 

1.  put together a an LDA model with 10 topics.  have  a look at the topics and see if you can identify which ones correspond to the original classifications of the documents. 

2.  generate a supervised LDA model with 10 topics using +/- 1 labels corresponding to whether the article is from "crude" or "acquisitions".  How do the topics change from above.

Comments (0)

You don't have permission to comment on this page.