FrankMartinIII/YahooAnswersClassification

YahooAnswersClassification

Text mining project

Analyzed data from the defunct Yahoo Answers. The goal of this project was to predict the best answers to questions in the test set from similar questions in the training data, using term frequency-inverse document frequency (TF-IDF).
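As a rough illustration of the idea (not the code in this repository), the sketch below weights question terms with TF-IDF, finds the most similar training question by cosine similarity, and returns that question's best answer. The helper names (`tokenize`, `build_idf`, `predict_best_answer`), the smoothed IDF formula, and the toy question/answer pairs are all assumptions made for the example; they are not taken from the project or the Kaggle dataset.

```python
import math
from collections import Counter

def tokenize(text):
    # Lowercase and split on non-alphanumeric characters (deliberately simple).
    return "".join(c.lower() if c.isalnum() else " " for c in text).split()

def build_idf(token_lists):
    # Smoothed inverse document frequency: log((1 + N) / (1 + df)) + 1.
    n = len(token_lists)
    df = Counter(term for tokens in token_lists for term in set(tokens))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def tfidf_vector(tokens, idf):
    # Sparse TF-IDF vector; terms never seen in training are ignored.
    return {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}

def cosine(a, b):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def predict_best_answer(question, train_pairs):
    # train_pairs: list of (question, best_answer) tuples from the training set.
    token_lists = [tokenize(q) for q, _ in train_pairs]
    idf = build_idf(token_lists)
    train_vecs = [tfidf_vector(tokens, idf) for tokens in token_lists]
    query = tfidf_vector(tokenize(question), idf)
    scores = [cosine(query, vec) for vec in train_vecs]
    best = max(range(len(scores)), key=scores.__getitem__)
    return train_pairs[best][1], scores[best]

# Hypothetical toy data, invented for this example only.
train_pairs = [
    ("How do I boil an egg?", "Simmer it in water for about ten minutes."),
    ("What is the capital of France?", "Paris."),
    ("How can I fix a flat bicycle tire?", "Patch or replace the inner tube."),
]
answer, score = predict_best_answer("How long should I boil an egg?", train_pairs)
print(answer, round(score, 3))
```

On real data the IDF table and training vectors would be built once and reused for every test question rather than recomputed per query as they are here.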

Used Multinomial Naive Bayes and Rocchio algorithms to find similar questions.
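For readers unfamiliar with the two algorithms, here is a minimal pure-Python sketch of each. It assumes a simple setup in which every training question carries a label and a new question is assigned to the best-matching label; how the project itself applies them may differ. Multinomial Naive Bayes scores a label by summing Laplace-smoothed log term probabilities, while Rocchio compares the question's TF-IDF vector with a per-label centroid. The function names, the `alpha` smoothing parameter, and the toy questions and labels are hypothetical.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    # Lowercase and split on non-alphanumeric characters.
    return "".join(c.lower() if c.isalnum() else " " for c in text).split()

def train_multinomial_nb(docs, labels, alpha=1.0):
    # docs: list of token lists. Returns {label: (log_prior, {term: log_likelihood})}.
    vocab = {t for d in docs for t in d}
    term_counts = defaultdict(Counter)
    for d, y in zip(docs, labels):
        term_counts[y].update(d)
    model = {}
    for y, n_docs in Counter(labels).items():
        total = sum(term_counts[y].values())
        denom = total + alpha * len(vocab)  # Laplace (add-alpha) smoothing
        log_like = {t: math.log((term_counts[y][t] + alpha) / denom) for t in vocab}
        model[y] = (math.log(n_docs / len(docs)), log_like)
    return model

def predict_nb(model, tokens):
    # Pick the label with the highest log posterior; unseen terms are skipped.
    def score(y):
        log_prior, log_like = model[y]
        return log_prior + sum(log_like[t] for t in tokens if t in log_like)
    return max(model, key=score)

def train_rocchio(docs, labels):
    # Build one TF-IDF centroid (mean vector) per label.
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    sums, sizes = defaultdict(Counter), Counter(labels)
    for d, y in zip(docs, labels):
        for t, c in Counter(d).items():
            sums[y][t] += c * idf[t]
    centroids = {y: {t: w / sizes[y] for t, w in vec.items()} for y, vec in sums.items()}
    return idf, centroids

def predict_rocchio(idf, centroids, tokens):
    # Assign the label whose centroid has the highest cosine similarity to the query.
    query = {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}
    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        na = math.sqrt(sum(w * w for w in a.values()))
        nb = math.sqrt(sum(w * w for w in b.values()))
        return dot / (na * nb) if na and nb else 0.0
    return max(centroids, key=lambda y: cosine(query, centroids[y]))

# Hypothetical toy training set, invented for this example only.
train = [
    ("How do I install a graphics driver on my laptop", "computers"),
    ("Why does my computer keep freezing", "computers"),
    ("Who won the football match last night", "sports"),
    ("What is the offside rule in football", "sports"),
]
docs = [tokenize(q) for q, _ in train]
labels = [y for _, y in train]
query = tokenize("My laptop keeps freezing after a driver update")

nb_label = predict_nb(train_multinomial_nb(docs, labels), query)
idf, centroids = train_rocchio(docs, labels)
rocchio_label = predict_rocchio(idf, centroids, query)
print(nb_label, rocchio_label)
```

The two methods need not agree on harder inputs: Naive Bayes works from raw term counts and class priors, whereas Rocchio depends on the TF-IDF weighting and ignores priors.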

The dataset used is available on Kaggle: https://www.kaggle.com/datasets/yacharki/yahoo-answers-10-categories-for-nlp-csv/data
