YahooAnswersClassification

Text mining project

Analyzed data from the now-defunct Yahoo Answers. The goal of this project was to predict the best answers to questions in the test set from similar questions in the training data, using term frequency-inverse document frequency (TF-IDF).
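As a rough illustration of the idea, the sketch below weights question terms with TF-IDF, finds the training question most similar to a test question by cosine similarity, and returns that question's best answer. It is a minimal pure-Python sketch, not the code from this repository: the function names, the whitespace tokenizer, the smoothed IDF formula, and the toy questions and answers are all assumptions made for the example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: simple lowercase whitespace tokenization.
    return text.lower().split()


def build_idf(docs):
    # Smoothed inverse document frequency computed over the training questions.
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf_vector(text, idf):
    # Term frequency times IDF, L2-normalized; terms unseen in training are dropped.
    tf = Counter(t for t in tokenize(text) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def cosine(a, b):
    # Both vectors are already unit length, so the dot product is the cosine.
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def predict_best_answer(question, train_questions, train_answers, idf):
    # Return the best answer of the most similar training question and its score.
    q = tfidf_vector(question, idf)
    scores = [cosine(q, tfidf_vector(tq, idf)) for tq in train_questions]
    best = max(range(len(scores)), key=scores.__getitem__)
    return train_answers[best], scores[best]


# Hypothetical toy data standing in for the Yahoo Answers training set.
train_questions = [
    "how do i bake bread at home",
    "what is the capital of france",
    "how do i change a flat tire",
]
train_answers = [
    "Mix flour, water, yeast and salt, then bake.",
    "Paris.",
    "Loosen the nuts, jack up the car, swap the wheel.",
]

idf = build_idf(train_questions)
answer, score = predict_best_answer(
    "capital city of france", train_questions, train_answers, idf
)
print(answer, round(score, 3))
```

On the real dataset a library vectorizer and sparse matrices would likely be a better fit than Python dictionaries; the dictionaries here only keep the example self-contained.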

Used Multinomial Naive Bayes and Rocchio algorithms for finding similar questions.
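The README does not say exactly how the two algorithms were applied, so the following is only a sketch under the assumption that each one assigns a question to a topic category, which could then narrow the search for similar questions. It shows a Multinomial Naive Bayes classifier with Laplace smoothing and a Rocchio (nearest-centroid) classifier over TF-IDF vectors. The class names, the tokenizer, the category labels, and the toy questions are hypothetical and not taken from this repository.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: simple lowercase whitespace tokenization.
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes with add-alpha (Laplace) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.vocab = {t for d in docs for t in tokenize(d)}
        self.class_counts = Counter(labels)
        self.term_counts = defaultdict(Counter)
        for doc, label in zip(docs, labels):
            self.term_counts[label].update(tokenize(doc))
        self.n_docs = len(docs)
        return self

    def predict(self, doc):
        tokens = [t for t in tokenize(doc) if t in self.vocab]
        best_label, best_logp = None, -math.inf
        for label, n_label in self.class_counts.items():
            counts = self.term_counts[label]
            total = sum(counts.values()) + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log term likelihoods.
            logp = math.log(n_label / self.n_docs)
            for t in tokens:
                logp += math.log((counts[t] + self.alpha) / total)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


class Rocchio:
    """Rocchio classification: nearest class centroid by cosine similarity."""

    def fit(self, docs, labels):
        n = len(docs)
        df = Counter()
        for doc in docs:
            df.update(set(tokenize(doc)))
        self.idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
        sums = defaultdict(lambda: defaultdict(float))
        sizes = Counter(labels)
        for doc, label in zip(docs, labels):
            for t, w in self._vector(doc).items():
                sums[label][t] += w
        # Each centroid is the mean TF-IDF vector of its class, renormalized.
        self.centroids = {
            label: self._normalize({t: w / sizes[label] for t, w in vec.items()})
            for label, vec in sums.items()
        }
        return self

    def _normalize(self, vec):
        norm = math.sqrt(sum(w * w for w in vec.values()))
        return {t: w / norm for t, w in vec.items()} if norm else {}

    def _vector(self, doc):
        tf = Counter(t for t in tokenize(doc) if t in self.idf)
        return self._normalize({t: c * self.idf[t] for t, c in tf.items()})

    def predict(self, doc):
        q = self._vector(doc)

        def score(label):
            centroid = self.centroids[label]
            return sum(w * centroid.get(t, 0.0) for t, w in q.items())

        return max(self.centroids, key=score)


# Hypothetical toy questions and category labels.
docs = [
    "my laptop will not boot",
    "best python web framework",
    "who won the football match",
    "how to train for a marathon",
]
labels = ["computers", "computers", "sports", "sports"]

nb = MultinomialNB().fit(docs, labels)
rocchio = Rocchio().fit(docs, labels)
query = "which python framework should i learn"
print(nb.predict(query), rocchio.predict(query))
```

Naive Bayes works from raw term counts, while Rocchio compares the query against one averaged TF-IDF vector per class, so the two can disagree on short or ambiguous questions.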

Dataset used can be found here: https://www.kaggle.com/datasets/yacharki/yahoo-answers-10-categories-for-nlp-csv/data
