Project Objectives
Build a model which accurately classifies news articles to an appropriate topic
Build a web app interface which allows users to use the model
Project Phases & Timeline
Data Cleaning & Analysis (Week 1)
Handling null values, fixing data types, data inconsistencies
EDA
Feature Engineering & Model Selection (Week 2)
Building new features, merging other forms of data
Training traditional NLP models
Performance evaluation of traditional NLP models
Tuning a huggingface model (optional)
Use generative model to suggest labels to unlabeled articles (optional)
Evaluate suggested labels and use curated articles to improve traditional NLP models (optional)
Deployment (Week 3)
Building a streamlit app
Deploying model to app
Deploying app to streamlit cloud
Link to GitHub: https://github.com/SuperDataScience-Community-Projects/SDS-CP016-news-article-classification