post cover image

[Completed] SDS CP #16 - News Article Classification

Project Objectives

  • Build a model which accurately classifies news articles to an appropriate topic

  • Build a web app interface which allows users to use the model

Project Phases & Timeline

Data Cleaning & Analysis (Week 1)

  • Handling null values, fixing data types, data inconsistencies

  • EDA

Feature Engineering & Model Selection (Week 2)

  • Building new features, merging other forms of data

  • Training traditional NLP models

  • Performance evaluation of traditional NLP models

  • Tuning a huggingface model (optional)

  • Use generative model to suggest labels to unlabeled articles (optional)

  • Evaluate suggested labels and use curated articles to improve traditional NLP models (optional)

Deployment (Week 3)

  • Building a streamlit app

  • Deploying model to app

  • Deploying app to streamlit cloud


Link to GitHub: https://github.com/SuperDataScience-Community-Projects/SDS-CP016-news-article-classification

Post a comment