Project Fake Reviews Detection SOMTE

- 1 min

Synopsis

Online consumer reviews help buyers decide what to trust, yet deceptive, computer-generated posts routinely distort perceptions. This research build investigates automated detection strategies beyond BERT by comparing classical algorithms and modern deep learning models on a balanced dataset of 20,000 fake and 20,000 authentic reviews spanning multiple product categories.

Problem Statement

The presence of fake reviews undermines the credibility of e-commerce platforms. The goal is to distinguish computer-generated (CG) fake reviews from genuine, human-authored (OR) reviews. By exploring alternative model architectures and feature pipelines—including traditional ML, hybrid ensembles, and transformer-based representations—the project seeks to surface stronger detection signals.

Highlights

Tech Stack & Methods

Python, scikit-learn, PyTorch, transformer encoders, SMOTE variants for imbalance resilience, and experiment tracking with reproducible notebooks.

comments powered by Disqus