Skip to content

Zayuki/Fake-Review-Detection-in-E-commerce-Platform

Repository files navigation

Fake Review Detection in E-Commerce Platforms

Overview

This project focuses on the classification of fake reviews in e-commerce platforms using advanced transformer-based models. The primary goal is to accurately identify computer-generated fake reviews, leveraging the capabilities of models like ELECTRA and MiniLM. This repository contains the code and datasets used in the study, along with detailed documentation on the methodology and results.

Dataset

The dataset includes Amazon reviews, with a balanced mix of human-written and computer-generated fake reviews. The fake reviews were generated using GPT-2, ensuring diversity and complexity in the data. This approach helps avoid biases typically present in manually labeled datasets.

Project Flow Chart

image

Algorithm Selections

  • ELECTRA-Base
  • ELECTRA-Small
  • ELECTRA-Large
  • MiniLM

Hyperparameters of the models

Hyperparameter Base ELECTRA Large ELECTRA Small ELECTRA MiniLM
Repository Path google/electra-base-discriminator google/electra-large-discriminator google/electra-small-discriminator microsoft/Multilingual-MiniLM-L12-H384
Parameters 109M 335M 14M 118M
Batch Size for Training 32 6 32 32
Batch Size for Testing 32 6 32 32
Epoch 2 1 2 2
Maximum Sequence Length 512 512 512 512
Activation Function GELU GELU GELU GELU
Learning Rate 0.00005 0.00005 0.00005 0.00005
Dropout Probability 0.1 0.1 0.1 0.1

Results

Model Parameters Accuracy Precision Recall F1-score Total Fine-tuning time (s)
ELECTRA-Base 109M 96 96 96 96 2984
ELECTRA-Small 14M 95 95 95 95 651
ELECTRA-Large 335M 98 98 98 98 12649
fakeROBERTA (Salminen et al., 2022) 125M 97 97 97 97 3069
MiniLM 118M 97 97 97 97 1054

Conclusion

In conclusion, fake reviews can mislead consumers which cause damage to both business and consumers. Apart from that, the advancement of LLM or AI models in recent years has contributed to the generation of fake reviews using these algorithms. Thus, the computer-generated fake reviews on e-commerce platforms presents a significant challenge, potentially misleading consumers and harming both businesses and consumers. This study has effectively addressed this challenge by investigating the capabilities of transformer-based models in detecting computer-generated fake reviews. Our research focused on models like ELECTRA and MiniLM, which have demonstrated their competitiveness and, in several aspects, matched or even surpassed the current state-of-the-art model, fakeRoBERTa.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published