Quantifying Commercial Disparagement by Analyzing Algorithmic Bias in the spambase Dataset with a Random Forest

Main Article Content

πŸ‘€ Arie Setya Putra
🏒 Department Computer Science, Faculty of Mathematics and Sciences, Lampung University, Bandar Lampung 35145, Indonesia
πŸ‘€ Admi Syarif
🏒 Department of Computer Science, Faculty of Mathematics and Natural Sciences, Lampung University, Bandar Lampung 35145, Indonesia
πŸ‘€ Mahfut Mahfut
🏒 Department of Biology Faculty of Mathematics and Sciences, Lampung University, Bandar Lampung 35145, Indonesia
πŸ‘€ Sri Ratna Sulistiyanti
🏒 Department of Electrical Engineering, Faculty of Engineering, Lampung University, Bandar Lampung 35145, Indonesia
πŸ‘€ Muhammad Said Hasibuan
🏒 Institut of Informatics and Business Darmajaya, Bandar Lampung 35141, Indonesia

Automated decision-making systems, such as spam filters, are ubiquitous but increasingly scrutinized for algorithmic bias. While most scholarship focuses on social discrimination, this research investigates a novel legal claim: algorithmic commercial disparagement. We posit that a machine learning filter trained on a single company's "personalized" data can systematically and unfairly penalize its competitors, creating a data-driven basis for a tortious interference claim. This study provides an empirical model for this legal thesis using the spambase dataset. A Random Forest classifier was trained, achieving a high baseline accuracy of 94.57%β€”a "veneer of neutrality" that would justify its commercial deployment. However, a feature importance analysis revealed the model’s logic was biased, learning to associate corporate-specific keywords (e.g., hp, hpl, george) with non-spam emails. To quantify the harm, we simulated "internal" (Set A) and "competitor" (Set B) communications from the legitimate test data. The results demonstrate a significant disparate impact: the False Positive Rate (FPR) for internal emails was 1.31%, while the FPR for competitor emails was 5.53%. This shows the filter is 4.2 times more likely to wrongfully block a competitor's legitimate communication. This study concludes that this foreseeable, quantifiable harm, resulting from the negligent deployment of a biased model, provides an empirical foundation for claims of algorithmic commercial disparagement.

Putra, A. S., Syarif, A., Mahfut, M., Sulistiyanti, S. R., & Hasibuan, M. S. (2025). Quantifying Commercial Disparagement by Analyzing Algorithmic Bias in the spambase Dataset with a Random Forest. Journal of Cyber Law, 1(4), 330–343. Retrieved from https://jcl.mbicore.com/index.php/jcl/article/view/2

Article Details

Section
Articles

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.