WhatsApp me








The Secret to Optimizing Strategies and Maximizing Results .
A/B test - Statistical Analysis


Project URL in R: AB-test-notebook-in-R.html

Tools used: RStudio, Microsoft Excel

Section 1: Project Introduction



Problem Description

We are working on an A/B testing data analysis project, where we analyze data from a website that launched an A/B experiment with the goal of increasing its revenue.

The data provided contains some misleading features that may affect the final outcome of the experiment, and we are tasked with determining if there is a significant difference between the two options.

Business Objective

"The purpose of this project is to analyze the results of an A/B experiment conducted on a website aiming to increase its revenue and provide recommendations based on the findings of the analysis."

Section 2: Data Description

The provided Excel file contains raw data, including user ID, sample type, and revenue generated by the user. The file contains simulated data from an A/B experiment with the following attributes:

Column Type of data Subtype of data Ranges and categories
1. User ID Categorical Nominal ID with numbers from 1 to 10000
2. Variant Name Categorical Nominal Variant, Control
3. Income Numerical Continuous $0 to $196

Section 3: Data Cleaning

User Distribution



Observations

User distribution after cleaning



Actions taken for cleaning

Section 4: Exploratory Data Analysis

Income distribution vs variant





Observations

Section 5: Statistical analysis

Standard error and 95% confidence interval for both variants




Observations

Density curve for both variants



Observations

Is the difference in means statistically significant?



Observations

Section 6: Conclusion

The results suggest that the observed difference in income generated between the control group and the variant group is within the range of variation due to chance, and is therefore not statistically significant.

If resources and interest are available, we recommend continuing the A/B test to collect more data that can lead to more certain results, although we must emphasize that the difference between both tests is not very promising.

©2023 Abraham Cedeño Levy