This report presents findings from Flipkart's product dataset. After a thorough cleanup, we analysed price metrics (actual and selling prices), stock availability, seller and brand performance, and the effect of discounts on product and subcategory ratings. The aim is to give a clear overview of these dynamics.
Report: Click to View PDF
- The dataset used in this analysis is sourced from Kaggle.
- The data used in this analysis was collected in 2021.
- The dataset is presented in Excel.
- It consists of 393,120 rows and 14 columns.
- The currency used is the Indian Rupee (₹).
The cleaned dataset includes the following key variables and features:
- product_id: Identifier assigned to each unique product in the dataset.
- actual_price: The original price of the product before any discounts.
- average_rating: The average customer rating based on reviews for the product.
- category: The primary grouping/type that defines the product.
- discount: The reduction percentage applied to the product's original price.
- out_of_stock: Indicates whether the product is currently unavailable for purchase.
- seller: The individual or business selling the product.
- selling_price: The current price at which the product is being offered for sale.
- sub_category: More detailed classification within the main category, providing additional product details.
- brand: The brand associated with the product.
The cleaned dataset contains 28,080 unique products.
There are 498 distinct sellers and 346 brands contributing to the platform's product offerings. The dataset comprises 4 categories and 22 subcategories, reflecting the range of products available to Flipkart’s users.
To establish a benchmark for the typical cost of products in the Flipkart dataset, we calculated the average actual price and the average selling price.
Average of Actual Price | Average of Selling Price |
---|---|
₹ 1,415.25 | ₹ 705.58 |
The range between minimum and maximum prices for both actual and selling prices provides insights into the diversity of pricing within the dataset.
Minimum Actual Price | Minimum of Selling Price |
---|---|
₹ 150.00 | ₹ 99.00 |
Maximum Actual Price | Maximum of Selling Price |
---|---|
₹ 12,999 | ₹ 7,999 |
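As a rough illustration of how the average, minimum, and maximum figures above could be derived, the sketch below summarises both price columns. The three rows are made-up placeholders, not records from the dataset; only the column names `actual_price` and `selling_price` come from the key variables list, and the `summarise` helper is hypothetical.

```python
from statistics import mean

# Hypothetical rows for illustration only; not taken from the Flipkart dataset.
products = [
    {"actual_price": 1000.0, "selling_price": 500.0},
    {"actual_price": 2000.0, "selling_price": 1200.0},
    {"actual_price": 600.0, "selling_price": 400.0},
]

def summarise(rows, column):
    """Return the mean, minimum, and maximum of one numeric column."""
    values = [row[column] for row in rows]
    return {"mean": mean(values), "min": min(values), "max": max(values)}

actual_summary = summarise(products, "actual_price")
selling_summary = summarise(products, "selling_price")
print(actual_summary)
print(selling_summary)

# Gap between the two reported averages (a ratio of averages,
# which is not the same thing as the average per-product discount).
average_gap = 1 - 705.58 / 1415.25
print(f"{average_gap:.1%}")
```

Applied to the reported averages, the last line shows that the average selling price sits roughly 50% below the average actual price.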
Stock status | Products | Share |
---|---|---|
Out of stock products | 1,644 | 5.85% |
In stock products | 26,436 | 94.15% |
Analysing the inventory status of Flipkart's product catalog, we found that 1,644 of the 28,080 products (5.85%) are currently out of stock, so only a small portion of products is temporarily unavailable for purchase. The large majority of the inventory, 94.15%, is in stock. This high in-stock share points to healthy inventory management that serves the diverse needs of Flipkart's customers.
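The percentages follow directly from the two counts in the stock table above. A minimal check of that arithmetic, using only the reported figures (the variable names are ours):

```python
# Counts reported in the stock table above.
out_of_stock = 1_644
in_stock = 26_436
total = out_of_stock + in_stock

# Share of each status as a percentage of all products, to two decimals.
out_share = round(100 * out_of_stock / total, 2)
in_share = round(100 * in_stock / total, 2)
print(total, out_share, in_share)  # prints: 28080 5.85 94.15
```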
All five of the priciest products in the dataset share the same price of ₹ 12,999. The product types are also consistent: suits, jackets, and sweatshirts. This suggests that these high-priced items cater to a niche demand for sophisticated and stylish men's apparel.
This analysis ranks the top 10 brands by a weighted average that combines the number of products with the average rating. The weighting allows for a fair evaluation, giving prominence to brands with both high ratings and a substantial product presence.
In this context, Reebok emerges as the top performer, showcasing consistent excellence across a significant product range. Arbour closely follows in the second spot, demonstrating a commendable balance between quality and quantity. Notably, Keoti, despite having an average rating of 3.82, ranks sixth when considering the weighted average. This highlights the significance of the weighted average, which ensures that both ratings and the number of products contribute meaningfully to how top-performing brands are ranked.
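The report does not state the exact weighting formula, so the following is only a sketch of one common approach: shrinking each brand's average rating toward the overall mean, with less shrinkage as the product count grows. The function `weighted_rating`, the constants `GLOBAL_MEAN` and `MIN_PRODUCTS`, and the three brands are all assumptions made for illustration, not values or methods taken from the analysis.

```python
def weighted_rating(avg_rating, n_products, global_mean, min_products):
    """Shrink a brand's average rating toward the global mean.

    Brands with few products are pulled strongly toward global_mean,
    while brands with many products keep most of their own rating.
    """
    weight = n_products / (n_products + min_products)
    return weight * avg_rating + (1 - weight) * global_mean

# Hypothetical brands (name, average rating, product count); not dataset values.
brands = [
    ("Brand A", 4.5, 10),
    ("Brand B", 4.2, 500),
    ("Brand C", 3.8, 1000),
]

GLOBAL_MEAN = 4.0    # assumed overall average rating
MIN_PRODUCTS = 100   # assumed smoothing constant

# Rank brands from the highest weighted rating to the lowest.
ranked = sorted(
    brands,
    key=lambda b: weighted_rating(b[1], b[2], GLOBAL_MEAN, MIN_PRODUCTS),
    reverse=True,
)
for name, rating, count in ranked:
    score = weighted_rating(rating, count, GLOBAL_MEAN, MIN_PRODUCTS)
    print(f"{name}: {score:.3f}")
```

In this toy example, Brand A has the highest raw rating but only 10 products, so it ranks below Brand B once product count is taken into account. This is the same kind of effect the weighted ranking is meant to capture.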
3. Create a table showcasing the average actual/selling prices for the top 15 brands that have the highest number of products in the dataset.
Turning to seller ratings, RETAILNET emerges as the top seller with an average rating of 4.11 across 1,416 products. ARBOR follows closely with a 4.10 rating across 783 products. SANDSMARKETING, by contrast, ranks among the bottom sellers with a 2.68 rating across 887 products, signaling potential challenges in customer satisfaction.
While the average rating for Bags, Wallets & Belts is 4.13, indicating positive feedback, it's crucial to note the small sample size of only 41 products within this category. The analysis suggests that caution is needed when considering Bags, Wallets & Belts as the best-performing category. In contrast, Clothing and Accessories, with 27,118 products, provides a much larger dataset for assessing customer preferences.
A weighted rating was used because it gives a balanced view of how well each product category performs. By taking into account both the number of products and their average ratings, it reflects not only the variety of products but also how satisfied customers are with them.
The weighted average rating was also computed for subcategories. Considering both quantity (number of products) and quality (average rating) gives insight into the overall success of each subcategory.
Notably, Topwear emerges as the top-performing subcategory, excelling in both the diversity of products and customer satisfaction.
In the given dataset, out-of-stock products are found in two main categories: Clothing & Accessories and Footwear.
The distribution of these out-of-stock products differs among subcategories, with the highest numbers in Topwear, followed by Clothing Accessories, Bottomwear, and Winter Wear.
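A breakdown like this can be produced by counting out-of-stock rows per subcategory. The sketch below assumes `out_of_stock` is stored as a boolean, which may not match the actual encoding in the dataset, and the rows are invented placeholders rather than real records.

```python
from collections import Counter

# Hypothetical rows for illustration only; not taken from the Flipkart dataset.
products = [
    {"sub_category": "Topwear", "out_of_stock": True},
    {"sub_category": "Topwear", "out_of_stock": True},
    {"sub_category": "Topwear", "out_of_stock": False},
    {"sub_category": "Bottomwear", "out_of_stock": True},
    {"sub_category": "Winter Wear", "out_of_stock": False},
]

# Count only the rows flagged as out of stock, grouped by subcategory.
out_of_stock_counts = Counter(
    row["sub_category"] for row in products if row["out_of_stock"]
)

# most_common() sorts subcategories from the highest count to the lowest.
for sub_category, count in out_of_stock_counts.most_common():
    print(sub_category, count)
```

Subcategories with no out-of-stock rows, such as Winter Wear in this toy data, do not appear in the result at all.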
Most Discounted Subcategories: Brand Trunk Bags, Wallets & Belts (74%) has the highest average discount of all subcategories.
Other highly discounted subcategories worth noting are Fabrics (66%), Roy Clothing and Accessories (65%), and SUNSHOPPING Bags, Wallets & Belts (63%).
2. Do you think the discount is responsible for the rating (based on products)? | Analyse the correlation between discount percentages and average ratings.
The scatter plot shows a negative relationship between product ratings and discounts: the downward slope of the trend line indicates that as the discount increases, the rating tends to decrease.
This suggests that a high discount does not, by itself, lead customers to rate a product more highly. On average, there is a slight tendency for higher ratings to be associated with slightly lower discounts.
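The direction and strength of such a relationship can be quantified with the Pearson correlation coefficient and the slope of a least-squares trend line. The sketch below is one way to compute both and is not necessarily the method behind the scatter plot. The helper `pearson_and_slope` and the four data points are hypothetical and chosen only to show a negative relationship.

```python
from statistics import mean

def pearson_and_slope(xs, ys):
    """Return (Pearson r, least-squares slope of y on x)."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    r = sxy / (sxx * syy) ** 0.5
    slope = sxy / sxx
    return r, slope

# Hypothetical (discount %, average rating) pairs; not dataset values.
discounts = [10, 30, 50, 70]
ratings = [4.2, 4.0, 3.9, 3.7]

r, slope = pearson_and_slope(discounts, ratings)
print(f"r = {r:.3f}, slope = {slope:.4f} rating points per discount point")
```

A negative `r` and a negative slope both indicate that ratings fall as discounts rise. Neither value shows that discounts cause lower ratings, since other factors may drive both.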
In summary, our analysis highlights Retailnet and Arbor as top-rated sellers, along with Reebok and Arbour as leading brands. The standout subcategory is Topwear.
Our analysis suggests that offering high discounts does not guarantee better product ratings. Other factors, such as product quality and sizing, also play a crucial role in customer satisfaction and should be taken into account.
Furthermore, our study shows that the priciest products are predominantly men's apparel. These insights offer a useful perspective on seller and brand dynamics, subcategory performance, and the factors shaping customer ratings on the Flipkart platform.