E-Commerce Sales Trends and Customer Analysis
Table of Contents
Introduction
Olist is one of Brazil’s leading e-commerce platforms. Founded in 2015, it connects small and medium-sized businesses by connecting them with major online marketplaces across the country, and is now expanding its presence internationally.
This project uses real commercial data from Olist’s early operational period (2016–2018), including over 100,000 orders, to explore the following questions:
- How did Olist’s total sales evolve across months and product categories? Are there clear seasonal patterns or growth phases?
- Can customers be grouped into clusters based on their purchasing behavior and value?
- Which customer segments and product categories contribute most to total revenue, and which represent growth opportunities?
Executive Summary
Between 2016 and 2018, Olist processed over 100,000 orders, generating approximately $4.8 million in sales. The company experienced rapid expansion in 2017, with an average monthly growth rate of 14.7%, followed by a period of stabilization in 2018 (2.6% monthly growth).
The Southeast region, including São Paulo, Rio de Janeiro, and Minas Gerais, emerged as the dominant market, contributing 63% of total revenue.
Customer segmentation revealed four main behavioral profiles based on purchasing patterns, satisfaction, and delivery experience:
The Southeast region, including São Paulo, Rio de Janeiro, and Minas Gerais, emerged as the dominant market, contributing 63% of total revenue.
Customer segmentation revealed four main behavioral profiles based on purchasing patterns, satisfaction, and delivery experience:
- High-Value Premium Buyers: Highest spend per order and strong satisfaction, though mostly one-time purchasers.
- Satisfied Low-Spending Shoppers: Largest group; highly satisfied but with low transaction values.
- Unsatisfied Low-Spending Shoppers: Lowest satisfaction due to delayed deliveries and high freight costs.
- Loyal Repeat Customers: Small but valuable group with consistent repeat purchases and steady satisfaction.
Working Datasets
Figure 1. Entity Relationship Diagram showing the three relational datasets used in this project.
See the Data Pipeline project for details on data preprocessing.
PK: Primary Key; FK: Foreign Key.
See the Data Pipeline project for details on data preprocessing.
PK: Primary Key; FK: Foreign Key.
Detailed Insights
Sales Trends and Growth Rate
- Olist recorded over $4.8 million in online sales between 2016 and 2018 (~$2.2 million per year), receiving an average of 48,000 orders per year. The Average Order Value (AOV) is of 50 USD and remained constant across time.
- The company experienced strong growth in 2016-2017, with an average monthly growth rate of 14.7%, while it slowed to 2.6% in 2018, indicating a period of stabilization after rapid expansion.
- The Southeast region (comprising the states of São Paulo, Rio de Janeiro, and Minas Gerais) accounts for 63% of total sales, totaling approximately 3 millions USD.
Figure 2. Dashboard showing the monthly growth rate, average order value, and total sales, and its variation across states. Open in Tableau Public to explore the data interactively.
Customer Behavior
-
Four distinct customer segments were identified based on purchasing behavior, satisfaction, and delivery experience:
-
Cluster 1 – High-Value Premium Buyers:
- Customers with the highest spending per order (≈ 192 USD) and strong satisfaction (4.0/5).
- They purchase high-value products and experience early deliveries.
- Although mostly one-time buyers, they represent highly profitable transactions.
-
Cluster 2 – Satisfied Low-Spending Shoppers:
- The largest group, making single, low-value purchases (~34 USD) with excellent satisfaction (4.7/5).
- They experience early deliveries and pay relatively high freight costs.
- Represent happy but casual customers with low long-term engagement.
-
Cluster 3 – Unsatisfied Low-Spending Shoppers:
- Lowest satisfaction segment (1.8/5) with small purchases (~37 USD).
- Deliveries are slower, and freight costs are high (~37% of total order value).
- This group highlights potential service and logistics issues affecting customer experience.
-
Cluster 4 – Loyal Repeat Customers:
- The only segment with repeat purchasing behavior (~2 orders per customer).
- They are satisfied (4.1/5), buy moderately priced items (~45 USD), and maintain average freight costs.
- This cluster represents loyal, consistent customers and a key retention opportunity.
-
Cluster 1 – High-Value Premium Buyers:
Figure 3. Top: Monthly sales by customer cluster during the 2017–2018 period. Bottom: Radar chart illustrating the key behavioral and value attributes of each cluster.
- Overall, the majority of revenue is driven by Cluster 2, with an average monthly share of 47.1%, followed by Cluster 1 at 36.6%, while Cluster 3 and Cluster 4 contribute smaller but significant amounts, averaging 10.3% and 6.0% of monthly revenue, respectively.
- Only 3.1% of Olist customers made repeat purchases, which nonetheless contributed to 5.6% of the total revenue.
Recommendation
The analysis highlights key opportunities to improve Olist’s growth trajectory and customer satisfaction. Strategic actions should focus on enhancing retention, optimizing logistics, and differentiating customer experiences across segments.
- Retain High-Value Buyers: Implement loyalty incentives, premium delivery options, or exclusive product bundles to foster repeat purchases among top spenders, with a focus on Customer Cluster 1.
- Convert Satisfied Low-Spending Shoppers: Encourage higher basket values through targeted upselling, product recommendations, or free-shipping thresholds, specifically targeting Customer Cluster 2.
- Improve Delivery Performance: Address delays and high freight costs affecting low-satisfaction customers through better carrier partnerships and delivery tracking.
- Strengthen Customer Retention: Given that only 3.1% of customers make repeat purchases, develop reactivation campaigns and personalized follow-ups to increase repeat rates.
- Regional Expansion: Build on the Southeast region’s strong performance by replicating best practices in emerging markets with growing e-commerce adoption.
Assumptions and Caveats
- The analyzed dataset covers a relatively short time period and reflects an early stage in the company’s development. This limits the ability to identify seasonal patterns, which typically emerge more clearly over longer time spans.
- Despite the large volume of data (over 100,000 orders), the dataset contains very few repeat customers. This limits the feasibility of a traditional Recency–Frequency–Monetary (RFM) segmentation approach. Instead, a combination of alternative customer characteristics was used to identify behavioral clusters, providing a broader range of insights, albeit through an unconventional approach.
Code
The analysis was implemented in Python and is openly available on
GitHub.