Maven Analytics courses have been a valuable resource for advancing my data skills. Maven posted a LEGO data visualization challenge and inspired me to use my newly acquired Tableau skills to play with the dataset.
Well Played | Lego Data Visualization
LEGO means to “play well” in Danish. I have always been captivated by the creative freedom of a plethora of tiny plastic bricks. Much like LEGO encourages exploration, I included a detail sheet linked from the small blue LEGO icon for others to explore the data pieces independently. Visualizations focused on the theme transitions and LEGO nostalgia.
I focused my analysis on traditional LEGO sets, thus the notation of “Ages 4+. “Duplo sets were filtered out, as were sets geared for educational or workshop use. When exploring the data, I noticed that limiting my analysis to sets with 100+ pieces or more removed most of the missing pricing data, as most of these sets were promotional or polybags. These changes improved the data quality to display set themes over time.
To validate this choice, considerable time was spent researching the “sets” below this threshold. Report users can bring any filtered-out sets back into view on the detail page.
Data Ink
Edward Tufte, a statistician and sculptor, is considered a pioneer in data visualization. He coined the term “data-ink“: the amount of ink used to represent data divided by the total ink used in a visual.
I applied Tufte’s principles by adding the grid lines to distinguish the 50-count intervals on the focal “Top Themes by Set Release Count” bar chart. Color intensity was used to layer on an additional feature of the years a theme had been produced.
Data Bricks
I created a set of 1×4, 1×3, 1×2, and 1×1 illustrations to layer behind the average price bars. Each little nub illustrates a $2.50 price increase, a decorative and practical data marker.
My 11-year-old brought me a 1×4 for reference to ensure these pieces were to scale.
Set Selection Criteria
My boys are keenly aware of the minifigure count in LEGO sets they select. I created this concentration grid to illustrate which themes had the best ratio of figures, which is valuable information for an 11-year-old boy looking to increase his minifigure collection to enhance his stop-motion movies.
This Lego data visualization benefited from filtering out the sets that were 100 pieces or fewer. A small bag of minifigures would have a high average minifigure ratio, skewing the data.
This visual has an additional tooltip that shows the total number of Minifigures for these themes in addition to the Minifigure average.
LEGO Data Design
Every pixel was purposely placed and designed to enhance your understanding of this 50-year history.
Details:
- Custom color theme inspired by the LEGO.com website and product palette
- Edward Tufte inspired details, such as vertical white divider lines and color-coded number groupings on the set count by year
- Darth Vader clip art, created in Photoshop from an image I took of one of our minifigures, glowing lightsaber and drop shadow included
- History notations: 1970 was thirty-four years after the company started, and they only released 14 sets
- Custom duck icon on the detail sheet, noting the start of LEGO’s history, which began with a clever use of wood scraps