Reverse Vending

Building a Real-Time Packaging Recognition System with Edge-Optimized AI

Edge Devices, Computer Vision, Data Engineering

With sub-500ms detection and a lightweight deployment package, the computer vision model is built to scale and ready to support hundreds of machines in the national deposit return network.

Process & Story

Our client builds Reverse Vending Machines: intelligent machines that automate the collection of recyclable bottles and cans. To ensure the accuracy of returns and prevent fraud, they needed a solution that could verify an object’s shape, material, and barcode, and distinguish eligible packages from lookalikes and damaged containers.

We designed a lightweight machine learning system tailored to their hardware and performance constraints. The result? A scalable, fast packaging recognition system ready for nationwide deployment.

Problem

The client’s machines operate within a deposit return scheme. Accepting recyclables is not enough; the machines must also:

Match the scanned barcode to the actual package,
Verify the physical condition of each item (no crushed or misshapen bottles),
Detect fraudulent attempts, like scanning a valid EAN code and inserting a non-eligible copycat.
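The three checks above can be read as one decision function. The following is a minimal sketch, not the client's implementation: the `CatalogEntry` and `Observation` types, the `verify` function, the tolerance value, and the catalog contents are all hypothetical names introduced for illustration. Only the EAN-13 check-digit rule is standard.

```python
from dataclasses import dataclass
from typing import Dict


def ean13_is_valid(code: str) -> bool:
    """Validate the EAN-13 check digit (weights 1, 3, 1, 3, ... from the left)."""
    if len(code) != 13 or not (code.isascii() and code.isdigit()):
        return False
    digits = [int(c) for c in code]
    total = sum(d * (3 if i % 2 else 1) for i, d in enumerate(digits[:12]))
    return (10 - total % 10) % 10 == digits[12]


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical reference data for one eligible package."""
    material: str
    height_mm: float
    diameter_mm: float


@dataclass(frozen=True)
class Observation:
    """Hypothetical output of the vision model for the inserted item."""
    material: str
    height_mm: float
    diameter_mm: float
    damaged: bool


def verify(barcode: str, obs: Observation,
           catalog: Dict[str, CatalogEntry], tolerance_mm: float = 3.0) -> str:
    """Return "accept" or a rejection reason for one inserted item."""
    if not ean13_is_valid(barcode):
        return "reject: invalid barcode"
    entry = catalog.get(barcode)
    if entry is None:
        return "reject: not eligible"
    if obs.damaged:
        return "reject: damaged"
    mismatch = (
        obs.material != entry.material
        or abs(obs.height_mm - entry.height_mm) > tolerance_mm
        or abs(obs.diameter_mm - entry.diameter_mm) > tolerance_mm
    )
    # A valid barcode paired with the wrong physical object is the fraud case.
    return "reject: barcode/package mismatch" if mismatch else "accept"
```

In this sketch the barcode alone never grants a deposit: the item is accepted only when the observed material and dimensions agree with the catalog entry for that code.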

All of this needs to happen in real time on a compact Raspberry Pi 5, inside a public-facing machine.

Solution

We designed and delivered a real-time object recognition system that helps our client verify recyclables and prevent fraud in deposit return machines. The solution combined custom dataset development, lightweight model training, and edge deployment, all optimized to run on a Raspberry Pi 5 within a strict 500ms time budget.

Outcomes

Custom dataset built from high-resolution machine-captured images, tailored to the real-world packaging types and fraud scenarios
Accurate object recognition achieved through lightweight segmentation and classification models optimized for edge deployment
Sub-500ms inference time on Raspberry Pi 5, enabling real-time fraud detection and package validation
End-to-end ML pipeline delivered, including data ingestion, deduplication, labeling workflow, model training, optimization, and packaging
Experiment tracking and performance benchmarking via Neptune.ai and ONNX Runtime, ensuring transparency and reproducibility

Custom dataset and labeling pipeline

To build a model capable of verifying both object type and condition, we developed a tailored dataset from high-resolution images captured directly by the client’s machines. Since existing public datasets lacked the precision or licensing required, we established a custom ingestion pipeline to organize raw image archives, eliminate duplicates, and preserve metadata such as object dimensions and material.
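The deduplication step could look roughly like the sketch below. It assumes duplicates are byte-identical files and detects them by SHA-256 content hash; the case study does not say which method was actually used, and near-duplicate images (re-encoded or slightly shifted frames) would need a perceptual approach instead. The function names and the list of file suffixes are illustrative.

```python
import hashlib
from pathlib import Path
from typing import Dict, List, Tuple


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large images are never loaded whole."""
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def find_unique_images(
    root: Path, suffixes: Tuple[str, ...] = (".png", ".jpg", ".jpeg")
) -> Tuple[List[Path], List[Path]]:
    """Split image files under root into first occurrences and exact duplicates."""
    seen: Dict[str, Path] = {}
    duplicates: List[Path] = []
    # Sorting makes the choice of which copy to keep deterministic.
    for path in sorted(root.rglob("*")):
        if not path.is_file() or path.suffix.lower() not in suffixes:
            continue
        digest = file_digest(path)
        if digest in seen:
            duplicates.append(path)
        else:
            seen[digest] = path
    return list(seen.values()), duplicates
```

Keeping the first path in sorted order means repeated runs over the same archive select the same files, which helps when labels and metadata are keyed by file path.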

We deployed a self-hosted instance of Label Studio and onboarded an external labeling team to efficiently annotate thousands of samples. Clear labeling guidelines and active feedback loops ensured accurate segmentation, even in edge cases like open bottle caps or pull-tabs. This labeling infrastructure was designed for reuse, so the client can expand the dataset and improve accuracy over time.

Iterative model development for segmentation and verification

We evaluated existing image segmentation models but found them too large or slow for on-device execution. Instead, our team took an iterative approach: a data scientist built an initial segmentation model optimized for quality, which was then fine-tuned, ported, and tested by an ML engineer for performance on the target hardware.

Each iteration was tracked using Neptune.ai, with benchmark metrics including per-image inference time and hardware compatibility via ONNX Runtime. This approach allowed us to gradually reduce model complexity while preserving accuracy, ultimately delivering a version that met both speed and reliability thresholds.
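A per-image latency benchmark of the kind described here can be sketched with the standard library alone. This is an illustration under assumptions: `benchmark` and `within_budget` are hypothetical helpers, the warm-up and run counts are arbitrary, and the choice of the 95th percentile as the pass criterion is ours. In practice the `infer` callable would wrap a single model invocation, for example a call to ONNX Runtime's `InferenceSession.run` on one preprocessed image.

```python
import math
import statistics
import time
from typing import Callable, Dict


def benchmark(infer: Callable[[], object], warmup: int = 3, runs: int = 20) -> Dict[str, float]:
    """Time repeated calls to infer() and report latency statistics in milliseconds."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    for _ in range(warmup):
        infer()  # warm-up calls are excluded from the measurements
    samples_ms = []
    for _ in range(runs):
        start = time.perf_counter()
        infer()
        samples_ms.append((time.perf_counter() - start) * 1000.0)
    samples_ms.sort()
    # Nearest-rank 95th percentile.
    p95_index = max(0, math.ceil(0.95 * len(samples_ms)) - 1)
    return {
        "mean_ms": statistics.mean(samples_ms),
        "p95_ms": samples_ms[p95_index],
        "max_ms": samples_ms[-1],
    }


def within_budget(stats: Dict[str, float], budget_ms: float = 500.0) -> bool:
    """Check the tail latency, not just the mean, against the time budget."""
    return stats["p95_ms"] <= budget_ms
```

Judging a model by tail latency rather than the mean is a deliberate choice in this sketch: a machine facing the public has to respond in time on slow runs too. The resulting numbers could be logged to an experiment tracker alongside accuracy metrics for each iteration.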

Lightweight, deployable architecture

To simplify installation and updates, we shipped the entire solution as a versioned Python package (.tar.gz), including:

The final trained model,
Inference scripts optimized for the client’s setup,
A requirements list to streamline dependency management.

This modular packaging strategy reduced update sizes and made it easy for the client to deploy the solution at scale across multiple machines, even in bandwidth-limited environments.
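Producing a versioned archive like this takes only a few lines with Python's standard `tarfile` module. The sketch below is an assumption about the mechanics, not the project's build script: `build_release`, the package name, and the directory layout are hypothetical.

```python
import tarfile
from pathlib import Path


def build_release(src_dir: Path, out_dir: Path, name: str, version: str) -> Path:
    """Pack src_dir into out_dir/<name>-<version>.tar.gz and return the archive path."""
    out_dir.mkdir(parents=True, exist_ok=True)
    archive = out_dir / f"{name}-{version}.tar.gz"
    with tarfile.open(archive, "w:gz") as tar:
        # Everything is stored under a single versioned top-level folder.
        tar.add(str(src_dir), arcname=f"{name}-{version}")
    return archive
```

Embedding the version in both the file name and the top-level folder lets several releases sit side by side on a device, which makes rollback a matter of pointing back at the previous folder.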

Ready for future expansion

The system architecture supports future needs like:

Adding classification for brands or deposit eligibility,
Including new object types (e.g. glass containers),
Fine-tuning models as more labeled data becomes available,
Integrating with fraud detection dashboards and analytics tools.

The final solution delivers accurate, real-time image analysis at the edge, enabling secure, fraud-resistant recycling flows, with no cloud connectivity required.

Tools

Python

Google Cloud Platform

Challenges

Fragmented and low-quality image data
The first batches of imagery provided by the client were heavily downsampled, making it impossible to train a reliable model for shape or material detection. Without access to raw image detail, both segmentation accuracy and object classification suffered. We had to work with the client to re-establish a data acquisition process that delivered high-resolution images consistently and with proper metadata.

Lack of suitable public datasets
Off-the-shelf datasets fell short, not due to class coverage or volume alone, but because they lacked the specific object positioning and framing required for this use case. Since the system had to verify not just the type of container but also its shape and dimensions, precision was critical. Public datasets like TACO or the Bottles Dataset didn’t offer consistently positioned, high-resolution images that would support reliable size estimation or edge-case detection. To meet these specific requirements, we designed and built a custom dataset pipeline, from high-resolution image ingestion and deduplication to segmentation labeling and metadata tracking, ensuring full control over data quality and consistency.
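Consistent positioning matters because it turns a segmentation mask into a physical measurement. The sketch below assumes a fixed camera geometry, so a single millimetres-per-pixel scale applies to every image; that scale, the plain nested-list mask format, and both function names are hypothetical and chosen for illustration.

```python
from typing import List, Optional, Tuple

Mask = List[List[int]]  # rows of 0/1 values, 1 marking object pixels


def mask_bounding_box(mask: Mask) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right) of the object pixels, or None if empty."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, value in enumerate(row) if value]
    if not rows:
        return None
    return min(rows), min(cols), max(rows), max(cols)


def estimate_size_mm(mask: Mask, mm_per_pixel: float) -> Optional[Tuple[float, float]]:
    """Estimate (height_mm, width_mm) from the mask's axis-aligned bounding box."""
    box = mask_bounding_box(mask)
    if box is None:
        return None
    top, left, bottom, right = box
    height_px = bottom - top + 1
    width_px = right - left + 1
    return height_px * mm_per_pixel, width_px * mm_per_pixel
```

With images framed differently from shot to shot, as in general-purpose public datasets, no single scale factor holds and an estimate like this stops being reliable.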

Edge deployment under strict latency constraints
Running deep learning models on a Raspberry Pi 5 within a 500ms budget required precise engineering. Most existing segmentation models were too large or compute-intensive, and we couldn't rely on batching or cloud offloading, so the solution had to be lean and local. Our team alternated between tuning for quality and ONNX-based optimization until the model met the latency budget, making careful trade-offs between speed and accuracy.

Ambiguity in packaging edge cases
Real-world recyclables aren't always perfect. Some items came with unscrewed bottle caps, crushed necks, or partially torn labels, leading to labeling inconsistencies. Rather than over-engineering logic for every corner case, we created clear annotation standards and maintained a feedback loop with the external labeling team. This helped enforce consistency across the dataset and reduced noise during training.

Scalability and updatability across devices
With hundreds (and eventually thousands) of machines expected to run the model, maintenance needed to be lightweight. Static deployment packages quickly became impractical. We designed a packaging approach that kept the Python module and requirements separate, allowing the client to push targeted updates without re-downloading the entire environment.
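One way to decide what a targeted update must contain is to compare content digests per component. This is a sketch of the idea only: the manifest format (component name mapped to a digest string) and the `components_to_update` helper are assumptions, not the client's update mechanism.

```python
from typing import Dict, List


def components_to_update(installed: Dict[str, str], released: Dict[str, str]) -> List[str]:
    """List components whose released digest differs from, or is missing on, the device."""
    return sorted(
        name for name, digest in released.items() if installed.get(name) != digest
    )


# Example manifests with placeholder digests: only the model changed.
installed = {"model": "aaa", "inference": "bbb", "requirements": "ccc"}
released = {"model": "ddd", "inference": "bbb", "requirements": "ccc"}
pending = components_to_update(installed, released)
```

Here `pending` contains only `"model"`, so a machine on a slow link would fetch the new weights and leave the unchanged module and dependency list alone.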
