Logistics is moving forward at its own pace, testing the latest technological advancements and adapting to AI progress. While generative artificial intelligence offers more possibilities in other niches, logistics is also trying to harness it for its own needs.
In a conversation with Ritesh Ratti, Jarek Jarzębowski asks about key issues at the intersection of AI and logistics. What are the biggest challenges of implementing large language models in this sector? Are pre-built LLMs and Retrieval-Augmented Generation useful for logistics? How can a logistics company set its data science journey in the right direction?
Find answers in this insightful interview, where Ritesh Ratti shares his industry knowledge.
Key Takeaways from the Conversation
AI has put logistics on a direct route to efficiency: Instead of building manual rule-based models, logistics companies can now implement models based on labeled datasets, saving resources and expanding productivity.
Implementing AI in logistics can be a rocky path: Effective implementation remains the biggest challenge, as large language models can be time-consuming to process. Careful planning and tool selection are essential for optimized AI deployment.
The optimal route for LLMs in logistics is through fine-tuning: Using pre-built large language models can be risky in logistics, but fine-tuning techniques and reinforcement learning with human feedback will keep your solution on the right track.
Generative AI can set a new direction for various areas of logistics: Automation of the supply chain, product sizing, and anomaly detection are where Generative AI shows the greatest potential in this industry.
Conversation with Ritesh Ratti:
Jarek Jarzębowski: Can we start with you telling a little bit about yourself, your experience, and your background, especially focusing on using ML, AI, data, and logistics?
Ritesh Ratti: Sure. I’m Ritesh Ratti, currently working with Delivery Hero as the data science lead. I’ve built projects related to food taxonomy generation, using language models to develop food categorization and sub-categorization.
I have over 13 years of experience, starting as an engineer and moving into data science and machine learning. My career began at Samsung, developing a question-answering system, followed by work on recommendation systems at Aerial and Pitney Bowes. At Grab, I focused on recommender systems and food categorization using transformer-based modeling.
Currently, I lead a team of data scientists and engineers, working from POC stage to production. I hold a doctorate in data science and a master’s in computer science, with a focus on NLP-based models for anomaly detection, specifically on network data.
Jarek Jarzębowski: Great. Most of our readers are familiar with AI, data, and machine learning, but for the general public, AI has become more common with generative AI and large language models. Can you give us an overview of the AI landscape, especially in logistics?
Ritesh Ratti: The landscape has changed significantly over the years. Ten years ago, we relied on rule-based models for specific problems like address parsing in logistics. We would manually extract rules to parse addresses into different components, like street, postcode, and building name.
Harness the full potential of AI for your business
Now, machine learning allows us to build models based on labeled datasets, shifting away from manual rule-based systems. Generative AI is used to extract meaningful information from text, like addresses, using large language models trained on multilingual data. AI has also advanced in identifying products through images, which helps in estimating shipment costs more accurately.
In logistics, AI plays a vital role in computer vision models, product classification, and even optimizing models for production using technologies like Kubernetes and C++/C-based frameworks.
Jarek Jarzębowski: Can you explain the biggest challenges in introducing these models and getting them into production?
Ritesh Ratti: The biggest challenges are scaling these models to predict outcomes quickly and efficiently and then deploying them in production. Large language models can be time-consuming to process, so we use solutions like Kubernetes to distribute processing. We also optimize models through quantization, but we need to balance performance with accuracy.
For production, we often use FastAPI or Flask to expose models as APIs, and we deploy them using Docker and Kubernetes. Other tools like Spinnaker help automate deployments. Overall, it’s a complex stack, and careful planning is needed to ensure models run efficiently in production.
Jarek Jarzębowski: How useful are pre-built large language models in logistics compared to those you need to build from scratch?
Ritesh Ratti: Pre-built large language models can be challenging because they may not have been trained on data specific to your needs. However, you can use techniques like prompt-based engineering or few-shot learning to adapt these models by generating data to fine-tune them.
Reinforcement learning with human feedback is another strategy to improve these models iteratively. The key is to balance the use of pre-built models with custom fine-tuning to meet specific requirements in logistics.
Jarek Jarzębowski: Are retrieval-augmented generation (RAG) models useful in logistics? Can you explain what RAGs are and how they work?
Ritesh Ratti: RAG stands for Retrieval-Augmented Generation. It involves using a language model and providing instances of information from a vector database to answer questions or generate content.
RAG models first index data in a vector database for efficient retrieval and then use similar instances to generate better responses. This approach is beneficial in generating accurate solutions for complex problems by leveraging in-context learning, where the model uses provided examples to understand the context better and produce more relevant outputs.
Jarek Jarzębowski: With the rapid advancements in AI, do you see more opportunities for data science in logistics and supply chain in the near future?
Ritesh Ratti: Yes, there are many possibilities. Generative AI can enhance human interaction with systems, enabling deeper understanding and automation in tasks like product sizing and anomaly detection.
AI can also be integrated into robotics for supply chain automation, reducing human intervention. Additionally, multimodal generative AI can help analyze issues reported by consumers, identifying whether there are genuine product problems or fraud attempts.
The potential for AI to streamline and improve supply chain operations is vast, especially with the continued development of generative models.
Jarek Jarzębowski: What advice would you give to companies in logistics that are just starting their data science journey?
Ritesh Ratti: First, identify the areas in your logistics pipeline that need improvement, such as product optimization or machine learning solutions.
Then, assess the risks of implementing machine learning and have a backup plan in case results are not as expected. It’s essential to have a team of experts who understand your product and can build effective solutions.
If using generative AI, consider its cost and performance impact, and start with simpler approaches like prompt engineering before diving into more complex fine-tuning.
Lastly, ensure security measures are in place to protect user data, especially with GDPR compliance, and be cautious of potential issues like model hallucinations.
Jarek Jarzębowski: To have a successful product, we need experts in technology, security, and business. From your experience, which of these components is most often missing?
Ritesh Ratti: The most important aspect missing is business understanding. We need to evaluate from a business perspective whether we really need to invest heavily in machine learning, and to what extent, to ensure better user experience and customer satisfaction.
A simple solution for a big problem might be the best approach. If predefined rules can solve the issue, that’s the direction we should take, especially if those rules are stable and not likely to change soon.
Explainability is also crucial, especially when reverse engineering or identifying loopholes in models or data. These are key factors for driving the business forward.
Jarek Jarzębowski: One drawback of using data science is the inability to precisely predict outcomes and ROI beforehand, unlike traditional software development. How can we cope with these challenges when we don’t know if the solution will even be possible or how much it will cost?
Ritesh Ratti: That’s a great question. First, you need to check if historical data is available for the problem you’re trying to solve. Analyzing this data can help estimate ROI, like predicting increases in revenue if certain features are implemented.
Data analytics plays a vital role here. You should also consider costs like infrastructure and resources, and whether you can reduce man-hours. These factors together can help estimate ROI.
Jarek Jarzębowski: Can you give us some predictions about the challenges that might arise with using more of these technologies in the future?
Ritesh Ratti: I see a few major challenges. One is resource availability, especially as trends move towards building proprietary models.
Another is the environmental impact of training large language models, which can run for weeks and consume significant energy. Explainability remains crucial, especially in risk management, as incorrect predictions shouldn’t bring down an entire system.
Security is also a concern, as providing false information could lead to disastrous consequences. Lastly, the impact on education is significant; while these models can enhance knowledge, misuse could hinder conceptual understanding.
Jarek Jarzębowski: There’s a lot of potential for good with technology, but we also need to consider these challenges. Thank you for sharing your perspective and experience. Is there anything else you’d like to mention, especially regarding the use of data science in logistics?
Ritesh Ratti: Yes, there are many areas in logistics where data science can have a big impact. Inventory management is one, where you can invest heavily in estimating inventory levels and locations. Optimizing delivery routes and ensuring on-time delivery are also crucial areas. Another aspect is verifying the authenticity of delivery addresses. There’s a lot of potential in the logistics and supply chain sector, and it requires more research. It’s a huge opportunity.
Jarek Jarzębowski: We might delve deeper into these topics in the future, but for now, I think we’ve covered a lot of ground. This will make for an interesting article. Thank you again!
Ritesh Ratti’s Background
Ritesh Ratti is a dynamic Data Science and Machine Learning Lead with over 14 years of experience in crafting intelligent software solutions. Leading successful teams at HelloFresh and Delivery Hero, he applied his expertise in machine learning, data mining, and predictive modeling to drive impactful insights.
Ratti’s work is rooted in a deep understanding of classification, clustering, and text analytics. With a Ph.D. in Computer Science from the Indian Institute of Technology, Guwahati, he has also presented research at top conferences in machine learning and network security, pushing the boundaries of what’s possible in data science.
About Delivery Hero
Delivery Hero is a global leader in online food delivery and a pioneer in quick commerce, operating in around 70 countries across four continents. The company, headquartered in Berlin, was founded in 2011 and has rapidly grown to become one of the world’s largest online food delivery platforms, handling millions of orders daily.
Delivery Hero not only connects customers with restaurants and grocery stores but also focuses on innovative technology solutions to enhance the delivery experience, including the use of AI and automation. Committed to sustainability, it actively works on reducing its carbon footprint across its global operations.
Closing Thoughts
The recent advancements in AI and GenAI may set a new direction for logistics in the next few years. From computer vision powered autonomous solutions in ports or manufacturing halls, through predictive maintenance, to simulating optimization scenarios, the logistics today is already strongly AI-driven.
As Ritesh Ratti underlines, for these implementations to succeed, they need to be followed by careful planning and tool choice. Contrary to common conviction, even the pre-built large language models can benefit logistics with the right approach. We cannot wait to see how this new phase of AI’s development impacts and improves the industry!
Jarek is an experienced People & Culture professional and tech enthusiast. He is a speaker at HR and tech conferences and Podcaster, who shares a lot on LinkedIn. He loves working on the crossroads of humans, technology, and business, bringing the best of all worlds and combining them in a novel way. At nexocode, he is responsible for leading People & Culture initiatives.
Would you like to discuss AI opportunities in your business?
Let us know and Dorota will arrange a call with our experts.
Step into the narrative of change with our AI Revolution Diaries, where each interview captures a moment in the ongoing revolution of artificial intelligence across industries. These diaries detail the firsthand experiences of businesses at the forefront of integrating AI, highlighting the transformative impact and the lessons learned throughout their journey of innovation.
Engage with our series to discover the strategies that drive successful AI integration, and grasp the benefits and hurdles encountered by pioneers in the field. Let us be your guide in navigating the transformative journey of AI, empowering your business to harness the full potential of data and shape the future of your industry.
In the interests of your safety and to implement the principle of lawful, reliable and transparent
processing of your personal data when using our services, we developed this document called the
Privacy Policy. This document regulates the processing and protection of Users’ personal data in
connection with their use of the Website and has been prepared by Nexocode.
To ensure the protection of Users' personal data, Nexocode applies appropriate organizational and
technical solutions to prevent privacy breaches. Nexocode implements measures to ensure security at
the level which ensures compliance with applicable Polish and European laws such as:
Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on
the protection of natural persons with regard to the processing of personal data and on the free
movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation)
(published in the Official Journal of the European Union L 119, p 1);
Act of 10 May 2018 on personal data protection (published in the Journal of Laws of 2018, item
1000);
Act of 18 July 2002 on providing services by electronic means;
Telecommunications Law of 16 July 2004.
The Website is secured by the SSL protocol, which provides secure data transmission on the Internet.
1. Definitions
User – a person that uses the Website, i.e. a natural person with full legal capacity, a legal
person, or an organizational unit which is not a legal person to which specific provisions grant
legal capacity.
Nexocode – NEXOCODE sp. z o.o. with its registered office in Kraków, ul. Wadowicka 7, 30-347 Kraków, entered into the Register of Entrepreneurs of the National Court
Register kept by the District Court for Kraków-Śródmieście in Kraków, 11th Commercial Department
of the National Court Register, under the KRS number: 0000686992, NIP: 6762533324.
Website – website run by Nexocode, at the URL: nexocode.com whose content is available to
authorized persons.
Cookies – small files saved by the server on the User's computer, which the server can read when
when the website is accessed from the computer.
SSL protocol – a special standard for transmitting data on the Internet which unlike ordinary
methods of data transmission encrypts data transmission.
System log – the information that the User's computer transmits to the server which may contain
various data (e.g. the user’s IP number), allowing to determine the approximate location where
the connection came from.
IP address – individual number which is usually assigned to every computer connected to the
Internet. The IP number can be permanently associated with the computer (static) or assigned to
a given connection (dynamic).
GDPR – Regulation 2016/679 of the European Parliament and of the Council of 27 April 2016 on the
protection of individuals regarding the processing of personal data and onthe free transmission
of such data, repealing Directive 95/46 / EC (General Data Protection Regulation).
Personal data – information about an identified or identifiable natural person ("data subject").
An identifiable natural person is a person who can be directly or indirectly identified, in
particular on the basis of identifiers such as name, identification number, location data,
online identifiers or one or more specific factors determining the physical, physiological,
genetic, mental, economic, cultural or social identity of a natural person.
Processing – any operations performed on personal data, such as collecting, recording, storing,
developing, modifying, sharing, and deleting, especially when performed in IT systems.
2. Cookies
The Website is secured by the SSL protocol, which provides secure data transmission on the Internet.
The Website, in accordance with art. 173 of the Telecommunications Act of 16 July 2004 of the
Republic of Poland, uses Cookies, i.e. data, in particular text files, stored on the User's end
device. Cookies are used to:
improve user experience and facilitate navigation on the site;
help to identify returning Users who access the website using the device on which Cookies were
saved;
creating statistics which help to understand how the Users use websites, which allows to improve
their structure and content;
adjusting the content of the Website pages to specific User’s preferences and optimizing the
websites website experience to the each User's individual needs.
Cookies usually contain the name of the website from which they originate, their storage time on the
end device and a unique number. On our Website, we use the following types of Cookies:
"Session" – cookie files stored on the User's end device until the Uses logs out, leaves the
website or turns off the web browser;
"Persistent" – cookie files stored on the User's end device for the time specified in the Cookie
file parameters or until they are deleted by the User;
"Performance" – cookies used specifically for gathering data on how visitors use a website to
measure the performance of a website;
"Strictly necessary" – essential for browsing the website and using its features, such as
accessing secure areas of the site;
"Functional" – cookies enabling remembering the settings selected by the User and personalizing
the User interface;
"First-party" – cookies stored by the Website;
"Third-party" – cookies derived from a website other than the Website;
"Facebook cookies" – You should read Facebook cookies policy: www.facebook.com
"Other Google cookies" – Refer to Google cookie policy: google.com
3. How System Logs work on the Website
User's activity on the Website, including the User’s Personal Data, is recorded in System Logs. The
information collected in the Logs is processed primarily for purposes related to the provision of
services, i.e. for the purposes of:
analytics – to improve the quality of services provided by us as part of the Website and adapt
its functionalities to the needs of the Users. The legal basis for processing in this case is
the legitimate interest of Nexocode consisting in analyzing Users' activities and their
preferences;
fraud detection, identification and countering threats to stability and correct operation of the
Website.
4. Cookie mechanism on the Website
Our site uses basic cookies that facilitate the use of its resources. Cookies contain useful
information
and are stored on the User's computer – our server can read them when connecting to this computer
again.
Most web browsers allow cookies to be stored on the User's end device by default. Each User can
change
their Cookie settings in the web browser settings menu:
Google ChromeOpen the menu (click the three-dot icon in the upper right corner), Settings >
Advanced. In
the "Privacy and security" section, click the Content Settings button. In the "Cookies and site
date"
section you can change the following Cookie settings:
Deleting cookies,
Blocking cookies by default,
Default permission for cookies,
Saving Cookies and website data by default and clearing them when the browser is closed,
Specifying exceptions for Cookies for specific websites or domains
Internet Explorer 6.0 and 7.0
From the browser menu (upper right corner): Tools > Internet Options >
Privacy, click the Sites button. Use the slider to set the desired level, confirm the change with
the OK
button.
Mozilla Firefox
browser menu: Tools > Options > Privacy and security. Activate the “Custom” field.
From
there, you can check a relevant field to decide whether or not to accept cookies.
Opera
Open the browser’s settings menu: Go to the Advanced section > Site Settings > Cookies and site
data. From there, adjust the setting: Allow sites to save and read cookie data
Safari
In the Safari drop-down menu, select Preferences and click the Security icon.From there,
select
the desired security level in the "Accept cookies" area.
Disabling Cookies in your browser does not deprive you of access to the resources of the Website.
Web
browsers, by default, allow storing Cookies on the User's end device. Website Users can freely
adjust
cookie settings. The web browser allows you to delete cookies. It is also possible to automatically
block cookies. Detailed information on this subject is provided in the help or documentation of the
specific web browser used by the User. The User can decide not to receive Cookies by changing
browser
settings. However, disabling Cookies necessary for authentication, security or remembering User
preferences may impact user experience, or even make the Website unusable.
5. Additional information
External links may be placed on the Website enabling Users to directly reach other website. Also,
while
using the Website, cookies may also be placed on the User’s device from other entities, in
particular
from third parties such as Google, in order to enable the use the functionalities of the Website
integrated with these third parties. Each of such providers sets out the rules for the use of
cookies in
their privacy policy, so for security reasons we recommend that you read the privacy policy document
before using these pages.
We reserve the right to change this privacy policy at any time by publishing an updated version on
our
Website. After making the change, the privacy policy will be published on the page with a new date.
For
more information on the conditions of providing services, in particular the rules of using the
Website,
contracting, as well as the conditions of accessing content and using the Website, please refer to
the
the Website’s Terms and Conditions.