Boussias Group Turning Disconnected Data into a Scalable Data Intelligence Hub
Data Engineering
How Boussias transformed manual reporting by unifying 3M+ records and 1,000+ tables into a cloud-native data lake.
Challenge Accepted!
Process & Story
Boussias is a leading B2B media and events company operating in Greece and Cyprus. Since 1980, the organization has been setting industry standards in trade publishing, conferences, and awards. With a growing footprint and an increasingly digital operation, Boussias needed a modern, scalable approach to data management.
Boussias faced a familiar data dilemma, disconnected systems, manual processes, and limited access to insights. Beyond solving immediate operational inefficiencies, the company wanted to future-proof its data infrastructure to support advanced analytics and AI initiatives.
Problem
The core issues included:
Data scattered across multiple formats, including flat files, Excel sheets, databases, real-time streaming data, and APIs
Data silos across departments
Heavy reliance on spreadsheets
Manual reconciliation of entities
Fragmented data sources with no clear lineage
Lack of a unified, robust foundation to enable AI-driven projects and data innovation
The company needed more than a one-off solution, they needed a data infrastructure that could grow with the business, be managed in the future by an internal team, and integrate seamlessly with existing systems.
Solution
We designed and implemented a modern, scalable data lake architecture for Boussias, built on Google Cloud Platform, to unify fragmented data sources, streamline data operations, and lay a strong foundation for analytics and future AI initiatives.
Outcomes
Key data sources integrated, Salesforce (CRM) and MariaDB (eCommerce data) connected to the centralized data lake
Over 3 million data rows migrated and ready for analysis and reporting
1,000+ structured tables established across medallion architecture layers
End-to-end pipeline delivered, from ingestion with Fivetran to business intelligence exposure via Metabase, all in just 3 months
Full knowledge transfer completed within one week, enabling seamless handover to the internal team
Cloud-native, scalable data infrastructure
At the core of the solution was a robust and cost-effective infrastructure based on Google Cloud Platform. We used BigQuery as the central data warehouse to ingest and store structured, semi-structured, and unstructured data. Its serverless nature ensured effortless scalability, while allowing the business to process millions of records efficiently without managing infrastructure.
Within 3 months, Boussias went from disconnected data sources to a production-ready system ingesting two critical data sources:
Salesforce, the CRM platform
eCommerce MariaDB database, housing transactional data
The platform was ready with all layers, from ingestion to analytics, fully automated and ready for plugging in the next sources. The solution seamlessly migrated over 3 million records across 1,000+ structured tables, laying the groundwork for effortless expansion as new data sources are added.
Layered medallion architecture for clear data governance
To ensure clarity and traceability, we implemented a medallion architecture with bronze (raw), silver (cleaned), and gold (business-ready) layers. This approach provided a systematic pipeline for refining and promoting data across stages, significantly reducing manual data wrangling.
We incorporated CI/CD pipelines to deploy transformations and models safely and efficiently, enabling iterative development while ensuring data quality through built-in validation checks.
Automated ingestion from diverse sources
Using Fivetran, we connected initial core systems, Salesforce and an eCommerce MariaDB database, to automate data ingestion. This eliminated the need for manual exports and provided a real-time, always-updated data pipeline that could be easily extended to additional sources like Excel files, APIs, and streaming data.
This automation allowed Boussias to shift from time-consuming, spreadsheet-based workflows to instantaneous, reliable access to unified data.
Powerful transformation and modeling framework
Data cleansing, augmentation, and transformation were managed via SQLMesh, supported by Python for advanced data logic. We implemented logic to:
Clean inconsistent data at the source
Enrich records with contextual metadata
Automatically promote and validate datasets through transformation stages
Monitor lineage and manage schema evolution seamlessly
This framework made the entire pipeline transparent, auditable, and easy to evolve for future needs.
Business Intelligence for all
To empower non-technical users, we integrated Metabase, an open-source BI platform. Business users could instantly explore curated datasets, create dashboards, and build reports, no SQL or engineering support required. The platform became the single source of truth for insights across departments.
Unifying fragmented systems and file formats Boussias’ data was spread across disconnected systems and various file formats, including Excel spreadsheets, flat files, databases, streaming sources, and APIs. These fragmented sources led to duplicated efforts, inconsistent reporting, and major delays in deriving meaningful insights. We addressed this by building a future-proof ingestion framework using Fivetran, supported by Google Cloud’s native services, allowing seamless integration across structured and semi-structured sources.Within weeks, the team established the infrastructure for a standardized, automated data pipeline, laying the groundwork to eliminate ad hoc file handling and support both batch and real-time ingestion in the future. As a first milestone, the platform successfully integrated two key sources: Salesforce and the eCommerce database.
Eliminating manual reconciliation and Excel-based workflows Business teams previously relied heavily on Excel for reporting, merging data manually, and resolving entity mismatches on a case-by-case basis. This not only consumed time but also posed risks of inconsistency and human error. With the introduction of the medallion architecture and an automated transformation pipeline, we enabled Boussias’ internal teams to start designing streamlined, error-resistant processes for data management and report creation. The platform provides clean, well-structured data via Metabase, supporting more consistent and reliable access to insights as new workflows are developed.
Empowering a small internal team to manage a modern platform The solution had to be lightweight enough to be operated by a small in-house team, while still being enterprise-grade. By using Infrastructure as Code (Terraform + GitHub) and modular, low-maintenance tools, we ensured that updates, extensions, and maintenance could be handled internally with minimal effort. We also conducted a thorough project handoff and knowledge transfer to the internal team.
Preparing for AI-readiness Beyond reporting, Boussias wanted a foundation capable of supporting future AI and machine learning projects. But without consistent, high-quality, and traceable data, such initiatives were not feasible. Through meticulous design of data models, version control, lineage tracking, and schema evolution handling, we built a system that supports data experimentation, model training, and advanced analytics, ready for any AI project they pursue.
The new AI-powered procurement system reduced RFQ processing time by up to 85%, cutting the cycle from 4 weeks to just 4 days and enabling faster, more cost-efficient purchasing decisions.
In the interests of your safety and to implement the principle of lawful, reliable and transparent
processing of your personal data when using our services, we developed this document called the
Privacy Policy. This document regulates the processing and protection of Users’ personal data in
connection with their use of the Website and has been prepared by Nexocode.
To ensure the protection of Users' personal data, Nexocode applies appropriate organizational and
technical solutions to prevent privacy breaches. Nexocode implements measures to ensure security at
the level which ensures compliance with applicable Polish and European laws such as:
Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on
the protection of natural persons with regard to the processing of personal data and on the free
movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation)
(published in the Official Journal of the European Union L 119, p 1);
Act of 10 May 2018 on personal data protection (published in the Journal of Laws of 2018, item
1000);
Act of 18 July 2002 on providing services by electronic means;
Telecommunications Law of 16 July 2004.
The Website is secured by the SSL protocol, which provides secure data transmission on the Internet.
1. Definitions
User – a person that uses the Website, i.e. a natural person with full legal capacity, a legal
person, or an organizational unit which is not a legal person to which specific provisions grant
legal capacity.
Nexocode – NEXOCODE sp. z o.o. with its registered office in Kraków, ul. Wadowicka 7, 30-347 Kraków, entered into the Register of Entrepreneurs of the National Court
Register kept by the District Court for Kraków-Śródmieście in Kraków, 11th Commercial Department
of the National Court Register, under the KRS number: 0000686992, NIP: 6762533324.
Website – website run by Nexocode, at the URL: nexocode.com whose content is available to
authorized persons.
Cookies – small files saved by the server on the User's computer, which the server can read when
when the website is accessed from the computer.
SSL protocol – a special standard for transmitting data on the Internet which unlike ordinary
methods of data transmission encrypts data transmission.
System log – the information that the User's computer transmits to the server which may contain
various data (e.g. the user’s IP number), allowing to determine the approximate location where
the connection came from.
IP address – individual number which is usually assigned to every computer connected to the
Internet. The IP number can be permanently associated with the computer (static) or assigned to
a given connection (dynamic).
GDPR – Regulation 2016/679 of the European Parliament and of the Council of 27 April 2016 on the
protection of individuals regarding the processing of personal data and onthe free transmission
of such data, repealing Directive 95/46 / EC (General Data Protection Regulation).
Personal data – information about an identified or identifiable natural person ("data subject").
An identifiable natural person is a person who can be directly or indirectly identified, in
particular on the basis of identifiers such as name, identification number, location data,
online identifiers or one or more specific factors determining the physical, physiological,
genetic, mental, economic, cultural or social identity of a natural person.
Processing – any operations performed on personal data, such as collecting, recording, storing,
developing, modifying, sharing, and deleting, especially when performed in IT systems.
2. Cookies
The Website is secured by the SSL protocol, which provides secure data transmission on the Internet.
The Website, in accordance with art. 173 of the Telecommunications Act of 16 July 2004 of the
Republic of Poland, uses Cookies, i.e. data, in particular text files, stored on the User's end
device. Cookies are used to:
improve user experience and facilitate navigation on the site;
help to identify returning Users who access the website using the device on which Cookies were
saved;
creating statistics which help to understand how the Users use websites, which allows to improve
their structure and content;
adjusting the content of the Website pages to specific User’s preferences and optimizing the
websites website experience to the each User's individual needs.
Cookies usually contain the name of the website from which they originate, their storage time on the
end device and a unique number. On our Website, we use the following types of Cookies:
"Session" – cookie files stored on the User's end device until the Uses logs out, leaves the
website or turns off the web browser;
"Persistent" – cookie files stored on the User's end device for the time specified in the Cookie
file parameters or until they are deleted by the User;
"Performance" – cookies used specifically for gathering data on how visitors use a website to
measure the performance of a website;
"Strictly necessary" – essential for browsing the website and using its features, such as
accessing secure areas of the site;
"Functional" – cookies enabling remembering the settings selected by the User and personalizing
the User interface;
"First-party" – cookies stored by the Website;
"Third-party" – cookies derived from a website other than the Website;
"Facebook cookies" – You should read Facebook cookies policy: www.facebook.com
"Other Google cookies" – Refer to Google cookie policy: google.com
3. How System Logs work on the Website
User's activity on the Website, including the User’s Personal Data, is recorded in System Logs. The
information collected in the Logs is processed primarily for purposes related to the provision of
services, i.e. for the purposes of:
analytics – to improve the quality of services provided by us as part of the Website and adapt
its functionalities to the needs of the Users. The legal basis for processing in this case is
the legitimate interest of Nexocode consisting in analyzing Users' activities and their
preferences;
fraud detection, identification and countering threats to stability and correct operation of the
Website.
4. Cookie mechanism on the Website
Our site uses basic cookies that facilitate the use of its resources. Cookies contain useful
information
and are stored on the User's computer – our server can read them when connecting to this computer
again.
Most web browsers allow cookies to be stored on the User's end device by default. Each User can
change
their Cookie settings in the web browser settings menu:
Google ChromeOpen the menu (click the three-dot icon in the upper right corner), Settings >
Advanced. In
the "Privacy and security" section, click the Content Settings button. In the "Cookies and site
date"
section you can change the following Cookie settings:
Deleting cookies,
Blocking cookies by default,
Default permission for cookies,
Saving Cookies and website data by default and clearing them when the browser is closed,
Specifying exceptions for Cookies for specific websites or domains
Internet Explorer 6.0 and 7.0
From the browser menu (upper right corner): Tools > Internet Options >
Privacy, click the Sites button. Use the slider to set the desired level, confirm the change with
the OK
button.
Mozilla Firefox
browser menu: Tools > Options > Privacy and security. Activate the “Custom” field.
From
there, you can check a relevant field to decide whether or not to accept cookies.
Opera
Open the browser’s settings menu: Go to the Advanced section > Site Settings > Cookies and site
data. From there, adjust the setting: Allow sites to save and read cookie data
Safari
In the Safari drop-down menu, select Preferences and click the Security icon.From there,
select
the desired security level in the "Accept cookies" area.
Disabling Cookies in your browser does not deprive you of access to the resources of the Website.
Web
browsers, by default, allow storing Cookies on the User's end device. Website Users can freely
adjust
cookie settings. The web browser allows you to delete cookies. It is also possible to automatically
block cookies. Detailed information on this subject is provided in the help or documentation of the
specific web browser used by the User. The User can decide not to receive Cookies by changing
browser
settings. However, disabling Cookies necessary for authentication, security or remembering User
preferences may impact user experience, or even make the Website unusable.
5. Additional information
External links may be placed on the Website enabling Users to directly reach other website. Also,
while
using the Website, cookies may also be placed on the User’s device from other entities, in
particular
from third parties such as Google, in order to enable the use the functionalities of the Website
integrated with these third parties. Each of such providers sets out the rules for the use of
cookies in
their privacy policy, so for security reasons we recommend that you read the privacy policy document
before using these pages.
We reserve the right to change this privacy policy at any time by publishing an updated version on
our
Website. After making the change, the privacy policy will be published on the page with a new date.
For
more information on the conditions of providing services, in particular the rules of using the
Website,
contracting, as well as the conditions of accessing content and using the Website, please refer to
the
the Website’s Terms and Conditions.