Blogs

Read our blogs and updates below on the latest technologies in data engineering, advanced analytics, and data science.

Knowledge sharing

Explaining AI: Interpreting using SHAP

The possibilities of Machine Learning models are almost limitless. Ranging from predicting customer churn to determining the right discount to maintaining your machines using...

ChatGPT3: the future of natural language processing is here!

Are you looking for a way to improve your natural language processing systems? Look no further than ChatGPT3, the latest and most advanced language...

Architecture behind the Qatar World Cup 2022 model

We explain the steps we took to predict the winner of the Qatar World Cup 2022 in this blog. The concepts covered include the...

Connecting and Using MS Graph in Azure Data Factory

Companies are creating more and more data, from which they want to gain insights. One valuable source of data is data from within the...

Loading mechanisms – Part I

As there are huge amounts of data available within companies, data is also moved in increasing quantities from one data storage to another for...

Feature Store

Anyone who has come into contact with data science has heard of the features used in its models. One aspect that can become...

Kimball in a data lake? Come again?

Most companies are already familiar with data modelling (be it Kimball or any other modelling technique) and data warehousing with a classical ETL (Extract-Transform-Load)...

Pandas, Koalas and PySpark in Python

If you landed on this page to learn more about animals, I have to disappoint you. Pandas, Koalas and PySpark are all packages that...

Transfer learning in Spark for image recognition

Transfer learning in Spark demystified in less than 3 minutes of reading. Businesses that want to classify a huge set of images in batch per...

How ALM streamlines BI projects: Azure DevOps

Application Lifecycle Management (ALM) refers to a (software) development process that has been set up in a governed and easy-to-manage way. ALM provides added value...

The Journey of attaining the Azure Data Engineer certificate

On February 23, 2021, Microsoft released a new beta certification exam, Exam DP-203: Data Engineering on Microsoft Azure. It is replacing the exams DP-200:...

Things to consider when creating a Data Lake

Have you wondered what a data lake is? What are typical use cases for this lake? How can you benefit from a data lake...

Managed Big Data: DataBricks, Spark as a Service

The title of this blog post is quite the mouthful. This blog post will explain why you should be using Spark. If a...

Process Mining: Understanding Simple Process Discovery Techniques using Python

Hi and welcome to this blog on process mining! Process mining is a set of techniques used in the field of process management and...

The Data Lakehouse

Every year, a new buzzword emerges in Data and Analytics; get to know the ‘Data Lakehouse’. Will this be the future of all analytical...

The Data Vault Methodology

Data Warehouses have served the purpose of providing a source of value-added information for several decades. In supporting business users in their day-to-day...