Blogs
Read our blogs and updates below on the latest technologies for data engineering, advanced analytics and data science.
Knowledge sharing
Explaining AI: Interpreting using SHAP
The possibilities of Machine Learning models are almost limitless. Ranging from predicting customer churn, through determining the right discount, to maintaining your machines using...
ChatGPT3: the future of natural language processing is here!
Are you looking for a way to improve your natural language processing systems? Look no further than ChatGPT3, the latest and most advanced language...
Architecture behind the Qatar World Cup 2022 model
We explain the steps we took to predict the winner of the Qatar World Cup 2022 in this blog. The concepts covered include the...
Connecting and Using MS Graph in Azure Data Factory
Companies are creating more and more data, from which they want to gain insights. One valuable source of data is data from within the...
Loading mechanisms – Part I
As there are huge amounts of data available within companies, data is also moved in increasing quantities from one data storage to another for...
Feature Store
Everyone who has come into contact with data science has heard of the features used in such models. One aspect that can become...
Kimball in a data lake? Come again?
Most companies are already familiar with data modelling (be it Kimball or any other modelling technique) and data warehousing with a classical ETL (Extract-Transform-Load)...
Pandas, Koalas and PySpark in Python
If you landed on this page to learn more about animals, I have to disappoint you. Pandas, Koalas and PySpark are all packages that...
Transfer learning in Spark for image recognition
Transfer learning in Spark demystified in less than 3 minutes of reading. Businesses that want to classify a huge set of images in batch per...
How ALM streamlines BI projects: Azure DevOps
Application Lifecycle Management (ALM) refers to a (software) development process which has been set up in a governed and easy-to-manage way. ALM provides added value...
The Journey of attaining the Azure Data Engineer certificate
On February 23, 2021, Microsoft released a new beta certification exam, Exam DP-203: Data Engineering on Microsoft Azure. It is replacing the exams DP-200:...
Things to consider when creating a Data Lake
Have you wondered what a data lake is? What are typical use cases for this lake? How can you benefit from a data lake...
Managed Big Data: DataBricks, Spark as a Service
The title accompanying this blog post is quite the mouthful. This blog post will explain why you should be using Spark. If a...
Process Mining: Understanding Simple Process Discovery Techniques using Python
Hi and welcome to this blog on process mining. Process mining is a set of techniques used in the field of process management and...
The Data Lakehouse
Every year, a new buzzword emerges in Data and Analytics; get to know the ‘Data Lakehouse’. Will this be the future of all analytical...
The Data Vault Methodology
Data Warehouses have served the purpose of providing a source of value-added information for several decades. In supporting business users in their day-to-day...