Blogs

Read our blogs and updates below on the latest technologies for data engineering, advanced analytics and data science.

Knowledge sharing

Unlocking the power of AutoML in Databricks: simplifying machine learning workflows

In today's data-driven world, organizations are constantly seeking ways to extract insights and value from their data. Machine learning (ML) has emerged as a...

Visualization of real-time data using Databricks and Grafana

In today's fast-paced digital world, the ability to monitor and analyze data in (near) real-time has become essential for businesses across industries. Whether it's...

Top insights from the Databricks Summit 2024

The annual Databricks Summit took place in San Francisco this year, spanning from June 10 to June 13. This event is the ideal place...

Databricks as a unified platform for Generative AI Development

Databricks has been a pioneer in AI innovation for over a decade, and with the rise of Generative AI, it will take a new...

Getting Started with DBRX: First Impressions of Databricks’ Innovative Language Model

Databricks is advancing rapidly with its Generative AI platform. It is increasingly integrating foundation models and large language models. The most recent addition is...

Your all-in-one Generative AI platform: Azure AI Studio

During the last couple of months, generative AI has changed productivity, operations and customer support. Several AI providers have put a lot of emphasis...

Geospatial data: finding pubs along cycling routes

Part 1: Geospatial fundamentals. “Data is everywhere” is a tagline you will often hear or read, especially when people are talking about big data. And...

Internet of Things on AWS

The Internet of Things. It is a 'word' that has been around since the early 2000s, but the concept has been around for much...

Architecting with Ease: Harnessing the Power of Amazon Q Assistant for Streamlined AWS Solutions

Note: This blog was written when Amazon Q was still in Public Preview. Some features may have changed by the time you read this. Features...

Redefining Data Lakes: Is AWS Lake Formation the Answer to Past Troubles?

Over the past few years, the term 'data lake' has dominated many conversations about data, and rightfully so. Whether it is used as a buzzword...

Streamlining data warehousing: Harnessing the power of Amazon Redshift with dbt integration

Nowadays, data engineers and scientists cannot keep up with all the ETL tools popping up like mushrooms. Imagine a tool, not strictly...

What orchestrator to use for your ETL jobs?

In today's data-driven world, organizations are facing the ever-increasing complexity of data pipelines. The need to handle diverse data formats, sizes, sources and processing...

How AWS Well-Architected can improve your AWS cloud architecture

When talking about Amazon Web Services (AWS), most people associate it with Amazon's online shopping platform. However, if you're familiar with the "Web Services"...

Uploading JSON data to an on-premises REST API using Azure Data Factory and Azure Databricks

The goal of this exercise is to upload some data we read from CSV files, structure it as JSON and upload the data to...

Exploring the potential of Databricks Assistant: the future of Data Lakehouse interaction?

At the 2023 Databricks Data & AI Summit, Databricks announced LakehouseIQ, an AI-powered engine that helps you get more insights into your data...

Support that makes a difference

Welcome to the Intellus support blog! We offer expert support beyond analytical projects, helping customers with day-to-day tasks, ad-hoc requests, technical questions and small...

Explaining AI: Interpreting using SHAP

The possibilities of Machine Learning models are almost limitless. They range from predicting customer churn, through determining the right discount, to maintaining your machines using...

ChatGPT: the future of natural language processing is here!

Are you looking for a way to improve your natural language processing systems? Look no further than ChatGPT, the latest and most advanced language...

Connecting and Using MS Graph in Azure Data Factory

Companies are creating more and more data from which they want to gain insights. One valuable source of data is data from within the...

Loading mechanisms – Part I

As huge amounts of data are available within companies, data is also being moved in increasing quantities from one data store to another for...

Feature Store

Anyone who has come into contact with data science has heard of the features used in its models. One aspect that can become...

Kimball in a data lake? Come again?

Most companies are already familiar with data modelling (be it Kimball or any other modelling technique) and data warehousing with a classical ETL (Extract-Transform-Load)...

Pandas, Koalas and PySpark in Python

If you landed on this page to learn more about animals, I have to disappoint you. Pandas, Koalas and PySpark are all packages that...

Transfer learning in Spark for image recognition

Transfer learning in Spark demystified in less than 3 minutes of reading. Businesses that want to classify a huge set of images in batch per...

How ALM streamlines BI projects: Azure DevOps

Application Lifecycle Management (ALM) refers to a (software) development process which has been set up in a governed and easy-to-manage way. ALM provides added value...

The Journey of attaining the Azure Data Engineer certificate

On February 23, 2021, Microsoft released a new beta certification exam, Exam DP-203: Data Engineering on Microsoft Azure. It is replacing the exams DP-200:...

Things to consider when creating a Data Lake

Have you ever wondered what a data lake is? What are its typical use cases? How can you benefit from a data lake...

Managed Big Data: Databricks, Spark as a Service

The title accompanying this blog post is quite the mouthful. This blog post will explain why you should be using Spark. If a...

Process Mining: Understanding Simple Process Discovery Techniques using Python

Hi and welcome to this blog on process mining. Process mining is a set of techniques used in the field of process management and...

The Data Lakehouse

Every year, a new buzzword emerges in Data and Analytics; get to know the ‘Data Lakehouse’. Will this be the future of all analytical...

The Data Vault Methodology

Data Warehouses have served the purpose of providing a source of value-added information for several decades. In supporting business users in their day-to-day...

Architecture behind the Qatar World Cup 2022 model

We explain the steps we took to predict the winner of the Qatar World Cup 2022 in this blog. The concepts covered include the...
