Run SSIS Project in Azure SSIS-IR with Power query as a Source using Azure File Share and call SSIS Packages in ADF Pipeline – Part 2
In the previous post, we discussed creating an SSIS package and executing it on a local machine. In this post, we will discuss how we can move the SSIS package from a local machine to…
Run SSIS Project in Azure SSIS-IR with Power query as a Source using Azure File Share and call SSIS Packages in ADF Pipeline
Introduction: Azure Data Factory (ADF) is a great tool for scheduling and orchestrating cloud activities. Although ADF has many features to cover various data integration needs, it is not as flexible as its on-premises predecessor, SQL…
Clickhouse Cluster setup and Replication Configuration Part-2
In the previous post, we discussed the basic background of the ClickHouse sharding and replication process. In this blog post, I will discuss in detail how to design and run queries against the cluster. Cluster Setup: Let us build…
Clickhouse Cluster setup and Replication Configuration Part-1
ClickHouse was specifically designed to work in clusters located in different data centers. The DBMS can be scaled linearly (horizontal scaling) to hundreds of nodes. This is mainly to address the scaling issues that arise with an increase…
Filter by Column MySQL vs Clickhouse
Filtering by column is a typical business requirement for reporting or creating analytical dashboards. There are many OLAP databases available on the market. In this blog post, I will delve deep in…
Indexes in MySQL
Indexes in MySQL (MySQL indexes) are a great tool for optimizing SQL queries. To understand how they work, let’s look at working with data without them. Reading data from disk: Files are stored on hard drive…
TPC-DS Benchmark On Clickhouse Part 2
Loading Data into ClickHouse: In the previous post, I discussed TPC-DS data generation; you can find the details here. The dataset I’m working with is broken up into 121 .dat files. The data generated by…
TPC-DS Benchmark On Clickhouse Part 1
ClickHouse: ClickHouse is an open-source, column-oriented database that’s been developed primarily by engineers at Yandex. Yandex is one of Europe’s largest Internet-focused businesses. Their search engine alone was receiving 150 million searches a day in 2012. ClickHouse…
Clickhouse – Introduction
Introduction: ClickHouse is an open-source columnar database that has received much attention in recent years and is mainly used in the field of data analysis (OLAP). Characteristics of OLAP scenarios: Read more than write…