29. 12. 2021.

Apache Ignite – distributed In-memory SQL database

Apache Ignite is one of the very few in-memory, SQL-compliant distributed databases/data grids among open-source projects. It’s often called “Redis done right” or “Redis on steroids”, because Redis looks primitive and limited when compared with Apache Ignite. Ignite offers great flexibility and a lot of features that can easily fit many use cases. Instead […]

23. 12. 2021.

YugabyteDb – distributed SQL database for a new age

Recently I got a chance to try YugabyteDb, one of the new-age databases that try to tackle new requirements such as scalability, resilience, high availability, cloud/hybrid readiness and new architecture styles based on microservices. Although Yugabyte is a relatively young company, it attracts a lot of attention, not only from architects/developers/admins, but also from […]

06. 12. 2021.

How to create a real time machine learning pipeline with StreamSets Transformer

Artificial Intelligence (AI), with its subset Machine Learning (ML), is probably one of the hottest topics in the IT industry today. Many companies are struggling, with more or less success, to implement AI algorithms in data pipelines to make smarter decisions. First of all, AI is a wide topic which requires knowledge of math, statistics, […]

29. 11. 2021.

Complex near real-time transformations in data pipelines

For many years, the daily ETL batch job was the dominant way to perform data transformations before loading into a data warehouse. These days the requirements are quite different, starting with the most important one: new data has to be available for AI/ML and analysis in near real time. Moreover, classical DWH databases are […]

18. 08. 2021.

Functional monitoring of Microservices architecture by using Apache Superset

Many of you who have started to develop modern apps using the microservices approach have already learned that development tools, debuggers, performance monitoring and tracing lag behind the desired architecture. The situation is even worse when it comes to functional monitoring, where your goal is to find out what is going on with your system from […]

04. 05. 2021.

Tuning Connection pool in modern Microservice architecture

A connection pool has always been a great way to ensure low latency when establishing a connection with a database, while at the same time keeping the number of open sessions under control. It’s one of the best ways to balance speed with resource consumption. With a connection pool in place, the connection is already established and ready […]

21. 02. 2020.

External C procedures – how to setup and performance boost you can expect

In this post I’ll explain what you need to do to set up an external C procedure call with Oracle 18c. I’ll also show what performance boost you might expect for a CPU-intensive task. Those of you who remember how difficult it was to configure LISTENER and TNSNAMES to work with C external procedures will […]

14. 01. 2020.

Oracle Multiversion Read Consistency and its impact on ETL

Imagine you need to create an ETL workflow that uses an Oracle database as one of its sources or, an even more common situation, you need to synchronize a local table with a table from a remote Oracle database via a dblink. The most common scenario is to apply the following steps: I have a local table for […]

10. 01. 2020.

Case for using unusable index

I have worked with Oracle for longer than I can remember, but every day I find something new. This time I was asked to drop one index on the production system, which is the simplest task you can get, right? I knew there had to be a problem with that, otherwise someone else would have done it. In any […]

07. 01. 2020.

Oracle database links – performance impact

In this post I’m going to explain the most important drawbacks of using db link technology for purposes it was not designed for. To be more precise, I’ll explain the case in which db links are used to exchange data on a massive scale between many separate databases. In the previous post you can find […]