wynalazkowo - experiments small and large

AWS Big Data Platform

Taming the Big Data Beast: How AWS EMR Simplifies Large-Scale Analytics. In today's data-driven world, businesses are constantly bombarded with information. This "big data" presents both challenges and opportunities. But how do you efficiently analyze and extract insights from massive datasets? Enter AWS Elastic MapReduce (EMR), a powerful big data platform on the AWS cloud […]

Multidimensional cube – what is it and how can it be used?

Starting from the beginning: what is a multidimensional cube? A multidimensional cube is a data structure that allows information to be analyzed from multiple perspectives. Imagine a data warehouse transformed: instead of flat tables, data is organized into a cube-like structure. Each dimension represents a specific aspect of your data, like product category, customer location, […]
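
To make the idea of dimensions concrete, here is a minimal sketch in plain Python. It is not taken from the post: the sample rows, the `sales` measure, and the helper names `roll_up` and `slice_cube` are assumptions made only for illustration. It shows the same facts aggregated along different dimensions, which is the core of what a cube offers.

```python
from collections import defaultdict

# Hypothetical fact rows: the two dimensions named in the excerpt
# plus an assumed numeric measure called "sales".
FACTS = [
    {"product_category": "books", "customer_location": "Berlin", "sales": 120.0},
    {"product_category": "books", "customer_location": "Warsaw", "sales": 80.0},
    {"product_category": "games", "customer_location": "Berlin", "sales": 200.0},
    {"product_category": "games", "customer_location": "Warsaw", "sales": 50.0},
]


def roll_up(facts, dimensions, measure="sales"):
    """Sum the measure for every combination of values of the chosen dimensions."""
    totals = defaultdict(float)
    for row in facts:
        key = tuple(row[d] for d in dimensions)
        totals[key] += row[measure]
    return dict(totals)


def slice_cube(facts, dimension, value):
    """Keep only the rows where one dimension is fixed to a single value."""
    return [row for row in facts if row[dimension] == value]


# The same data viewed from different perspectives.
by_category = roll_up(FACTS, ["product_category"])
by_location = roll_up(FACTS, ["customer_location"])
berlin_by_category = roll_up(
    slice_cube(FACTS, "customer_location", "Berlin"), ["product_category"]
)
```

A real OLAP engine precomputes and indexes these aggregates; the sketch only recomputes them on demand to show what "analysis from multiple perspectives" means.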

Apache Airflow

Recently I had the opportunity to play with this kind of scheduler for data pipeline tasks. It is mega simple to set up, either on bare metal, as a Docker worker, or in Kubernetes using a Helm chart, according to this description: https://airflow.apache.org/docs/apache-airflow/stable/installation/index.html This software lets you create data pipelines that extract data, decorate it, and save it in a different place. […]

Graph database

A graph database is defined as a specialized, single-purpose platform for creating and manipulating graphs. Graphs contain nodes, edges, and properties, all of which are used to represent and store data in a way that relational databases are not equipped to do. Graph analytics is another commonly used term, and it refers specifically to the process of […]
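
The three building blocks mentioned above (nodes, edges, and properties) can be sketched in a few lines of plain Python. This is only an in-memory illustration under assumed names (`PropertyGraph`, `add_node`, `add_edge`, `neighbors`, and the sample data are all hypothetical), not the API of any particular graph database.

```python
from dataclasses import dataclass, field


@dataclass
class PropertyGraph:
    # node id -> dictionary of properties
    nodes: dict = field(default_factory=dict)
    # list of (source, label, target, properties) tuples
    edges: list = field(default_factory=list)

    def add_node(self, node_id, **properties):
        self.nodes[node_id] = properties

    def add_edge(self, source, label, target, **properties):
        # Edges may only connect nodes that already exist.
        if source not in self.nodes or target not in self.nodes:
            raise KeyError("both endpoints must exist before adding an edge")
        self.edges.append((source, label, target, properties))

    def neighbors(self, node_id, label=None):
        """Return targets of outgoing edges, optionally filtered by edge label."""
        return [
            target
            for source, edge_label, target, _ in self.edges
            if source == node_id and (label is None or edge_label == label)
        ]


# Hypothetical sample data.
graph = PropertyGraph()
graph.add_node("alice", kind="person", city="Warsaw")
graph.add_node("bob", kind="person", city="Berlin")
graph.add_node("airflow", kind="tool")
graph.add_edge("alice", "KNOWS", "bob", since=2020)
graph.add_edge("alice", "USES", "airflow")
```

Following an edge here is a direct lookup rather than a join between tables, which is the kind of traversal a graph database is built to make cheap.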

TIBCO

Since one of my tasks is learning TIBCO EMS, here are my notes. You can expect more on this topic ;) TIBCO Messaging provides components to support messaging and communications, from high-performance (millions of messages a second) to low latency (sub-microsecond delivery) to fully transactional enterprise-class distribution, streaming data and open source support. Components of […]

Zookeeper

I have heard about this software many times in relation to Big Data, Hadoop, and Kafka, but never looked into it in detail until today. So here is a short description of this technology and its usage: ZooKeeper is developed as an open-source Apache project: https://github.com/apache/zookeeper. ZooKeeper is a centralized service for maintaining configuration information, […]

Omnitracker – integration with ServiceNow – last words

This is the continuation of a story that started in March last year. In the following months we kept fighting with, fixing, and adjusting this interface. Here are some more issues that should have been considered at the beginning of this project, but everybody knows that you are wisest after the fact ;): there should be a mapping between […]