Taming the Big Data Beast: How AWS EMR Simplifies Large-Scale Analytics In today’s data-driven world, businesses are constantly bombarded with information. This „big data” presents both challenges and opportunities. But how do you efficiently analyze and extract insights from massive datasets? Enter AWS Elastic MapReduce (EMR), a powerful big data platform on the AWS cloudWięcej oAWS Big Data Platform[…]
Kategoria: Polecane
Starting from the begining: what is it multidimensional cube? A multidimensional cube is a data structure that allows for analysis of information from multiple perspectives. Imagine a data warehouse transformed. Instead of flat tables, data is organized into a cube-like structure. Each dimension represents a specific aspect of your data, like product category, customer location,Więcej oMultidimensional cube – what is that and how can be used?[…]
Recently I had the opportunity to play with this kind of scheduler for data pipelines tasks. It is mega simple to setup either on bare-metal or as docker worker or in Kubernetes using Helmchart acc. to this desciption: https://airflow.apache.org/docs/apache-airflow/stable/installation/index.html This software enables to create data pipelines for extracting data, decorating and saving in different place.Więcej oApache Airflow[…]
A graph database is defined as a specialized, single-purpose platform for creating and manipulating graphs. Graphs contain nodes, edges, and properties, all of which are used to represent and store data in a way that relational databases are not equipped to do. Graph analytics is another commonly used term, and it refers specifically to the process ofWięcej oGraph database[…]
The Three V’s of Big Data: Volume, Velocity, and Variety Volume: Data is being generated in larger quantities by an ever-growing array of sources including social media and e-commerce sites, mobile apps, and IoT connected sensors and devices. Businesses and organizations are finding new ways to leverage Big Data to their advantage, but also face theWięcej oStreaming data architecture[…]
Here are some more useful German links: https://www.thoughtco.com/english-german-computer-and-internet-glossary-4085499 https://www.veem.com/library/german-business-phrases-help-communicate-email/
As one of my task is also learning TIBCO EMS – here are my notes. You can expect more one this topic;) TIBCO Messaging provides components to support messaging / communications. From high-performance (millions of messages a second) to low latency (sub-microsecond delivery) to fully transactional enterprise-class distribution, streaming data and open source support.Components ofWięcej oTIBCO[…]
I have heard about this software many times in relation to BigData, Hadoop, Kafka, but never went into detailed knowledge about this until today. So here is short description of this technology and usage of it: ” Zookeeper was invented by Apache as opensource project: https://github.com/apache/zookeeper. ZooKeeper is a centralized service for maintaining configuration information,Więcej oZookeeper[…]
It is short description of our approach to fast upgrade of Omnitracker with 800GB database. The initial configuration was not so obvious as you could think: Linux cluster with Oracle DB 11.2 Windows server 2012 – CPU with 1 core and 16GB RAM Omnitracker 10.0.100 It was working fine for almost last 10 years withoutWięcej oOmnitracker upgrade – from antique to modernity[…]
This is continuation of story, which started last year in March. For following months we were fighting, fixing and adjusting this interface. Here are some more issues which should have been considered at the beginning of this project, but everybody knows that the wisest you are after the facts;): there should be a mapping betweenWięcej oOmnitracker – integration with ServiceNow – last words[…]