Browsing Tag
bigdata
9 posts
The two versions of Parquet
A few days ago, the creators of DuckDB wrote the article: Query Engines: Gatekeepers of the Parquet File…
Optimizing Transformations in Pentaho: Case Study
In my current role as an IT Analyst, I was tasked with verifying and correcting an automatic transformation…
What to use parquet or CSV?
History of Parquet File: A Big Data Storage Revolution The Parquet file format has emerged as a dominant…
Business Intelligence and Analytics: A Comprehensive Guide
Introduction In the modern business landscape, data is king. With the advent of digital technology, businesses are generating…
Big data models 📊 vs. Computer memory 💾
Data pipelines are the backbone of any data-intensive project. As datasets grow beyond memory size (“out-of-core”), handling them…
Connecting Multiple Kafka Clusters in ClickHouse Using Named Collections
Introduction: ClickHouse is a powerful columnar database renowned for its speed and efficiency. A pivotal strength lies in…
How working/install Pig with Notebooks?
🐷📝 Basic commands to work with Pig in Notebooks 🔗Related content You can find post related in: 📀Google…
How working/install Spark with Notebooks?
🌟📝 Basic commands to work with Spark in Notebooks like a Standalone cluster 🔗Related content You can find…
Light Up⭐️Star — Light Up the Road to Open Source!
Check out GitHub: https://github.com/apache/incubator-seatunnel SeaTunnel Connector Acess Plan During the recent live event of the SeaTunnel Connector Access…