Arun JijoinJavarevisitedSubqueries and CTEs in Spark: Enhancing Data Analysis and ManipulationIn the intricate world of data analytics, the power to craft sophisticated and efficient queries is invaluable. Delving into the realm of…10 min read·Apr 24, 2024----
Arun JijoinJavarevisitedBeefing Up Redshift PerformanceMPP is an predestined tool for any Data Warehousing and Big Data use case. Amazon Red Shift overhaul all of its peers in its space due to…5 min read·Apr 2, 2021----
Arun JijoinJavarevisitedSpark 3.0 — New Functions in a NutshellRecently Apache Spark community releases the preview of Spark 3.0 which holds many significant new features that will help Spark to make a…8 min read·Jun 14, 2020--3--3
Arun JijoinDataKare SolutionsSpark SQL — Salient functions in a NutshellAs, Spark DataFrame becomes de-facto standard for data processing in Spark, it is a good idea to be aware key functions of Spark sql that…3 min read·Dec 27, 2019----
Arun JijoinJavarevisitedCurious case of Island of IsolationGarbage collector is one of the major primitives in the JAVA world. The tool that clears the unused / unreachable objects from the memory…4 min read·Jun 25, 2019--1--1
Arun JijoinDataKare SolutionsKey factors to consider when optimizing Spark JobsDeveloping a spark application is fairly simple and straightforward, as spark provides featured pack APIs. Be that as it may, the tedious…8 min read·Mar 21, 2019--1--1
Arun JijoinDataKare SolutionsStructured Streaming: EssentialsThis is the second chapter under the series “Structured Streaming” which center around covering all the essential details to set up a…4 min read·Mar 3, 2019----
Arun JijoinDataKare SolutionsStructured Streaming: Kafka integrationThis article focuses on explaining how to integrate Spark’s new stream processing engine Structured Streaming with Apache Kafka along with…5 min read·Feb 10, 2019----