Arun Jijo – Medium

Arun Jijo

Published in
Javarevisited

Subqueries and CTEs in Spark: Enhancing Data Analysis and Manipulation

In the intricate world of data analytics, the power to craft sophisticated and efficient queries is invaluable. Delving into the realm of…

Apr 24, 2024


Published in
Javarevisited

Beefing Up Redshift Performance

MPP is a natural fit for any data warehousing and big data use case. Amazon Redshift outpaces all of its peers in its space due to…

Apr 2, 2021


Published in
Javarevisited

Spark 3.0 — New Functions in a Nutshell

The Apache Spark community recently released the preview of Spark 3.0, which holds many significant new features that will help Spark make a…

Jun 14, 2020


Published in
DataKare Solutions

Spark SQL — Salient functions in a Nutshell

As the Spark DataFrame becomes the de facto standard for data processing in Spark, it is a good idea to be aware of the key functions of Spark SQL that…

Dec 27, 2019


Published in
Javarevisited

Curious case of Island of Isolation

The garbage collector is one of the major primitives in the Java world: the tool that clears unused or unreachable objects from memory…

Jun 25, 2019


Published in
DataKare Solutions

Key factors to consider when optimizing Spark Jobs

Developing a Spark application is fairly simple and straightforward, as Spark provides feature-packed APIs. Be that as it may, the tedious…

Mar 21, 2019


Published in
DataKare Solutions

Structured Streaming: Essentials

This is the second chapter in the “Structured Streaming” series, which centers on covering all the essential details to set up a…

Mar 3, 2019


Published in
DataKare Solutions

Structured Streaming

Introduction

Feb 26, 2019


Published in
DataKare Solutions

Structured Streaming: Kafka integration

This article focuses on explaining how to integrate Spark’s new stream processing engine Structured Streaming with Apache Kafka along with…

Feb 10, 2019


Arun Jijo

Data engineer at DataKare Solutions with expertise in Apache NiFi, Kafka, and Spark, and a passion for Java.
