Clear ur Doubt

Tag: hadoop

Aug 02

Apache Spark Overview

By Sai Gowtham Badvity | Categories: Apache Spark, Hadoop, HDFS | Tags: Apache Spark, Cluster, hadoop, Spark

Apache Spark is a fast, general-purpose cluster computing system. Spark supports in-memory processing for parallel computation, which is why it is usually much faster than an equivalent MapReduce job. It supports a rich set of tools like Spark SQL ...

Jul 31

Impala Compute Stats

By Sai Gowtham Badvity | Categories: Compute Stats, Hadoop, Impala | Tags: Apache Impala, Big Data, Compute Stats, hadoop

Cloudera Impala provides an interface for executing fast, interactive SQL queries on big data stored in HDFS or HBase. Impala improves the performance of an SQL query by applying various optimization techniques. “Compute Stats” is one ...

Jul 26

How to update the last modified timestamp of a file in HDFS?

By Sai Gowtham Badvity | Categories: hadoop jar, HDFS, Java | Tags: hadoop, hadoop jar, HDFS, Java

Sometimes we need to update the last modified time of a file in HDFS. We can do this using the Hadoop FileSystem API in Java. Step 1: Create a simple Maven project in Eclipse. ...
