Ruan Bekker

Sysadmins

Linux and Open Source Enthusiast.

  • Home
  • About Me
  • AWS
  • DevOps
  • Docker
  • Subscribe
  • Slack
  • Buy me a Coffee
  • Developer Jobs
  • Newsletter

Hadoop

A collection of 4 posts

July 22, 2017

Amazon EMR Performance Comparison dealing with Hadoops SmallFiles Problem

Today I would like to have a dive into Job Performance with Hadoop, running on the Managed Hadoop Framework of Amazon Web Services, which is Elastic MapReduce (EMR). Hadoop does not deal well with lots of small files, and I…

BigData Hadoop EMR AWS S3DistCp Performance

March 20, 2017

Bash Script setup a 3 Node Hadoop Cluster on LXC Containers

Just a quick post on setting up a 3 Node Hadoop Cluster on LXC Containers Instructions: Once the setup has completed, and you ssh to the master node, it will format hdfs and start the daemons, this section was configured…

Hadoop Scripting LXD LXC

February 16, 2017

AWS: Create EMR Cluster with Java SDK Examples

Today, providing some basic examples on creating a EMR Cluster and adding steps to the cluster with the AWS Java SDK. This tutorial will show how to create an EMR Cluster in eu-west-1 with 1x m3.xlarge Master Node and…

AWS BigData Hadoop EMR Java

April 18, 2016

Setup Hadoop 2.7 MultiNode Cluster on Ubuntu

We will setup a 4 Node Hadoop Cluster using Hadoop 2.7.1 and Ubuntu 14.04. Our cluster will consist of: Ubuntu 14.04 Hadoop 2.7.1 HDFS 1 Master Node 3 Slave Nodes After we have setup…

BigData Hadoop HDFS

Page 1 of 1

Subscribe to Sysadmins

Get the latest and greatest from Sysadmins delivered straight to your inbox every week.

Sysadmins © 2022. Royce theme by Just Good Themes. Powered by Ghost.

Back to top