Docker using Digitalocean

In this post, we show how to operate docker using Digitalocean’s cloud provider. With the introduction of docker-machine in the recent versions of Docker, spinning a new droplet with docker is a few bash command lines away.

Requirements

...
Read More

What is a Monad?

In this post, we provide the minimal category theory background that is required to define a monad.

Definition of a Category

A category contains the following:

  1. Objects: A sequence of objects, denoted by : Read More

Monitor Elasticsearch with Nagios

In this post, we demonstrate how to monitor the status of an elasticsearch cluster using Nagios on Ubuntu 14.04.

Nagios server setup

We assume that a nagios server has been configured on the address nagios.your_host_name.com as described in...

Read More

Convert XML to JSON (Java)

In this post, we present how to convert an XML document into JSON. In most cases, working with JSON documents is much more preferable compared to working with XML documents. To do so, we will make use of the following...

Read More

Introduction to Grunt

In this post, we will introduce Grunt. Grunt is a tool that helps web developers to automate the web development process. Here we focus on javascript code minification, concatenation and code quality check (JShint).

Source code

The...

Read More

Bulk Download of ArXiv's Repository

In this post, we demonstrate step-by-step how to download all papers from ArXiv. ArXiv is an open access repository in which researchers share their manuscripts before their publication to a conference or journal.

The arXiv dataset is available on...

Read More

Hello World in Apache Spark

In this post, we present a hello world application in Apache Spark.

Spark is a general engine for large-scal data processing. The main differentiating factor compared to the map-reduce framework is it’s ability to cache intermediate results in-memory.

Read More

Introduction to Spark

In this post, we will briefly introduce Apache Spark.

Spark is a general engine for large-scal data processing. The main differenting factor compared to the map-reduce framework is it’s ability to cache intermediate results in-memory.

To start with Spark,...

Read More

Elasticsearch: Unassigned Shards Problem

During index creation in some cases elasticsearch fails to assign properly its shards. Usually, this issue arises when the cluster runs out of space. If it happens to run your cluster in production, you might want to force elasticsearch to...

Read More

DevOps: Introduction to Fabric.

In this post, we will introduce Fabric.

Fabric is a simple, yet powerful, tool for remotely controlling a farm of servers from your terminal.

As a warmup example, let’s assume that you want to check the uptime of a...

Read More