Docker using DigitalOcean

In this post, we show how to run Docker on DigitalOcean as the cloud provider. With the introduction of docker-machine in recent versions of Docker, spinning up a new droplet with Docker is only a few shell commands away.

What is a Monad?

In this post, we provide the minimal category theory background that is required to define a monad.

Definition of a Category

A category \(\mathcal{C}\) contains the following:

Objects: A collection of objects, denoted by \(obj(\mathcal{C})\): \(A, B, C\)

Qlocktwo in Angular (JavaScript)

In this post, we provide details on an AngularJS implementation of Qlocktwo.

A demo is deployed on Heroku.

Development

Set up the environment

First, you must install the JavaScript toolchain: Node.js, Bower,...

Monitor Elasticsearch with Nagios

In this post, we demonstrate how to monitor the status of an Elasticsearch cluster using Nagios on Ubuntu 14.04.

Nagios server setup

We assume that a Nagios server has been configured at the address nagios.your_host_name.com as described...

Convert XML to JSON (Java)

In this post, we show how to convert an XML document into JSON. In most cases, working with JSON documents is preferable to working with XML documents. To do so, we will make use of the following...

Import CSV files into Apache Hive

A common task for a data scientist working with big data is to load a CSV file into Apache Hive. Apache Hive is a SQL-like data warehouse sitting on top of Hadoop’s distributed file system (HDFS).

In this post,...

How to set up multiple versions of Scala on macOS

In this post, we demonstrate how to install multiple versions of Scala on macOS. We assume that the package manager Homebrew is already installed.

Installing brew

To install brew, follow the instruction...

Introduction to Grunt

In this post, we introduce Grunt. Grunt is a tool that helps web developers automate the web development process. Here we focus on JavaScript code minification, concatenation, and code quality checks (JSHint).

Source code

The...

Bulk Download of arXiv's Repository

In this post, we demonstrate step by step how to download all papers from arXiv. arXiv is an open-access repository in which researchers share their manuscripts before publication in a conference or journal.

The arXiv dataset is available on...

Hello World in Apache Spark

In this post, we present a hello world application in Apache Spark.

Spark is a general engine for large-scale data processing. Its main differentiating factor compared to the MapReduce framework is its ability to cache intermediate results in memory.

Introduction to Spark

In this post, we will briefly introduce Apache Spark.

Spark is a general engine for large-scale data processing. Its main differentiating factor compared to the MapReduce framework is its ability to cache intermediate results in memory.

To start with Spark,...

Elasticsearch: Unassigned Shards Problem

During index creation, Elasticsearch sometimes fails to assign its shards properly. Usually, this issue arises when the cluster runs out of space. If you happen to run your cluster in production, you might want to force Elasticsearch to...

Load Balancing Apache Tomcat using Nginx

In this post, we configure Nginx as a load balancer for Apache Tomcat. First, we provide the Nginx configuration for proxying a single Apache Tomcat instance. Later, we will show how this configuration can be...

DevOps: Introduction to Fabric

In this post, we will introduce Fabric.

Fabric is a simple yet powerful tool for remotely controlling a farm of servers from your terminal.

As a warm-up example, let’s assume that you want to check the uptime of a...

Configure Tomcat's Plugin in Nagios (Ubuntu)

In this post, we demonstrate how to monitor Apache Tomcat 7 using Nagios on Ubuntu 14.04.

We assume that a Nagios server has been configured at the address nagios.your_host_name.com as described in the following guide:

Nagios on Ubuntu

...

We're up and running!

This is the first post. The blog currently runs on GitHub Pages using Jekyll. It was bootstrapped from the Jekyll Now repository on GitHub.

Pretty cool: no DBs, plugins, etc. like in WordPress. Also no worries about backups...