Productionizing Apache Spark (Data Pipelines)

Apache Spark On Production (for Data Pipelines). This is the second post about running Spark in production; you can read the first post here. In the first post, we talked briefly about Spark, then discussed the data exploration use case and compared the different tools available. In…

Hive, a must-know tool for any data engineer

Hive is a data warehouse system built on top of Hadoop for querying and managing data sets. Who? Hive was created by Facebook and is now widely adopted by many firms, including Netflix, Facebook and Bookings. Why? Not everyone is fond of writing Java programs for every problem…

The best language for data scientists/engineers!?

I recently answered this question from a friend and would like to share my answer with you. The original question was: as a data scientist or data engineer, what is the best language for your job, Java or Python, especially in the NLP and text mining field? It mainly depends on…

Simple Redis: A Simple Interface For Using Redis

In my last post, I talked briefly about Redis and how to install it. In this post, I will go deeper and introduce a very simple interface for using Redis from Java in seconds. First, I would like to introduce you to some important commands in…

Redis: Installation and configuration

Redis is a well-known caching layer and in-memory database used in many large-scale projects. Redis is used by Twitter, GitHub, Pinterest, Snapchat, StackOverflow and Flickr. It supports data structures such as strings, hashes, lists, sets, sorted sets, bitmaps and geospatial indexes with radius queries. Some common…

[CleanCode] Meaningful Names [Part 2]

In this series of posts, I will try to share with you some of my highlights from the Clean Code chapters. In this post my focus will be on Chapter 2. This is the second post discussing Chapter 2, so if you didn't read the first one, please go and check…

[CleanCode] Meaningful Names [Part 1]

In this series of posts, I will try to share with you some of my highlights from the Clean Code chapters. Names are everywhere in software. We name our variables, our functions, our arguments, classes, and packages. Because we do so much of it, we’d better do it well. What…

[Clean Code] Some thoughts about Clean Code?

In this series of posts, I will try to share with you some of my highlights from the Clean Code chapters. In this post my focus will be on Chapter 1. We can define clean code as code that always looks like it was written by someone who cares. "I…