Showing posts with label Shark. Show all posts
Showing posts with label Shark. Show all posts

Friday, December 19, 2014

I want the last Hadoop improvement / new tool !

For those who are really excited by the latest improvement of Hadoop, I invite them to use it first on a development environment. Sometimes Hadoop is a little too fresh and they are some bugs.

That is why also having Hadoop, most of the time, implies to do migration every 6 months to get the latest patches.

Sunday, September 21, 2014

Summingbird !

Last week, I went to a meetup about streaming platform and there was a great guy who presents Summingbird : library that lets you write MapReduce programs that look like native Scala or Java collection transformations and execute them on a number of well-known distributed MapReduce platforms, including Storm and Scalding.

Friday, October 18, 2013

Hadoop 2.0 !

Apache Hadoop 2.0 has just been released some days ago ! Hadoop is no longer only a MapReduce container but a multi data-framework container and provides High Availability, HDFS Federation, NFS and snapshot !

Saturday, April 6, 2013

Scala !

I am learning Scala, very powerful and used inside Shark !