A nice project here to do some local test/development ! You can find others interesting projects in the Hortonworks gallery.
PostgreSQL, BI, DWH, Hadoop, DevOps, DataOps, Machine Learning, Cloud and others topics !
Labels
Administration
Analytics
Architecture
Aster
Automation
Best practice
BI
Bitcoin
Bug
Business Intelligence
CDO
Data visualization
Databases
DataFlow
DataLake
DataMesh
DataOps
Datawarehouse
Detente
development
DevOps
ElasticSearch
enterpr1se 3.0
ETL
Flume
Fun
Games
Git
Google Cloud Platform
Graph Database
Hadoop
Hadoop 2.0
Hbase
Hive
Impala
Informatica
IoT
Java
Javascript
Jazz
Jenkins
Kafka
linux
Machine Learning
Mahout
MapReduce
Meta Development
Monitoring
Mood
Music
Oozie
Optimisation
performance
Pig
Python
Quality
R
Real Time
Scala
scam
Shark
SolR
Spark
SQL
Standards
Statistics
Stinger
Storm
SVN
Talend
Task
TED
Teradata
Thinking
Ubuntu
Useful
Web development
WTF
Yarn
Zeppelin
Zookeeper
Showing posts with label Storm. Show all posts
Showing posts with label Storm. Show all posts
Friday, November 6, 2015
Hadoop Mini Clusters !
Labels:
development,
Hadoop,
Hadoop 2.0,
Hbase,
Hive,
Kafka,
MapReduce,
Oozie,
Optimisation,
Storm,
Useful,
Yarn,
Zookeeper
Location:
Cork, Irlande
Sunday, September 21, 2014
Summingbird !
Last week, I went to a meetup about streaming platform and there was a great guy who presents Summingbird : library that lets you write MapReduce programs that look like native Scala or Java collection transformations and execute them on a number of well-known distributed MapReduce platforms, including Storm and Scalding.
Labels:
Architecture,
development,
Hadoop,
Scala,
Shark,
Storm,
Useful
Location:
San Diego, Californie, États-Unis
Wednesday, July 23, 2014
My Hadoop is not working, what can I do ?
Keep calm and ;-)
- First check your logs
- Is the service is running ? (netstat -nat | grep ...)
- Is it possible to access it ? (telnet ip port)
- Is there a problem linked with path, java libraries, environment variable or exec ?
- Am I using the correct user ?
- What is the security system in place ?
- Are nodes well synchronized ?
- What about memory issue ? (swap should be desactivated also)
Friday, October 18, 2013
Hadoop 2.0 !
Apache Hadoop 2.0 has just been released some days ago ! Hadoop is no longer only a MapReduce container but a multi data-framework container and provides High Availability, HDFS Federation, NFS and snapshot !
Wednesday, April 10, 2013
Storm & Hadoop/Hive partitioning load !
I am currently working on how Storm can load data into a partitioned Hadoop/Hive table.
This is how I do :
This is how I do :
- put hadoop libs into the Storm lib directory
- add the hadoop xml conf and parse them using conf.addRessource();
- create a HDFSBolt (implements IRichBolt)
- add some private HashMap<String partition, FSDataOutputStream fsDataOutputStream >
- override execute function (if the partition already exists use current buffer else create a new one)
Friday, March 22, 2013
Storm & real-time ETL !
Storm is a amazing scalable, fault-tolerant, open-source, real-time ETL. Let's storm !
Saturday, March 16, 2013
Main Storm daemons !
- Nimbus (The Storm JobTracker)
- Supervisor (The supervisor daemon is responsible for starting and stopping worker processes)
- UI (administration website)
Subscribe to:
Posts (Atom)