1. Review your memory configuration to maximize CPU utilisation
2. Review your YARN settings, especially the Capacity Scheduler
3. Review your application design: parameters used, join strategy, file format
Of course, keep checking your Ganglia / Ambari Metrics as you go, and voilà!
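To make the first point concrete, here is a rough sketch of the kind of back-of-the-envelope check I have in mind. It assumes you know how much memory and how many vcores each NodeManager offers to YARN (the `yarn.nodemanager.resource.memory-mb` and `yarn.nodemanager.resource.cpu-vcores` properties) and the container size your jobs request. The function name and the sample figures are made up for illustration, and the sketch ignores YARN rounding requests up to the scheduler's minimum allocation.

```python
def containers_per_node(node_memory_mb, node_vcores,
                        container_memory_mb, container_vcores=1):
    """Estimate how many containers fit on one node and which resource runs out first.

    Hypothetical helper for illustration only; it is not part of any Hadoop API.
    """
    by_memory = node_memory_mb // container_memory_mb  # containers that fit by memory
    by_cpu = node_vcores // container_vcores           # containers that fit by vcores
    containers = min(by_memory, by_cpu)

    if by_memory < by_cpu:
        limiting = "memory"
    elif by_cpu < by_memory:
        limiting = "cpu"
    else:
        limiting = "balanced"

    return {
        "containers": containers,
        "limiting_resource": limiting,
        # vcores left unused once the node is full
        "idle_vcores": node_vcores - containers * container_vcores,
    }


# Assumed example node: 64 GB and 16 vcores offered to YARN.
oversized = containers_per_node(65536, 16, container_memory_mb=8192)
rightsized = containers_per_node(65536, 16, container_memory_mb=4096)
print(oversized)
print(rightsized)
```

With these assumed numbers, 8 GB containers fill the node's memory while half the vcores sit idle, whereas 4 GB containers keep every vcore busy. Whether your jobs can actually run in the smaller container is something only your metrics can tell you.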
PS: For those who don't trust multi-tenant Hadoop clusters, please call me ;-)