Are your batch processing jobs still not meeting user expectations after putting Hadoop in the mix? Here is a great article, with a good analogy that likens the bottleneck to grocery shopping:
Corona divides the job tracker’s responsibilities in two. First, a new manager manages cluster resources and keeps an eye on what’s available in that cluster. At the same time, Corona creates a dedicated job tracker for each job, which means the job tracker no longer has to be tied to the cluster. With Corona, smaller jobs can be processed right on the requester’s own machine.
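To make the split concrete, here is a minimal toy sketch in Python. It is not Corona's actual code or API: the names `ClusterManager`, `JobTracker`, `grant`, `release`, and the `local_threshold` cutoff are all hypothetical, chosen only to illustrate the idea of one resource manager for the whole cluster plus one tracker per job, with small jobs run locally.

```python
from dataclasses import dataclass


@dataclass
class ClusterManager:
    """Hypothetical resource manager: tracks free slots per node, nothing job-specific."""
    free_slots: dict[str, int]

    def grant(self, wanted: int) -> list[str]:
        """Hand out up to `wanted` slots; returns one node name per granted slot."""
        granted: list[str] = []
        for node in self.free_slots:
            while self.free_slots[node] > 0 and len(granted) < wanted:
                self.free_slots[node] -= 1
                granted.append(node)
        return granted

    def release(self, nodes: list[str]) -> None:
        """Return previously granted slots to the pool."""
        for node in nodes:
            self.free_slots[node] += 1


@dataclass
class JobTracker:
    """Hypothetical per-job tracker: created for one job and discarded afterwards."""
    job_name: str
    num_tasks: int

    def run(self, manager: ClusterManager, local_threshold: int = 2) -> str:
        # Assumption: jobs at or below the threshold never touch the cluster.
        if self.num_tasks <= local_threshold:
            return "local"
        nodes = manager.grant(self.num_tasks)
        # A real tracker would schedule and monitor tasks on `nodes` here.
        manager.release(nodes)
        return "cluster"


manager = ClusterManager(free_slots={"node-a": 2, "node-b": 2})
small = JobTracker("small-report", num_tasks=1).run(manager)
large = JobTracker("nightly-batch", num_tasks=3).run(manager)
print(small, large)
```

The point of the design, as the excerpt describes it, is that the manager only answers "what is free?", while each job's bookkeeping lives in its own tracker, so no single tracker has to follow every job on the cluster.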
Will this improve overall throughput? I'm looking forward to giving it a dry run.