http://www.tuicool.com/articles/r2QJVr http://so.searchtech.pro/articles/2013/06/16/1371392427213.html What I believe to be the best combination is: map-reduce implementation like apache hadoop or gridgain or JPPF (for processing large datasets) + jd…