Large-Scale Data Processing
Today's applications generate and consume large amounts of data, which drives the need for systems that store and process it efficiently. Large-scale data processing systems take advantage of cluster computing: they distribute a data set across the nodes of a cluster, where the partitions are processed in parallel. Parallel database systems and MapReduce tools are representative of this category. There is also considerable interest in systems that attempt to combine the two approaches (Greenplum, Aster Data, HadoopDB).
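
The partition-then-process-in-parallel idea can be illustrated with a minimal word-count sketch in the MapReduce style. This is a single-machine simulation, not the API of any of the systems named above: worker threads stand in for cluster nodes, and the function names (`partition`, `map_partition`, `reduce_counts`, `word_count`) are hypothetical, chosen only for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def partition(records, num_nodes):
    """Split records into num_nodes partitions in round-robin order."""
    return [records[i::num_nodes] for i in range(num_nodes)]


def map_partition(lines):
    """Map step: count words within one partition, as a single node would."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partials):
    """Reduce step: merge the per-partition counts into one result."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def word_count(records, num_nodes=3):
    """Distribute records, process each partition concurrently, then merge."""
    parts = partition(records, num_nodes)
    # Assumption: threads simulate cluster nodes; a real system would ship
    # each partition to a separate machine.
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partials = list(pool.map(map_partition, parts))
    return reduce_counts(partials)


if __name__ == "__main__":
    data = ["big data on a cluster", "data processed in parallel", "big cluster"]
    print(word_count(data))
```

Each partition is counted independently, so the map step needs no coordination between nodes; only the final merge brings the partial results together.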