Large Scale Data Processing

Today's applications generate and consume large amounts of data. This fact drives the need for developing systems that efficiently store and process large volumes of data. Taking advantage of cluster computing, large scale data processing systems distribute large data sets on the cluster nodes, where they are processed in parallel. Parallel database systems and MapReduce tools are representatives of this category. There is also much interest nowadays in systems that attempt to combine them (Greenplum, Aster Data, HadoopDB).

This topic: Sandbox > LSDP
Topic revision: r1 - 2010-01-15 - ChristinaBoumpouka

No permission to view TWiki.WebBottomBar