Difference: DistributedFileSystemsOverview (1 vs. 2)

Revision 22010-06-16 - ChristinaBoumpouka

Line: 1 to 1
 
META TOPICPARENT name="WebHome"

Hadoop Distributed File System (HDFS)

Changed:
<
<
a small paragraph.
>
>
Developed and used as the primary storage system of the Hadoop framework, HDFS is capable of handling large files of application data which are split into blocks, replicated and stored on different nodes of the cluster, thus achieving fault-tolerance and high availability. HDFS is designed to be deployed on commodity hardware, with emphasis on batch processing and high throughput of data access rather than low latency of data access. It follows a write-once-read-many access model for files, which simplifies data coherency issues. HDFS uses a master/slave architecture, where a single master node, NameNode, manages the file system namespace and regulates access to files by clients. Although this architectural design is simple and effective, it introduces a single point of failure in an HDFS cluster. HDFS is widely used in production sites (Yahoo!, Facebook, Last.fm, etc [1]).
 

CEPH

Added:
>
>

References

  1. Applications and organizations using Hadoop
 -- IoannisKonstantinou - 10 Jun 2010
Added:
>
>
-- ChristinaBoumpouka - 16 Jun 2010

Revision 12010-06-10 - IoannisKonstantinou

Line: 1 to 1
Added:
>
>
META TOPICPARENT name="WebHome"

Hadoop Distributed File System (HDFS)

a small paragraph.

CEPH

-- IoannisKonstantinou - 10 Jun 2010

 
This site is powered by the TWiki collaboration platform Powered by Perl

No permission to view TWiki.WebBottomBar