Differences
This shows you the differences between two versions of the page.
— | hdfs_filesystem [2015/09/08 21:08] (current) – created - external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ====== HDFS (Hadoop Distributed File System) ====== | ||
+ | * [[Hadoop]] | ||
+ | * [[hdfs]] | ||
+ | |||
+ | ** NameNode ** - master server that manages the file system namespace and regulates access to files by clients | ||
+ | |||
+ | ==== About ==== | ||
+ | |||
+ | From the documentation: | ||
+ | |||
+ | HDFS is the primary distributed storage used by Hadoop applications. A HDFS cluster primarily consists of a NameNode that manages the file system metadata and DataNodes that store the actual data. | ||
+ | |||
+ | HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. HDFS provides high throughput access to application data and is suitable for applications that have large data sets. HDFS relaxes a few POSIX requirements to enable streaming access to file system data. |