{"id":12915,"date":"2013-01-01T16:16:14","date_gmt":"2013-01-01T16:16:14","guid":{"rendered":"https:\/\/www.techopedia.com\/definition\/hadoop-distributed-file-system\/"},"modified":"2017-01-17T15:27:05","modified_gmt":"2017-01-17T15:27:05","slug":"hadoop-distributed-file-system","status":"publish","type":"definition","link":"https:\/\/www.techopedia.com\/definition\/29129\/hadoop-distributed-file-system-hdfs","title":{"rendered":"Hadoop Distributed File System"},"content":{"rendered":"
The Hadoop Distributed File System (HDFS) is a distributed file system that runs on standard or low-end hardware. Developed by Apache Hadoop, HDFS works like a standard distributed file system but provides better data throughput and access through the MapReduce algorithm, high fault tolerance and native support of large data sets.\n<\/p>\n
The HDFS stores a large amount of data placed across multiple machines, typically in hundreds and thousands of simultaneously connected nodes, and provides data reliability by replicating each data instance as three different copies – two in one group and one in another. These copies may be replaced in the event of failure.\n<\/p>\n
The HDFS architecture consists of clusters, each of which is accessed through a single NameNode software tool installed on a separate machine to monitor and manage the that cluster’s file system and user access mechanism. The other machines install one instance of DataNode to manage cluster storage.\n<\/p>\n
Because HDFS is written in Java, it has native support for Java application programming interfaces (API) for application integration and accessibility. It also may be accessed through standard Web browsers.<\/p>\n","protected":false},"excerpt":{"rendered":"
What Does Hadoop Distributed File System Mean? The Hadoop Distributed File System (HDFS) is a distributed file system that runs on standard or low-end hardware. Developed by Apache Hadoop, HDFS works like a standard distributed file system but provides better data throughput and access through the MapReduce algorithm, high fault tolerance and native support of […]<\/p>\n","protected":false},"author":7813,"featured_media":0,"comment_status":"open","ping_status":"closed","template":"","format":"standard","meta":{"_acf_changed":false,"_lmt_disableupdate":"","_lmt_disable":"","om_disable_all_campaigns":false,"footnotes":""},"definitioncat":[227,222,262,228],"class_list":["post-12915","definition","type-definition","status-publish","format-standard","hentry","definitioncat-data-management","definitioncat-database","definitioncat-identity-access-governance","definitioncat-risk-management"],"acf":[],"yoast_head":"\n