Hadoop and its Relation with Cloud
Hadoop is an open source, distributed processing framework that is used to develop software for reliable, scalable, distributed computing; the kind of distributed computing that is needed to nurture the growth of big data. It manages the processing of data and storage for big data applications that runs in clustered systems. The emerging ecosystem of big data technologies tactically places Hadoop in the centre. These technologies are used to support advanced analytics initiatives that include predictive analytics, data mining, and machine learning applications.
At core, Hadoop utilizes modules such as Hadoop Distributed File System (HDFS), Hadoop MapReduce and Hadoop YARN. HDFS is a file system that is distributed by EMC, Intel and IBM to provide high throughput access to application data. The motive is to distribute the processing of large data sets over clusters of low-cost computers. Similarly, Hadoop MapReduce is a component that allows users to distribute large sets of data over a series of computers for parallel processing, while Hadoop YARN is a skeletal framework that manages job scheduling and the management of cluster resources.
Since cloud technology is ideally suitable to deliver the required computation power for big data applications; merging it with the Hadoop framework eases the processing of large, parallel data sets. It can also recourse enormous amounts of computing power that can be scaled as needed. Hadoop can be looked at as a platform that runs on cloud computing and provides users with distributed data mining, complementing the growth of data-driven technologies and cloud infrastructures.
By Leni Kaufman, VP & CIO, Newport News Shipbuilding
By George Evans, CIO, Singing River Health System
By John Kamin, EVP and CIO, Old National Bancorp
By Elliot Garbus, VP-IoT Solutions Group & GM-Automotive...
By Gregory Morrison, SVP & CIO, Cox Enterprises
By Alberto Ruocco, CIO, American Electric Power
By Sam Lamonica, CIO & VP Information Systems, Rosendin...
By Sergey Cherkasov, CIO, PhosAgro
By Pascal Becotte, MD-Global Supply Chain Practice for the...
By Stephen Caulfield, Executive Director, Global Field...
By Shamim Mohammad, SVP & CIO, CarMax
By Ronald Seymore, Managing Director, Enterprise Performance...
By Brad Bodell, SVP and CIO, CNO Financial Group, Inc.
By Jim Whitehurst, CEO, Red Hat
By Clark Golestani, EVP and CIO, Merck
By Scott Craig, Vice President of Product Marketing, Lexmark...
By Dave Kipe, SVP, Global Operations, Scholastic Inc.
By Meerah Rajavel, CIO, Forcepoint
By Amit Bahree, Executive, Global Technology and Innovation,...
By Greg Tacchetti, CIO, State Auto Insurance