Title: Optimizing Big Data Solutions with Hadoop.
Authors: Mrs. Madhavi R K, Mrs. Prathibha.T, Mrs.Pushpa Shavi
Abstract: Big Data’ describes techniques and technologies to store, distribute, manage and analyze large-sized datasets with high-velocity. Big data can be structured, unstructured or semi-structured, resulting in incapability of conventional data management methods. Data is generated from various different sources and can arrive in the system at various rates. In order to process these large amounts of data in an inexpensive and efficient way, parallelism is used.. Hadoop is the core platform for structuring Big Data, and solves the problem of making it useful for analytics purposes. Hadoop is an open source software project that enables the distributed processing of large data sets with a very high degree of fault tolerance.
Keywords: Big Data, Hadoop, Map Reduce, HDFS, Hadoop Components
International Journal of Applied Pattern Recognition, Vol. 6, No. 2, 2019 (Special Issue)
Received: 12 Jan 2019
Accepted: 24 Mar 2019
Published online: 10 April 2019