- Big data is a complex term which is difficult to manage a traditional system like RDBMS, DB2, SQL etc.
- Hadoop is an open-source software framework, for storing massive data and running applications on a cluster of the commodity.
- It provides massive storages, enumerates processing power and ability to handle multiple concord task or job.
* Characteristics of Hadoop :
1. Open source -> Free of cost
2. Volume -> No limit of data
3. Velocity -> Multithreading and Parallel processing
4. Veracity -> 100% secure
5. Verities -> Structure and unstructured
6. Variability -> Dynamic behaviour
7. Visualization -> Visualizing meaningful usage of data.
* Features of Hadoop:
1. Distributed processing
2. Fault Tolerance
3. Reliability
4. High Availability
5. Scalability
6. Economic
7. Easy to use.
* Limitations of Hadoop:
1. Issues with small files.
2. Support only batch processing.
3. Iterative Processing.
4. Vulnerable by natures.
5. Security.