International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 137 | Views: 192 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 8, August 2015


Big Data Analytics Framework using Machine Learning on Multiple Datasets

Surekha Sharad Muzumdar | Jharna Majumdar [9]


Abstract: Over 2.5 quintillion bytes of data have been created in last two years alone. These kinds of data comes from various sources such as healthcare informatics, weather information, sensors data, cell phone GPS signals, social media, digital images and videos, transactional information, etc. Big Data refers to huge collection of data sets that are so complex that it becomes so difficult to process using traditional data processing applications. Therefore it requires new set of framework to manage and process Big Data. Map Reduce plays a significant role in processing Big Data. In this paper, the multiple datasets such as data from healthcare organization, weather dataset and movie ratings dataset are stored and organized directly to distributed file system like HDFS. Then finally data is analyzed using Apache Hive for faster query access. In this paper Machine learning techniques are used to solve a big data analytics in a better and simple way.


Keywords: Big data, Hive, Hadoop, HDFS, Machine Learning, COBWEB


Edition: Volume 4 Issue 8, August 2015,


Pages: 414 - 418


How to Download this Article?

Type Your Valid Email Address below to Receive the Article PDF Link


Verification Code will appear in 2 Seconds ... Wait

Top