Research Paper | Computer Science & Engineering | India | Volume 7 Issue 9, September 2018 | Popularity: 6.3 / 10
BigData: A Case Study of Spark Mllib and Hive
Shubhajoy Das
Abstract: The rate at which data is generated has increased tremendously over the past decade, driven by social networks, sensor networks, geographic information systems, financial institutions, and supply chains. The storage capacity of computers has increased, but the access speed of disks has not improved enough to keep pace with the growth in disk space. Big Data frameworks address this problem by analysing massive amounts of data in a distributed environment that is both horizontally and vertically scalable. Data sets with trillions of rows can be analysed quickly to provide valuable insights. Cloud service providers such as Amazon and Alibaba Cloud have made robust infrastructure available for Big Data. We study Apache Hive and Spark MLlib in profiling a Stack Overflow dataset, and the Collaborative Filtering algorithm in Spark MLlib for movie recommendations.
Keywords: BigData, Spark MLlib, Collaborative Filtering, Hadoop, Spark, Apache, Hive, Amazon AWS, HDFS
Pages: 865 - 868
Similar Articles
Masters Thesis, Computer Science & Engineering, India, Volume 12 Issue 4, April 2023
Pages: 1324 - 1330
A Thesis on News Recommendation
Abhik Naskar, Sudeshna Sarkar
Research Paper, Computer Science & Engineering, Kazakhstan, Volume 13 Issue 11, November 2024
Pages: 1485 - 1488
Enhancing Recommendation Systems with Fuzzy Logic-Based Collaborative Filtering
Yernar Seitay
Research Paper, Computer Science & Engineering, India, Volume 7 Issue 4, April 2018
Pages: 137 - 140
Intelligent Health and Education Trust Recommendation System
Bhavin Rathod, Deepraj Sawant, Tejas Shetye, Silviya D'Monte
Dissertation Chapters, Computer Science & Engineering, India, Volume 4 Issue 7, July 2015
Pages: 1721 - 1725
Secured Load Rebalancing for Distributed Files System in Cloud
Jayesh D. Kamble, Y. B. Gurav
Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 6, June 2015
Pages: 2188 - 2190
Exploring Social User Behavior with Personalized Recommendation
Karishma Ahire, K.M. Varpe