Abstract: The amount of data is growing day by day. Simultaneously, new challenges and needs are appearing. We wonder how to collect and process this large amount of data and to get the information we need for business management. Tools and technologies for that task should be scalable, distributed and with high bandwidth so that we can quickly get to the information we want. As a solution impose NoSQL databases. But the databases which are not having a high quality system for processing and analysis of data are not much helpful. Some solutions appeared that have started to address these problems but there is still room for development. Some of these solutions are Apache Hadoop and Hive, which are used every day in big companies like Facebook and Google. This paper provides insight into existing technologies that implement such solutions, which have their advantages and disadvantages, and we will try to predict how this story will develop in the future.
Keywords: Big data, Data, Data warehouse, NoSQL, Hadoop, Hive