HADOOP TOOL AND OPTIMIZATION DATA WAREHOUSE
Keywords:
data warehouses, optimization, big data, technology, costs.Abstract
Data warehouses have grown to be the largest databases in the organization. Data warehouses grow based on the number of users, the amount of data stored, the number of data sources, and the complexity of the reports and analytical requirements. As data warehouses grow, there are performance and concurrency issues, storage and processing costs that can become unacceptable. In such cases, organizations need to optimize their data warehouses, and the choice of optimization methods and technologies is a pressing issue.
Approaches are proposed for the use of the optimal Hadoop data storage technology, which is used in data warehouses to reduce storage and processing costs, as well as to improve the efficiency of reporting and analysis.
References
1. https://www.tadviser.ru/index.php/Статья:Большие_данные_(Big_Data)_мировой_рынок
2. https://ru.wikipedia.org/wiki/Hadoop
3. https://m.habr.com/ru/post/240405/
4. https://www.ibm.com/developerworks/ru/library/bd-hadoopyarn/index.html
5. Kimball R. The Data Warehouse Lifecycle Toolkit, 2nd Edition: Practical Techniques for Building Data Warehouse and Business Intelligence Systems/ Kimball R., Ross M., etc. – John Wiley & Sons, 2008.