Principles of Green Data Mining
- Johannes Schneider, University of Liechtenstein, Vaduz, Liechtenstein
- Marcus Basalla, University of Liechtenstein, IWI, Vaduz, Liechtenstein
- Stefan Seidel, University of Liechtenstein, IWI, Vaduz, Liechtenstein
AbstractThis paper develops a set of principles for green data mining, related to the key stages of business un- derstanding, data understanding, data preparation, modeling, evaluation, and deployment. The principles are grounded in a review of the Cross Industry Stand- ard Process for Data mining (CRISP-DM) model and relevant literature on data mining methods and Green IT. We describe how data scientists can contribute to designing environmentally friendly data mining pro- cesses, for instance, by using green energy, choosing between make-or-buy, exploiting approaches to data reduction based on business understanding or pure statistics, or choosing energy friendly models.
Return to previous page