Big Data and Data Analytics
The main goal of my presentation is to introduce the participants to the most novel concepts in computational statistics and their implications into broader range of decisions based on Big Data. These days a data scientist has to work simultaneously on at least three different fronts. First, the data gathering plan called the design. Invariably, designs become extremely useful as we are flooded with data. Designs help us to focus on the aim of the study and on reduction of complexity. Secondly, the software environment we choose to analyze our data.
The competition here is quite strong but the future will belong to open source solutions. The Author of this presentation belongs to the club of R-ofiles, that is a world-wide community of data scientists willing to share their tools. Finally, the third front of the battle of the data scientist is selection of an appropriate statistical algorithm. Here the Big Data revolution has dramatically changed the perspective. We now move much more audaciously to extremely high dimensional data with new statistical tools. The concepts of the talk will be illustrated with examples from the Authors experience, that is from signal processing, medical and financial data.