Skip to content

Statistics

Statistics is indisputably one of the most important components and basis for any machine learning application. Thus, basic knowledge in various areas of this subfield is also indispensable if one wants to understand the algorithms behind machine learning more precisely.

Statistical Connections | Photo: dilbert.com

In general, statistical methods deal with being able to express a relationship between different variables and these inferences also mathematically. In other words, cause-effect relationships (Causation) are sought:

  • How much better does the grade on an exam get if you study more?
  • How does the election result change depending on the campaign that was run?
  • Is it safer to fly by plane or to take the train?

In order to be able to examine such correlations more precisely, data analysis also includes tools for evaluating, displaying and summarizing large amounts of data. Graphical evaluations, such as bar charts, pie charts or line charts, are just as much a part of the statistical repertoire as the calculation of mean values or medians.

Some of our Articles in the Field of Statistics

Median

What is the Median?

Learn about the median and its significance in data analysis. Explore its computation, applications, and limitations.

Arima

What is the ARIMA Model?

Master time series forecasting with ARIMA models: Learn to analyze and predict trends in data. Step-by-step guide with Python examples.

Game Theory / Spieltheorie

What is Game Theory?

Discover the power of game theory and its real-world applications in policy making, negotiation, and decision-making. Learn more in this article.

Multivariate Analysis / Multivariate Analyse

What is Multivariate Analysis?

Unlock the power of multivariate analysis: Explore techniques to analyze and uncover relationships in your data in our comprehensive guide.

Bayesian Statistics / Bayessche Statistik

What are Bayesian Statistics?

Unlocking insights with Bayesian statistics: Optimize decision-making and quantify uncertainty for robust data analysis.

Confidence Intervals / Konfidenzintervalle

What are Confidence Intervals?

Quantify uncertainty and make informed decisions with Confidence Intervals: Measure the reliability of estimates and enhance statistical analysis.

Difference between statistical methods and stochastics

In everyday language, probability theory is often mistakenly assigned to statistics, although this is not true. Statistics is merely a subfield of so-called stochastics. In addition to data analysis, this also includes probability theory, i.e. all calculations relating to random experiments such as coin tossing, dice rolling or betting.

This is important because statistical methods do not include probability calculations, even though this is sometimes erroneously claimed. Statistical calculations are clearly more important for machine learning algorithms and form one of the most significant foundations for ML. Probabilities are only used within artificial intelligence when outputting results. A machine learning algorithm will never be able to make a prediction with complete certainty. Instead, results are output with probabilities to express how certain the algorithm is about the outcome. So a probability of 99.5% means that the model is very sure that its prediction will be correct.

Conclusion

Statistical methods are one of the most important foundations for understanding and correctly applying models in the field of machine learning. The contributions in this chapter aim to explain the methods that are indispensable for basic machine learning.

Cookie Consent with Real Cookie Banner