Fortran Statistics and Machine Learning

Important

FSML is in a pre-alpha state. Existing procedures and API may change significantly.

Description

FSML (Fortran Statistics and Machine Learning) is a scientific toolkit consisting of common statistical and machine learning procedures, including basic descriptive statistics (e.g., mean, variance, correlation), common statistical tests (e.g., t-test, Mann–Whitney U), linear parametric methods and models (e.g., multiple OLS regression, discriminant analysis), and non-linear statistical and machine learning procedures (e.g., k-means clustering).

Key features:

Common statistics and machine learning techniques (as used in modern research).
Familiar/intuitive interface (similarities to popular Python or R libs).
Core procedures are kept pure (to simplify parallelisation and testing), while impure wrappers handle optional arguments and errors for safe conventional use.
Minimal requirements/dependencies (Fortran 2008 or later, and stdlib).

Handbook

The FSML Handbook. includes a short tutorial, detailed API documentation, as well as information for contributors and licence (MIT) details. The documentation pages were generated by FORD.

Development

Aims and Scope

The aim is to create an easy-to-use library for modern Fortran applications that covers many statistics and machine learning procedures that are commonly used in research.

Background

FSML started as an effort to rewrite, re-structure, clean-up, and enhance old Fortran code I've written in the past 15 years, and to bundle and publish it as a well organised and well documented library.

The published research below uses some of the to-be-reworked code and demonstrates some applications of the above-mentioned methods:

Mutz and Ehlers (2019) (k-means and hierarchical clustering, and discriminant analysis).
Mutz et al. (2015) (multiple regression in cross validation and bootstrap setting, principal component analysis, and Bayesian classifier).

Progress

Currently covered are procedures for sample statistics (STS), statistical distributions (DST) and statistical tests (TST). See the full list here. Additionally planned are procedures that rely heavily on linear algebra (e.g., PCA), nonlinear algorithmic procedures (e.g., k-means clustering), and machine learning framework extensions (e.g., cross-validation).

Alpha

I will consider the library to be in "alpha" once FSML covers all of the originally planned functionality.

Beta

This stage is reached once FSML:

has undergone substantial testing (incl. comparisons to other libs).
has proper documentation.
fully works with GFortran and LFortran compilers.

Name		Name	Last commit message	Last commit date
Latest commit History 159 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
app		app
doc		doc
example		example
src		src
test		test
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENCE		LICENCE
README.md		README.md
doc.md		doc.md
fpm.toml		fpm.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fortran Statistics and Machine Learning

Description

Handbook

Development

Aims and Scope

Background

Progress

Alpha

Beta

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

sebastian-mutz/fsml

Folders and files

Latest commit

History

Repository files navigation

Fortran Statistics and Machine Learning

Description

Handbook

Development

Aims and Scope

Background

Progress

Alpha

Beta

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages