statistical software

Tidy Data, Tidy Types, and Tidy Operations

The notion of tidy data is a concept known from R and used in many available libraries and frameworks today with great success. Tidy data together with proper data types and semantically allowed operations simplifies data science, machine learning and data stewardship by a large margin. In this article we will highlight the core properties of "Tidy Data, Tidy Types, and Tidy Operations" with the help of a concise example and how those properties can be successively achieved and maintained.