Data mining

Here we have a lot of information about different topics.

Slides

Data mining.

Model

Material

Clustering

Slides clustering

Association rules (Apriori)

Slides Apriori algorithm

Principal Components Algorithms(PCA)

Slides PCA

In this section we will find information about different topics as Association rules, clustering and, reduction of dimensionality.

Laboratories

Apriori algorithm

We are interested in find out if the products \(X\) and \(Y\) are sistutes or complementaries. in a database with transactions made it of inovices in a period of time:

id

Products

1

Milk, Bread,…,

2

Eggs, Coce,…,

3

Milk, Cookies,…,

.

N

Oranges, lettuce

How to known what products will be bought togheter of a total of \(N\)? if we assest each possible combination then will have a computational complexity of

\begin{equation} 3^{N} - 2^{N-1} + 1 \end{equation}

Therefore the Apriori algorithm allow us solve this in a more effcient way.

Clustering

The unsupervised learning is a field of learning.

Warning

The models based in distance are affected by curse of dimensionality to see more information about that please refer to.