Data mining
This page collects the slides and laboratories for several data mining topics.
Slides
| Model | Material |
|---|---|
| Clustering | |
| Association rules (Apriori) | |
| Principal Component Analysis (PCA) | |
This section covers association rules, clustering, and dimensionality reduction.
Laboratories
Apriori algorithm
We are interested in finding out whether the products \(X\) and \(Y\) are substitutes or complements, given a database of transactions built from the invoices issued over a period of time:
| id | Products |
|---|---|
| 1 | Milk, Bread, … |
| 2 | Eggs, Coke, … |
| 3 | Milk, Cookies, … |
| … | … |
| N | Oranges, lettuce |
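How do we know which products will be bought together out of a total of \(N\)? If we assess every possible combination, the number of candidate itemsets grows exponentially: \(N\) products have \(2^N - 1\) non-empty subsets, so we would have a computational complexity of \(O(2^N)\).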
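The Apriori algorithm therefore allows us to solve this problem more efficiently: it discards any candidate itemset that contains an infrequent subset, so most combinations are never counted.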
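To make the idea concrete, here is a minimal sketch of Apriori in plain Python. It is an illustration under stated assumptions, not the laboratory's reference implementation: the toy `transactions`, the `min_support` threshold of 0.3, and the helper names `apriori` and `lift` are all hypothetical. The lift value is one common way to read a pair of products: a lift above 1 means they appear together more often than independence would predict (a hint that they are complements), while a lift below 1 points the other way.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    baskets = [frozenset(t) for t in transactions]
    n = len(baskets)

    def support(itemset):
        # Fraction of baskets that contain every item of the itemset.
        return sum(itemset <= b for b in baskets) / n

    # Level 1: frequent single items.
    current = {}
    for item in {i for b in baskets for i in b}:
        s = support(frozenset([item]))
        if s >= min_support:
            current[frozenset([item])] = s
    frequent = dict(current)

    k = 2
    while current:
        prev = set(current)
        # Join step: merge frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        current = {}
        for c in candidates:
            s = support(c)
            if s >= min_support:
                current[c] = s
        frequent.update(current)
        k += 1
    return frequent


def lift(frequent, x, y):
    """support({x, y}) / (support({x}) * support({y})); the pair must be frequent."""
    pair = frequent[frozenset([x, y])]
    return pair / (frequent[frozenset([x])] * frequent[frozenset([y])])


# Hypothetical toy data, not taken from the laboratory database.
transactions = [
    ["Milk", "Bread", "Eggs"],
    ["Milk", "Bread"],
    ["Milk", "Cookies"],
    ["Bread", "Eggs"],
    ["Milk", "Bread", "Cookies"],
]
frequent = apriori(transactions, min_support=0.3)
for itemset, s in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), round(s, 2))
print("lift(Milk, Cookies) =", round(lift(frequent, "Milk", "Cookies"), 3))
```

In this toy example no three-item candidate survives the prune step, so the algorithm never scans the database for triples; that is exactly the saving over checking every combination.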
Clustering
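Unsupervised learning is a branch of machine learning that looks for structure in unlabeled data. Clustering is one of its main tasks: it groups observations so that members of the same group are more similar to each other than to members of other groups.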
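As an illustration of what a clustering procedure does, the sketch below implements a bare-bones k-means in plain Python. Choosing k-means is an assumption made here for the example; this page does not say which algorithms the slides cover. The function name `kmeans`, the toy `points`, and the parameters `iterations` and `seed` are likewise hypothetical.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coords) / len(points) for coords in zip(*points))


def kmeans(points, k, iterations=20, seed=0):
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
    labels = [min(range(k), key=lambda i: sq_dist(p, centroids[i])) for p in points]
    return centroids, labels


# Hypothetical toy data: two well-separated groups in the plane.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, labels = kmeans(points, k=2)
print(labels)
print(centroids)
```

No labels are supplied at any point: the grouping comes only from the distances between observations, which is what makes the method unsupervised.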
Warning
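Distance-based models are affected by the curse of dimensionality: as the number of dimensions grows, the distances between points become increasingly similar, so they discriminate less and less between near and far neighbors.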
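The effect can be observed with a small simulation. The sketch below is only an illustration under assumptions chosen here: points drawn uniformly from the unit hypercube, 200 points per run, and a hypothetical helper `relative_contrast` that compares the farthest and nearest distances from a random query point.

```python
import math
import random


def relative_contrast(dim, n_points=200, seed=0):
    """(farthest - nearest) / nearest distance from a random query point."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))  # Euclidean distance (Python 3.8+)
    return (max(dists) - min(dists)) / min(dists)


contrasts = {dim: relative_contrast(dim) for dim in (2, 10, 100, 1000)}
for dim, c in contrasts.items():
    print(f"dim={dim:>4}  relative contrast={c:.3f}")
```

In low dimensions the nearest point is far closer than the farthest one, so the contrast is large; in high dimensions it shrinks toward zero, which is why nearest-neighbor and centroid-based methods lose discriminating power.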