An undergraduate course on data mining.
This project is maintained by chatox
This is not a full repository of datasets for data mining, but instead some datasets that are used in the practice sessions.
To download: go to the data/ directory in the repository.
| Filename | Credits |
|---|---|
| mobile/ | Mobile purchase data |
| CovidLockDownCatalonia/ | CrisisNLP Team |
| Instacart/ | Kaggle |
| annthyroid/ | ODDS |
| cardiotocography/ | ODDS |
| movie_dialog_corpus/ | Movie Dialog Corpus |
| aemet/ | AEMET |
| movielens-32M-filtered/ | MovieLens 32M |
| Filename | Credits |
|---|---|
| movielens-1M.zip | MovieLens 1M |
| movielens-25M-filtered | MovieLens 25M |
| services_purchased.csv | B2B service purchase history |
| BreadBasket_DMS.csv | Vikram Venkataramanan |
| user_queries.csv | Ahmad 2016 |
| EstamosPorTi.json.gz | Gabriele 2018 |
| furniture-sales.csv | Tableau community forum |
| prices-split-adjusted.csv | Kaggle |
| DCEP-reports-en.txt.gz | DCEP |
| instacart/ | Instacart |