An undergraduate course on data mining.
This project is maintained by chatox
This is not a full repository of datasets for data mining, but instead some datasets that are used in the practice sessions.
To download: go to the data/ directory in the repository.
Filename | Credits |
---|---|
mobile/ | Mobile purchase data |
CovidLockDownCatalonia/ | CrisisNLP Team |
Instacart/ | Kaggle |
annthyroid/ | ODDS |
cardiotocography/ | ODDS |
movie_dialog_corpus/ | Movie Dialog Corpus |
aemet/ | AEMET |
movielens-32M-filtered/ | MovieLens 32M |
Filename | Credits |
---|---|
movielens-1M.zip | MovieLens 1M |
movielens-25M-filtered | MovieLens 25M |
services_purchased.csv | B2B service purchase history |
BreadBasket_DMS.csv | Vikram Venkataramanan |
user_queries.csv | Ahmad 2016 |
EstamosPorTi.json.gz | Gabriele 2018 |
furniture-sales.csv | Tableau community forum |
prices-split-adjusted.csv | Kaggle |
DCEP-reports-en.txt.gz | DCEP |
instacart/ | Instacart |