An undergraduate course on data mining.
This project is maintained by chatox
This is not a comprehensive repository of data mining datasets; it contains only the datasets used in the practice sessions.
To download the files, go to the data/ directory in the repository.
Filename | Credits |
---|---|
device_db.csv, re_dataset.csv | Mobile purchase data |
CovidLockDownCatalonia/ | CrisisNLP Team |
Instacart/ | Kaggle |
annthyroid.csv | ODDS |
movie_dialog_corpus/ | Movie Dialog Corpus |
aemet-barcelona-airport-2016-2024.json | AEMET |
movielens-25M-filtered | Filtered MovieLens 25M |

Filename | Credits |
---|---|
movielens-1M.zip | MovieLens |
services_purchased.csv | B2B service purchase history |
BreadBasket_DMS.csv | Vikram Venkataramanan |
user_queries.csv | Ahmad 2016 |
EstamosPorTi.json.gz | Gabriele 2018 |
furniture-sales.csv | Tableau community forum |
prices-split-adjusted.csv | Kaggle |
DCEP-reports-en.txt.gz | DCEP |
instacart/ | Instacart |
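
Once a file has been downloaded from the data/ directory, it can be inspected with a few lines of Python. The following is a minimal sketch, not part of the course materials: the helper name `read_csv_rows`, the local `data` folder, the UTF-8 encoding, the comma delimiter, and the presence of a header row are all assumptions that may not hold for every file listed above.

```python
import csv
import gzip
from pathlib import Path


def read_csv_rows(filename, data_dir="data", limit=None):
    """Return the rows of a CSV file as a list of dicts keyed by the header row.

    Hypothetical helper: assumes a comma-separated, UTF-8 file with a header.
    """
    path = Path(data_dir) / filename
    # Open gzip-compressed files transparently; read plain text otherwise.
    opener = gzip.open if path.suffix == ".gz" else open
    rows = []
    with opener(path, mode="rt", encoding="utf-8", newline="") as handle:
        reader = csv.DictReader(handle)
        for index, row in enumerate(reader):
            # Stop early when only a preview of the file is requested.
            if limit is not None and index >= limit:
                break
            rows.append(row)
    return rows
```

For example, `read_csv_rows("annthyroid.csv", limit=5)` would return the first five rows, provided the file was saved under `data/` and matches the assumptions above. Values are returned as strings, so numeric columns need to be converted before analysis.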