data-mining-course

An undergraduate course on data mining.

This project is maintained by chatox

Datasets for data mining practice sessions

This is not a full repository of datasets for data mining, but instead some datasets that are used in the practice sessions.

:file_folder: To download: go to the data/ directory in the repository.

Credits

Datasets used during practices

Filename Credits
device_db.csv, re_dataset.csv Mobile purchase data
CovidLockDownCatalonia/ CrisisNLP Team
Instacart/ Kaggle
annthyroid.csv ODDS
movie_dialog_corpus/ Movie Dialog Corpus
aemet-barcelona-airport-2016-2024.json AEMET
movielens-25M-filtered Filtered MovieLens 25M

Other datasets (used in previous years)

Filename Credits
movielens-1M.zip MovieLens
services_purchased.csv B2B service purchase history
BreadBasket_DMS.csv Vikram Venkataramanan
user_queries.csv Ahmad 2016
EstamosPorTi.json.gz Gabriele 2018
furniture-sales.csv Tableau community forum
prices-split-adjusted.csv Kaggle
DCEP-reports-en.txt.gz DCEP
instacart/ Instacart