This repository contains an extended version of the dataset used for the paper:
Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations
If you use this dataset, or use the findings from the paper, please cite:
@INPROCEEDINGS{9209660,
author={M. {La Morgia} and A. {Mei} and F. {Sassi} and J. {Stefa}},
booktitle={2020 29th International Conference on Computer Communications and Networks (ICCCN)},
title={Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations},
year={2020},
pages={1-9},
doi={10.1109/ICCCN49398.2020.9209660}
}
The dataset contains a list of pump and dumps arranged by groups on Telegram. See the paper for a more detailed description of the dataset generation process.
The pump events are listed inside the pump_telegram.csv file.
Each row of this file contains:
- symbol: the symbol (SYM) of the pumped coin.
- group: the code of the group that arranged the pump and dump. More information about the groups can be found in the group.csv file.
- date: the pump and dump date
- hour: the pump and dump hour expressed in UTC
- exchange: the exchange targeted by the group
All the pump and dumps in the dataset are on the trading pair SYM/BTC.
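As a minimal sketch of how the file might be read, the following uses Python's standard csv module. The date and hour formats (YYYY-MM-DD and HH:MM) and the sample rows are assumptions made for illustration, so check the actual file before relying on them. The helper load_pumps is hypothetical and not part of this repository.

```python
import csv
import io
from datetime import datetime, timezone

# Assumed formats; this README does not state how date and hour are written.
DATE_FORMAT = "%Y-%m-%d"
HOUR_FORMAT = "%H:%M"


def load_pumps(handle, exchange=None):
    """Read pump events from an open CSV handle and add a UTC datetime to each.

    If exchange is given, only rows targeting that exchange are kept.
    """
    pumps = []
    for row in csv.DictReader(handle):
        if exchange is not None and row["exchange"].lower() != exchange.lower():
            continue
        # The hour column is expressed in UTC, so the result is tagged as UTC.
        row["pump_time"] = datetime.strptime(
            f"{row['date']} {row['hour']}", f"{DATE_FORMAT} {HOUR_FORMAT}"
        ).replace(tzinfo=timezone.utc)
        # All pump and dumps in the dataset are on the SYM/BTC pair.
        row["pair"] = f"{row['symbol']}/BTC"
        pumps.append(row)
    return pumps


# Made-up rows for illustration only; these are not entries from the dataset.
SAMPLE = """symbol,group,date,hour,exchange
AAA,G1,2020-01-05,17:00,binance
BBB,G2,2020-02-10,18:30,otherexchange
"""

if __name__ == "__main__":
    for pump in load_pumps(io.StringIO(SAMPLE), exchange="binance"):
        print(pump["pair"], pump["pump_time"].isoformat())
```

To read the real file instead of the sample, pass a handle opened with open("pump_telegram.csv", newline="") to load_pumps.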
We provide a script to download the transactions from the Binance exchange that we used to train the machine learning model. Each transaction contains the following fields (check the Binance documentation for further information):
- timestamp: the timestamp of the transaction
- datetime: the datetime of the transaction
- side: indicates the type of the transaction: sell or buy
- price: the trading price
- amount: the amount of money traded
- btc_volume: the trading volume expressed in BTC
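The transaction fields listed above could be summarized, for example, by totalling the BTC volume on each side of the market. This is only a sketch. It assumes each transaction is available as a dictionary keyed by the field names above and that side takes the values "buy" and "sell". The helper volume_by_side and the sample values are hypothetical.

```python
from collections import defaultdict


def volume_by_side(transactions):
    """Sum btc_volume separately for each value of the side field."""
    totals = defaultdict(float)
    for tx in transactions:
        # Values read from a CSV file arrive as strings, so convert explicitly.
        totals[tx["side"]] += float(tx["btc_volume"])
    return dict(totals)


# Made-up transactions for illustration; these are not real market data.
SAMPLE_TRANSACTIONS = [
    {"side": "buy", "btc_volume": "1.0"},
    {"side": "sell", "btc_volume": "0.5"},
    {"side": "buy", "btc_volume": "0.25"},
]

if __name__ == "__main__":
    print(volume_by_side(SAMPLE_TRANSACTIONS))
```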
The group.csv file contains the following fields:
- group_name: The name of the pump and dump group
- group_code: Abbreviation used in the dataset for the group
- last_time_checked: Last time we retrieved the pump and dumps arranged by the group from the Telegram channel
- telegram_link: Link to the Telegram channel of the group
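Assuming these four fields are the columns of group.csv, and that the group column of pump_telegram.csv holds the same value as group_code, pump events could be matched to their group names along the following lines. The helpers index_groups and attach_group_names, and all sample values, are hypothetical.

```python
import csv
import io


def index_groups(handle):
    """Map each group_code to its full row from an open group CSV handle."""
    return {row["group_code"]: row for row in csv.DictReader(handle)}


def attach_group_names(pumps, groups):
    """Return copies of the pump rows with group_name added (None if unknown)."""
    named = []
    for pump in pumps:
        group = groups.get(pump["group"])
        named.append({**pump, "group_name": group["group_name"] if group else None})
    return named


# Made-up data for illustration only; not taken from the dataset.
SAMPLE_GROUPS = """group_name,group_code,last_time_checked,telegram_link
Example Group,G1,2020-03-01,https://example.invalid/g1
"""
SAMPLE_PUMPS = [
    {"symbol": "AAA", "group": "G1"},
    {"symbol": "BBB", "group": "G9"},  # code missing from the group file
]

if __name__ == "__main__":
    groups = index_groups(io.StringIO(SAMPLE_GROUPS))
    for pump in attach_group_names(SAMPLE_PUMPS, groups):
        print(pump["symbol"], pump["group_name"])
```

A None value in group_name flags a pump whose group code has no entry in the group file, which is the case where group.csv would also need updating.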
Your contribution is very welcome!
If you want to help us keep the dataset up to date, feel free to create a pull request.
Please make sure to fill in all the fields of the pump_telegram.csv file in your pull request.
If you add pump and dumps arranged by groups that are not in the group.csv file, please update this file as well.
We will review your pull request and either merge it into the master branch or contact you for additional information.
Clone this repository and run:
pip3 install -r requirements.txt
To download all the transactions of the pump and dumps carried out on Binance, run the downloader.py script:
python3 downloader.py