Skip to content

deepestML/pump-and-dump-dataset

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pump and dump dataset

This repository contains an extended version of the dataset used for the paper:

Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations

If you use this dataset, or use the findings from the paper, please cite:

@INPROCEEDINGS{9209660,
  author={M. {La Morgia} and A. {Mei} and F. {Sassi} and J. {Stefa}},
  booktitle={2020 29th International Conference on Computer Communications and Networks (ICCCN)}, 
  title={Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations}, 
  year={2020},
  pages={1-9},
  doi={10.1109/ICCCN49398.2020.9209660}
  }

The dataset

The dataset contains a list of pump and dumps arranged by groups on Telegram. See the paper for a more detailed description of the dataset generation process. The pump events are listed inside the pump_telegram.csv file.

The pump and dumps file (pump_telegram.csv)

Each row of this file contains:

  • symbol: the symbol (SYM) of the pumped coin.
  • group: the code of the group that arranged the pump and dump. More information about the groups can the found in the group.csv file
  • date: the pump and dump date
  • hour: the pump and dump hour expressed in UTC
  • exchange: the exchange targeted by the group

All the pump and dumps in the dataset are on the trading pair SYM/BTC.

We provide a script to download the transactions from the Binance exchange that we used to train the machine learning model. Each transaction contains the following fields, check the Binance documentation for further information:

  • timestamp: the timestamp of the transaction
  • datetime: the datetime of the transaction
  • side: indicates the type of the transaction: sell or buy
  • price: the trading price
  • amount: the amount of money traded
  • btc_volume: the trading volume expressed in BTC

The group file (group.csv)

  • group_name: The name of the pump and dump group
  • group_code: Abbreviation used in the dataset for the group
  • last_time_checked: Last time we retrieved the pump and dumps arranged by the group from the Telegram channel
  • telegram_link: Link to the Telegram channel of the group

Contribution

Your contribution is very welcome!

If you want to help us mantaining the dataset updated, feel free to create a pull request.

Please, make sure to fill all the fields of the pump_telegram.csv in your pull request. If you add pump and dumps arranged by groups that are not in the group.csv file, please update this file as well.

We will review your pull request and merge it on the master branch or contact you for additional information.

Installation

Clone this repository and run:

pip3 install -r requirements.txt

To download all the transactions of the pump and dumps carried out on Binance run the downloader.py script.

python3 downloader.py

About

The dataset contains a list of pump and dumps arranged by groups on Telegram.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%