You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Install docker (1.12 >=) on your machine. Linux and Windows users can use Kitematic to simply the process of creating and managing an Linux virtual machine for this scope
Install docker-compose (1.8 >=) on your machine
Make sure you have at least 10Gb of RAM available on the machine that is going to run the platform
Clone the project: git clone https://github.com/miciav/ExperimentsWithDocker.git
untar data files:
tar -xzvf zeppelin.tgz
tar -xzvf spark.tgz
tar -xzvf cassandra.tgz
tar -xzvf data.tgz
Platform deployment and management
Run the platform using docker-compose docker-compose up -d . This command will deploy the plaftform described in docker-compose.yml. Other yml files are for reasearch purpose only and describe a more coplex plaftform
To stop the platform type docker-compose down in the directory where docker-compose.yml is located
Utility scripts
run_plaftform.sh:
removes existing data directory
untars data files
deploys and runs the platform
destroy_platform.sh
shuts down the platform
removes associated containers
removes existing (old) compressed data files
compresses data directory into data files
peeping_data_stream.sh
allows to check that the streams of messages is coming from the producer to kafka
run_containerized_cqlsh.sh
attaches cassandra container
runs cqlsh to allow the user to interact with the database
Chagelog
05/09/2016 kafka in anonimous way, that is it can be deployed in a swarm without knowing in advance the host ip.
TODO list
consider to add StreamSet Datacollettor to simplify data ingestion