You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Parallel and distributed computing are techniques to improve speed and handle large-scale tasks by dividing problems into smaller parts. Parallel computing uses multiple processors in one system (shared memory) for simultaneous computation, while distributed computing connects multiple independent computers (network) to act as one system.
Parallelizing matrix multiplication using Cuda C. Tiling is also implemented to compare results. This repository is submitted as the third assignment for the CSC447 (Parallel Programming for Multicore and Cluster Systems) course at the Lebanese American University.
This team project is presented as the final project for the CSC447 (Parallel Programming for Multicore and Cluster Systems) course at the Lebanese American University under the supervision of Dr. Hamdan Abdellatef.
CLE Third Assignment - The objective of this project was to take the second general problem, which have been discussed in the lab classes and for which we have developed both a multithreaded and a multiprocess solution. The aim now was to convert it into a CUDA program to be ran in a GPU under Linux.
Co-occurrence matrices act as the input to many unsupervised learning algorithms, including those that learn word embedding, and modern spectral topic models. However, the computation of these inputs often takes longer time than the inference. While much thought has been given to implementing fast learning algorithms. The co-occurrence matrix co…