Parallel Computing Project (2023) - Performance Optimization Using POSIX Threads and OpenMP

Project Overview

This repository contains the final project for the "Parallel Computing" course, conducted in the 6th semester of the 2022-2023 academic year at the University of Piraeus, Department of Informatics. The project focuses on performance optimization of matrix operations using parallel computing techniques, particularly POSIX Threads and OpenMP, exploring the execution times across various matrix sizes and thread counts.

The objective is to implement parallel solutions, compare them with sequential execution, and analyze the impact of optimizations on performance. The system measures execution times under different optimization levels and scheduling policies using matrix multiplications.

Course Information

Institution: University of Piraeus
Department: Department of Informatics
Course: Parallel Computing (2022-2023)
Semester: 6th

Technologies Used

C (POSIX Threads, OpenMP)
GCC Compiler for code compilation and optimizations

Optimization Techniques Explored

The project implements and benchmarks different optimization techniques using matrix multiplications under several conditions:

Sequential Execution: Without any parallelization.
Parallel Execution with POSIX Threads: Using a varying number of threads.
Parallel Execution with OpenMP: Comparing dynamic, guided, and static scheduling policies.
- Dynamic Scheduling: Allocates chunks of iterations dynamically as threads finish their assigned chunks.
- Guided Scheduling: Similar to dynamic but reduces the chunk size over time.
- Static Scheduling: Pre-assigns equal-sized chunks of iterations to each thread.
Optimizations:
- GCC -O0: No optimization
- GCC -O3: Full optimization with loop vectorization

Dataset Details

The matrices used in the computations have varying sizes:

N = 2000, 100, 4000 for smaller-scale tests
N = 8000, K = 200, M = 40000 for large-scale tests

Different thread counts and chunk sizes were used to study their impact on performance:

Chunk sizes of 4 and 100 for scheduling policies in OpenMP.

Setup Instructions

Linux Requirement

It is necessary to use a Linux-based system for this project. You can either install a Linux distribution (e.g., Ubuntu 20.04.2 LTS) directly on your machine or run it in a virtual machine using software like VirtualBox or VMware Player. Ensure that the system supports multiple CPU cores, as parallel programs will be executed.
Compilation and Execution
- Compiling the Sequential Program
  
  To compile the provided sequential matrix multiplication program:
```
gcc -O3 -Wall -Wextra -o mm mm.c
```
  This creates an executable named mm. You can run the program with:
```
./mm <N> <K> <M> <print_results> <num_threads>
```
  Where:
  - <N>, <K>, <M> are the dimensions of the matrices.
  - <print_results>: Use a non-zero value to print the results or 0 to avoid printing.
  - <num_threads>: Although not used in the sequential version, it should be provided for compatibility with parallel versions.
  For example
```
./mm 100 400 600 0 1
```
- Compiling the Parallel Program with POSIX Threads
  
  To compile your parallel program using POSIX Threads:
```
gcc -O3 -pthread -Wall -Wextra -o mm_pthreads mm_pthreads.c
```
  This creates an executable named mm_pthreads. Run the program with:
```
./mm_pthreads <N> <K> <M> <print_results> <num_threads>
```
- Compiling the Parallel Program with OpenMP
  
  To compile the program using OpenMP:
```
gcc -O3 -fopenmp -Wall -Wextra -o mm_omp mm_omp.c
```
  This creates an executable named mm_omp. Set the number of threads for OpenMP using:
```
export OMP_NUM_THREADS=<num_threads>
```
  Then run the program with:
```
./mm_omp <N> <K> <M> <print_results> <num_threads>
```
Testing and Execution

Test your programs with the following matrix sizes:
- Square Matrices: N = K = M = 2000
- Tall A and Wide B: N = 100, K = 4000, M = 20000
- Wide A and Tall B: N = 4000, K = 100, M = 20000
For more accurate results, test with larger matrices that result in execution times greater than 1 minute for the sequential version. You can use smaller matrices during development for quick testing.

Project Documentation

For detailed information about the architecture, code structure, and functionalities, refer to the Project-documentation.pdf located in the /docs directory.

Contributors

_{Dimitris Stylianou}

_{Apostolis Siampanis}

_{Konstantinos Loizidis}

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parallel Computing Project (2023) - Performance Optimization Using POSIX Threads and OpenMP

Project Overview

Course Information

Technologies Used

Optimization Techniques Explored

Dataset Details

Setup Instructions

Project Documentation

Contributors

License

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
docs		docs
LICENSE		LICENSE
README.md		README.md
mm.c		mm.c
mm_omp.c		mm_omp.c
mm_pthreads.c		mm_pthreads.c

License

dimitrisstyl7/parallel-computing-project-2023

Folders and files

Latest commit

History

Repository files navigation

Parallel Computing Project (2023) - Performance Optimization Using POSIX Threads and OpenMP

Project Overview

Course Information

Technologies Used

Optimization Techniques Explored

Dataset Details

Setup Instructions

Project Documentation

Contributors

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages