HPC-Laplace2D: A Hybrid MPI/OpenMP Solver

Overview

This repository contains a high-performance, matrix-free parallel solver for the 2D Poisson equation (the Laplace operator with a forcing term $f$):

$$-\Delta u=f(x)$$

over a square domain $\Omega=(0,1)^{2}$.

The solver uses Jacobi iteration to compute the discrete solution on a Cartesian grid. For efficiency and scalability, it combines MPI and OpenMP in a hybrid parallelization strategy.
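As a minimal serial sketch of the iteration described above, assume the standard second-order central-difference discretization with spacing $h = 1/n$ (the README does not state the scheme explicitly). Each interior value is then replaced by $\frac{1}{4}\left(u_{i-1,j}+u_{i+1,j}+u_{i,j-1}+u_{i,j+1}+h^{2}f_{i,j}\right)$. The names `jacobi_sweep` and `jacobi_solve` are hypothetical and are not the repository's API.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: one Jacobi sweep on an (n+1) x (n+1) grid stored
// row-major in a flat vector. Boundary entries are left untouched.
// Returns the max-norm of the update so the caller can test convergence.
double jacobi_sweep(const std::vector<double>& u, std::vector<double>& u_new,
                    const std::vector<double>& f, std::size_t n) {
    const std::size_t N = n + 1;  // grid points per side
    assert(u.size() == N * N && u_new.size() == N * N && f.size() == N * N);
    const double h = 1.0 / static_cast<double>(n);
    double max_diff = 0.0;
    for (std::size_t i = 1; i < n; ++i) {
        for (std::size_t j = 1; j < n; ++j) {
            const std::size_t k = i * N + j;
            // Average of the four neighbours plus the scaled forcing term.
            u_new[k] = 0.25 * (u[k - N] + u[k + N] + u[k - 1] + u[k + 1] + h * h * f[k]);
            max_diff = std::max(max_diff, std::fabs(u_new[k] - u[k]));
        }
    }
    return max_diff;
}

// Hypothetical driver: iterate until the update falls below `tolerance`
// or `max_iter` sweeps have been performed. `u` must already contain the
// boundary values; interior entries serve as the initial guess.
std::vector<double> jacobi_solve(std::vector<double> u, const std::vector<double>& f,
                                 std::size_t n, double tolerance, int max_iter) {
    std::vector<double> u_new = u;  // the copy keeps boundary values in both buffers
    for (int it = 0; it < max_iter; ++it) {
        const double diff = jacobi_sweep(u, u_new, f, n);
        u.swap(u_new);  // u now holds the newest iterate
        if (diff < tolerance) break;
    }
    return u;
}
```

With $f = 0$ and boundary data $u = x + y$, the discrete solution of this scheme is $x + y$ itself, which makes a convenient sanity check.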

Features

- Hybrid Parallelization: Combines MPI for distributed-memory domain decomposition (block partitioning by rows) with OpenMP for shared-memory multi-threading within each MPI process.
- Customizable Boundary Conditions: Supports homogeneous Dirichlet boundary conditions (the default) and user-defined non-homogeneous conditions.
- Matrix-Free Architecture: The linear system is never assembled; the stencil is applied directly to grid values held in a custom dense matrix class. The class stores its data in a flattened 1D `std::vector<double>` to keep memory access contiguous and cache-friendly.
- VTK Export: Automatically exports solutions and grid coordinates in `.vtk` format for immediate 3D visualization in tools like ParaView.
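To illustrate the VTK export feature, the following is a minimal sketch of a legacy ASCII `.vtk` writer. It assumes a uniform grid written as `STRUCTURED_POINTS`; the repository may use a different dataset type, and `write_vtk` and `to_vtk_string` are hypothetical names, not the project's functions.

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical writer: emits a legacy ASCII VTK file with the grid as
// STRUCTURED_POINTS and the solution `u` (row-major, x index fastest)
// as scalar point data.
void write_vtk(std::ostream& out, const std::vector<double>& u,
               std::size_t nx, std::size_t ny, double h) {
    assert(u.size() == nx * ny);
    out << "# vtk DataFile Version 3.0\n"
        << "Jacobi solution\n"
        << "ASCII\n"
        << "DATASET STRUCTURED_POINTS\n"
        << "DIMENSIONS " << nx << ' ' << ny << " 1\n"
        << "ORIGIN 0 0 0\n"
        << "SPACING " << h << ' ' << h << " 1\n"
        << "POINT_DATA " << nx * ny << '\n'
        << "SCALARS u double 1\n"
        << "LOOKUP_TABLE default\n";
    for (std::size_t k = 0; k < u.size(); ++k) out << u[k] << '\n';
}

// Convenience wrapper that returns the file contents as a string.
std::string to_vtk_string(const std::vector<double>& u,
                          std::size_t nx, std::size_t ny, double h) {
    std::ostringstream oss;
    write_vtk(oss, u, nx, ny, h);
    return oss.str();
}
```

Passing a `std::ofstream` to `write_vtk` would produce a file that ParaView can open; applying a "Warp By Scalar" filter there gives the 3D surface view.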

Software Architecture

The codebase is structured around two primary C++ classes to separate mathematical logic from data structure management:

1. The `Matrix` Class

A custom data structure designed to handle dense matrices efficiently. Data is stored sequentially in a row-major 1D vector. The class provides:

- Overloaded operators (`+`, `-`, etc.) for clean mathematical syntax.
- Optimized data access methods.
- Built-in calculation of matrix norms for convergence checking.
- Native `.vtk` file generation for data export.
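The storage scheme described above can be sketched as follows. This is an illustrative class, deliberately named `DenseMatrix` to make clear it is not the repository's `Matrix`; the member names and the choice of the max norm are assumptions.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical dense matrix backed by a row-major flat vector:
// element (i, j) lives at index i * cols + j.
class DenseMatrix {
public:
    DenseMatrix(std::size_t rows, std::size_t cols, double value = 0.0)
        : rows_(rows), cols_(cols), data_(rows * cols, value) {}

    double& operator()(std::size_t i, std::size_t j) {
        assert(i < rows_ && j < cols_);
        return data_[i * cols_ + j];
    }
    double operator()(std::size_t i, std::size_t j) const {
        assert(i < rows_ && j < cols_);
        return data_[i * cols_ + j];
    }

    std::size_t rows() const { return rows_; }
    std::size_t cols() const { return cols_; }

    // Element-wise difference, e.g. between two successive Jacobi iterates.
    DenseMatrix operator-(const DenseMatrix& other) const {
        assert(rows_ == other.rows_ && cols_ == other.cols_);
        DenseMatrix out(rows_, cols_);
        for (std::size_t k = 0; k < data_.size(); ++k)
            out.data_[k] = data_[k] - other.data_[k];
        return out;
    }

    // Largest absolute entry; usable as a convergence measure.
    double max_norm() const {
        double m = 0.0;
        for (double v : data_) m = std::max(m, std::fabs(v));
        return m;
    }

private:
    std::size_t rows_, cols_;
    std::vector<double> data_;
};
```

A convergence check between iterates then reads `(u_new - u_old).max_norm() < tolerance`.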

2. The `Laplace` Class (Namespace: `elliptic`)

This class encapsulates the physics and numerical methods of the problem.

- State Management: Stores the boundary conditions (`bc`), `max_iter`, and `tolerance`.
- `solver(n, f)`: The core parallel solver. It takes the grid resolution `n` and the forcing term `f` as parameters.
  - MPI Implementation: The 2D grid is divided horizontally into row blocks and distributed across MPI ranks. Halo exchange (ghost cells) is implemented to communicate the top and bottom boundary rows between adjacent ranks during each Jacobi iteration.
  - OpenMP Implementation: The heavy computational loops calculating the 4-point Jacobi stencil are parallelized across local threads. Edge rows are deliberately excluded from OpenMP parallelization to minimize thread overhead.
- `serial_solver`: A baseline single-core implementation provided for direct performance comparison and benchmarking.
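The row-block decomposition and the threaded stencil loop can be sketched without MPI calls as below. The sketch assumes interior rows are split as evenly as possible and that each rank stores its block with one ghost row above and one below; in an actual MPI code those ghost rows would be refreshed every iteration, for example with `MPI_Sendrecv`. `RowBlock`, `row_block`, and `local_sweep` are hypothetical names, and the repository's special handling of edge rows is not reproduced here.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

struct RowBlock {
    std::size_t first;  // global index of the first owned row
    std::size_t count;  // number of owned rows
};

// Hypothetical helper: contiguous block of rows owned by `rank` out of
// `size` ranks. The first (n_rows % size) ranks get one extra row.
RowBlock row_block(std::size_t n_rows, int rank, int size) {
    assert(size > 0 && rank >= 0 && rank < size);
    const std::size_t p = static_cast<std::size_t>(size);
    const std::size_t r = static_cast<std::size_t>(rank);
    const std::size_t base = n_rows / p;
    const std::size_t extra = n_rows % p;
    return {r * base + std::min(r, extra), base + (r < extra ? 1 : 0)};
}

// Hypothetical local Jacobi sweep. `u` holds (rows + 2) x cols values:
// row 0 and row rows + 1 are ghost rows, filled by the halo exchange or
// by the physical boundary. `h2f` holds h^2 * f on the same layout.
// Returns the local max-norm of the update.
double local_sweep(const std::vector<double>& u, std::vector<double>& u_new,
                   const std::vector<double>& h2f, std::size_t rows, std::size_t cols) {
    assert(u.size() == (rows + 2) * cols);
    assert(u_new.size() == u.size() && h2f.size() == u.size());
    double max_diff = 0.0;
#ifdef _OPENMP
#pragma omp parallel for reduction(max : max_diff)
#endif
    for (long i = 1; i <= static_cast<long>(rows); ++i) {
        const std::size_t row = static_cast<std::size_t>(i) * cols;
        for (std::size_t j = 1; j + 1 < cols; ++j) {
            const std::size_t k = row + j;
            u_new[k] = 0.25 * (u[k - cols] + u[k + cols] + u[k - 1] + u[k + 1] + h2f[k]);
            max_diff = std::max(max_diff, std::fabs(u_new[k] - u[k]));
        }
    }
    return max_diff;
}
```

In a distributed run, the per-rank return values would still need a global maximum (for example via `MPI_Allreduce` with `MPI_MAX`) before comparing against the tolerance.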

Build and Run Instructions

The repository includes a Makefile configured to compile the code and run automated test suites.

Use the following commands to run specific test scenarios (`make runN`, where `N` is the test number):

| Command | Description | Output |
| --- | --- | --- |
| `make run1` | Standard parallel test. Solves with homogeneous boundary conditions. | Generates `results.vtk` |
| `make run2` | Non-homogeneous test. Solves with $u=x+y$ on the boundary. | Generates `results.vtk` |
| `make run3` | Performance benchmark. Direct comparison between the parallel solver and the serial solver baseline. | Console output |
| `make run4` | Convergence analysis. Evaluates how the computational error scales with an increasing number of grid partitions ($n$). | Console output |
| `make run5` | Scalability test. Evaluates elapsed compute time as the number of CPU cores increases (1, 2, and 4 cores). | Console output |

To remove compiled objects and generated files, run `make clean`.

Performance and Results Summary

- Accuracy: Tests 1 and 2 validate the solver with homogeneous and non-homogeneous boundary conditions, respectively.
- Convergence: Test 4 shows that the numerical error decreases steadily as the grid resolution ($n$) increases, which is consistent with a convergent discretization.
- Parallel Efficiency & Scalability: Test 3 shows a significant speedup over the serial baseline. Test 5 shows near-ideal strong scaling on 1, 2, and 4 cores: doubling the core count roughly halves the elapsed execution time.
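The quantities behind these statements can be computed with a few small helpers. This is a sketch using the usual definitions (observed order from errors on two grids refined by a factor of 2, speedup $S = T_{1}/T_{p}$, efficiency $E = S/p$); the function names are hypothetical and the repository's tests may report their results differently.

```cpp
#include <cassert>
#include <cmath>

// Observed convergence order when the grid spacing is halved:
// err ~ C * h^q  implies  q = log2(err_coarse / err_fine).
double observed_order(double err_coarse, double err_fine) {
    assert(err_coarse > 0.0 && err_fine > 0.0);
    return std::log2(err_coarse / err_fine);
}

// Speedup of a parallel run relative to the serial baseline.
double speedup(double t_serial, double t_parallel) {
    assert(t_serial > 0.0 && t_parallel > 0.0);
    return t_serial / t_parallel;
}

// Parallel efficiency: 1.0 means ideal strong scaling, i.e. doubling
// the core count halves the elapsed time.
double efficiency(double t_serial, double t_parallel, int cores) {
    assert(cores > 0);
    return speedup(t_serial, t_parallel) / static_cast<double>(cores);
}
```

An efficiency close to 1.0 at 2 and 4 cores corresponds to the "doubling the cores halves the time" behaviour described for Test 5.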
