
Hyperparameter optimization with Optuna #24

Merged
merged 11 commits into cmgcds:main on Sep 20, 2024

Conversation

divijghose
Member

Implement Hyperparameter Optimization using Optuna

Description

This pull request introduces hyperparameter optimization capabilities to FastVPINNs using Optuna. The changes allow for efficient, parallelized optimization across multiple GPUs.

New Files

  • fastvpinns/hyperparameter_tuning/optuna_tuner.py
  • fastvpinns/hyperparameter_tuning/objective.py

Key Changes

  1. Added Optuna integration for hyperparameter tuning
  2. Implemented GPU-aware parallelization for optimization trials
  3. Modified main_poisson2d.py to support both YAML config and optimized hyperparameters
  4. Added SQLite storage for Optuna studies to allow resuming interrupted optimizations
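
The following is a minimal, self-contained sketch of how the Optuna integration described above is typically wired together. The search space and the training stub are illustrative only; the actual objective for FastVPINNs lives in objective.py in this PR.

```python
import optuna


def train_fastvpinns_stub(learning_rate: float, n_layers: int) -> float:
    """Stand-in for a FastVPINNs training run; returns a fake loss."""
    return (learning_rate - 1e-3) ** 2 + 0.01 * n_layers


def objective(trial: optuna.Trial) -> float:
    # Hypothetical search space; the real one is defined in objective.py.
    learning_rate = trial.suggest_float("learning_rate", 1e-5, 1e-2, log=True)
    n_layers = trial.suggest_int("n_layers", 2, 6)
    return train_fastvpinns_stub(learning_rate, n_layers)


study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=20)
print(study.best_params)
```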

Usage

To use the new hyperparameter optimization feature:

  1. Run with YAML config file:
    python main_poisson2d.py input.yaml

  2. Run with hyperparameter optimization:
    python main_poisson2d.py --optimized --n-trials 200 --n-epochs 50000

The --optimized flag triggers the hyperparameter optimization process. The --n-trials argument specifies the number of optimization trials to run. The --n-epochs argument specifies the number of training iterations for each trial.
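
A sketch of how the command-line interface of main_poisson2d.py might parse these options is shown below. The flag names come from this PR; the default values and overall structure are assumptions, not the exact implementation.

```python
import argparse


def parse_args() -> argparse.Namespace:
    parser = argparse.ArgumentParser(description="FastVPINNs Poisson 2D driver")
    parser.add_argument("input_file", nargs="?", default=None,
                        help="YAML config file (omit when using --optimized)")
    parser.add_argument("--optimized", action="store_true",
                        help="Run Optuna hyperparameter optimization")
    parser.add_argument("--n-trials", type=int, default=100,     # assumed default
                        help="Number of optimization trials")
    parser.add_argument("--n-epochs", type=int, default=50000,   # assumed default
                        help="Training iterations per trial")
    return parser.parse_args()
```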

Dependencies

  • Added SQLAlchemy as a new dependency for Optuna's SQLite storage
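
Optuna relies on SQLAlchemy for its relational-database storage backends, which is why the dependency is added. A sketch of backing a study with the SQLite database used in this PR (the study name is an assumption):

```python
import optuna

study = optuna.create_study(
    study_name="fastvpinns_poisson2d",         # assumed study name
    storage="sqlite:///fastvpinns_optuna.db",  # database file added in this PR
    direction="minimize",
    load_if_exists=True,                       # re-running reuses the same study
)
```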

GPU Utilization

The optimization process automatically detects available GPUs and distributes trials across them for efficient parallel execution.
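
A common pattern for this kind of GPU-aware parallelization is sketched below, assuming a TensorFlow backend. The actual logic lives in optuna_tuner.py; here the trial number is used to pick a GPU round-robin, and Optuna runs one worker per GPU.

```python
import optuna
import tensorflow as tf

gpus = tf.config.list_physical_devices("GPU")
n_gpus = max(len(gpus), 1)


def objective(trial: optuna.Trial) -> float:
    # Pin each trial to one GPU, chosen round-robin from the trial number.
    device = f"/GPU:{trial.number % n_gpus}" if gpus else "/CPU:0"
    with tf.device(device):
        # Placeholder computation standing in for a FastVPINNs training run.
        x = tf.random.uniform((1024, 1024))
        loss = tf.reduce_mean(tf.matmul(x, x, transpose_b=True))
    return float(loss)


study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=8, n_jobs=n_gpus)
```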

Example

On a system with 2 GPUs, running:
python main_poisson2d.py --optimized --n-trials 200

will execute 200 optimization trials, automatically distributed across both GPUs.

Notes

  • The number of parallel jobs is set to the number of available GPUs by default
  • Users can adjust the hyperparameters to be optimized in the objective.py file (see the sketch after this list)
  • The optimization results are stored in fastvpinns_optuna.db for persistence
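
As mentioned in the note above, the search space is defined in objective.py. A hypothetical example of what such a definition might look like; the parameter names and ranges are illustrative, not the ones shipped in this PR.

```python
import optuna


def suggest_hyperparameters(trial: optuna.Trial) -> dict:
    # Illustrative search space; adjust names and ranges in objective.py.
    return {
        "learning_rate": trial.suggest_float("learning_rate", 1e-5, 1e-2, log=True),
        "n_layers": trial.suggest_int("n_layers", 2, 8),
        "n_neurons": trial.suggest_categorical("n_neurons", [20, 30, 50, 80]),
        "activation": trial.suggest_categorical("activation", ["tanh", "sin"]),
    }
```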

Testing

  • Tested on systems with 1 and 2 GPUs
  • Verified correct distribution of trials across available GPUs
  • Confirmed ability to resume interrupted optimization studies
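
Resuming an interrupted study amounts to re-attaching to the SQLite storage. A brief sketch (the study name is an assumption; the database file is the one created by this PR):

```python
import optuna

# Re-attach to the study stored by a previous (possibly interrupted) run.
study = optuna.load_study(
    study_name="fastvpinns_poisson2d",
    storage="sqlite:///fastvpinns_optuna.db",
)
print(f"{len(study.trials)} trials recorded so far, best value: {study.best_value}")
# Calling study.optimize(...) again appends new trials to the same study.
```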

Future Work

  • Implement visualization of hyperparameter tuning using native Optuna tools.
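
For reference, the native Optuna tools alluded to above could be used along these lines (plotly is required; the study name is an assumption):

```python
import optuna
from optuna.visualization import plot_optimization_history, plot_parallel_coordinate

study = optuna.load_study(
    study_name="fastvpinns_poisson2d",
    storage="sqlite:///fastvpinns_optuna.db",
)
plot_optimization_history(study).show()
plot_parallel_coordinate(study).show()
```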

Implementation Notes

Run modes
  1. Running with a .yaml config file leads to the usual execution.
  2. Running with the --optimized flag tunes hyperparameters with Optuna; no input file is needed.

New modules
  1. objective.py defines the objective function for tuning; it builds the FastVPINNs object and returns the metric to be optimized.
  2. optuna_tuner.py manages the hyperparameter tuning process.

Command-line arguments
  1. The number of trials and the number of training iterations per trial are accepted as arguments.

Geometry module
  1. Accepts an is_optimized argument, which is True when hyperparameter optimization with Optuna is being used (a sketch of this pattern follows this list).
  2. If is_optimized is True, the geometry module does not write out the test mesh and VTK file for each trial.
  3. Backward compatibility: the default value of is_optimized is False, so existing code using a config file works as is.

Storage and parallelization
  1. An SQLite database is created if it doesn't already exist; it can be used to resume stalled runs or for parallel execution.
  2. The available GPUs are listed and jobs are divided among them.
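
The following is an illustrative, backward-compatible pattern for the is_optimized flag described above; it is not the actual FastVPINNs geometry API, and the helper output is a placeholder. Diagnostic files are skipped when the geometry is built inside an Optuna trial, while existing callers that omit the flag behave as before.

```python
from pathlib import Path


def setup_geometry(output_dir: str, is_optimized: bool = False) -> None:
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    # ... build the mesh / finite element data here ...
    if not is_optimized:
        # Only write the test mesh and VTK file for normal runs; skipping
        # them keeps Optuna trials fast and avoids per-trial output files.
        (out / "test_mesh.vtk").write_text("# placeholder VTK output\n")


setup_geometry("output")                      # existing behaviour, writes files
setup_geometry("output", is_optimized=True)   # Optuna trial, skips files
```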
@divijghose added the enhancement (New feature or request) label on Sep 18, 2024
Collaborator

@thivinanandh left a comment

Core changes: a new parameter has been added to the geometry_2d module, with its default value set to False.

All checks passed.

@thivinanandh merged commit a2fa2da into cmgcds:main on Sep 20, 2024
22 checks passed