publications | Arash Sarshar

2026

CNSNS

Ensemble based Closed-Loop Optimal Control using Physics-Informed Neural Networks

Jostein Barry-Straume, Adwait D Verulkar, Arash Sarshar, and 2 more authors

Communications in Nonlinear Science and Numerical Simulation, 2026

Abs DOI HTML PDF

Physics-informed neural networks offer a scalable alternative to grid-based dynamic programming for solving Hamilton–Jacobi–Bellman (HJB) equations, but stable training and accurate feedback-policy recovery remain challenging for nonlinear systems and infinite-horizon objectives. This paper presents an ensemble closed-loop physics-informed framework that learns a differentiable approximation of the cost-to-go by minimizing an HJB-residual loss and recovers the feedback policy from value gradients. Reference solutions—analytic for linear–quadratic regulators and numerical otherwise—are used to warm-start training and to quantitatively evaluate value and policy errors. A broadened benchmark suite spanning linear–quadratic regulation, higher-dimensional cubic dynamics, and cart-pole and pendulum systems is examined, with comparisons against hard constraint embedding via the theory of functional connections, a Bellman-consistent actor–critic neural baseline, and value iteration. Ablations isolate the effects of warm-starting and boundary conditioning, and residual-weight scheduling characterizes the trade-off between value learning and policy recovery. Across benchmarks, the proposed method is most competitive on nonlinear pendulum dynamics, achieving the lowest value-function error and improved policy accuracy relative to dynamic-programming baselines, while constraint-embedding methods remain stronger on several linear–quadratic cases.

2025

CAMC

Physics-informed neural networks for PDE-constrained optimization and control

Jostein Barry-Straume, Arash Sarshar, Andrey A Popov, and 1 more author

Communications on Applied Mathematics and Computation, 2025

Abs DOI HTML PDF Video

The goal of optimal control is to determine a sequence of inputs for maximizing or minimizing a given performance criterion subject to the dynamics and constraints of the system under observation. This work introduces Control Physics-Informed Neural Networks (PINNs), which simultaneously learn both the system states and the optimal control signal in a single-stage framework that leverages the system’s underlying physical laws. While prior approaches often follow a two-stage process-modeling, the system first and then devising its control—the presented novel framework embeds the necessary optimality conditions directly into the network architecture and loss function. We demonstrate the effectiveness of the novel methodology by solving various open-loop optimal control problems governed by analytical, one-dimensional, and two-dimensional partial differential equations (PDEs).
CPC

Deep operator networks for Bayesian parameter estimation in PDEs

Amogh Raj, Sakol Bun, Keerthana Srinivasa, and 2 more authors

Computer Physics Communications, 2025

Abs DOI Video

We present a novel framework combining Deep Operator Networks (DeepONets) with Physics-Informed Neural Networks (PINNs) to solve partial differential equations (PDEs) while estimating their unknown parameters. By integrating data-driven learning with physical constraints, our method achieves robust and accurate solutions across diverse scenarios. Bayesian training is implemented through variational inference, allowing for comprehensive uncertainty quantification for both data and model uncertainties. This ensures reliable prediction and parameter estimates even in noisy conditions or when some of the physical equations governing the problem are missing. The framework demonstrates its efficacy in solving forward and inverse problems, including the 1D unsteady heat equation, 2D reaction-diffusion equations, 3D eigenvalue problem, and various regression tasks with sparse, noisy observations. This approach provides a computationally efficient and generalizable method for addressing uncertainty quantification in PDE surrogate modeling.

2024

JMLMC

Improving ADAM through an Implicit-Explicit (IMEX) Time-Stepping Approach

Abhinab Bhattacharjee, Andrey A Popov, Arash Sarshar, and 1 more author

Journal of Machine Learning for Modeling and Computing, 2024

Abs DOI HTML PDF Slides

The ADAM optimizer, often used in machine learning for neural network training, corresponds to an underlying ordinary differential equation (ODE) in the limit of very small learning rates. This work shows that the classical ADAM algorithm is a first-order implicit-explicit (IMEX) Euler discretization of the underlying ODE. Employing the time discretization point of view, we propose new extensions of the ADAM scheme obtained by using higher-order IMEX methods to solve the ODE. Based on this approach, we derive a new optimization algorithm for neural network training that performs better than classical ADAM on several regression and classification problems.

2022

SISC

A Fast Time-Stepping Strategy for Dynamical Systems Equipped with a Surrogate Model

Arash Sarshar, Steven Roberts, and Adrian Sandu

SIAM Journal on Scientific Computing, 2022

Abs DOI arXiv HTML

This work proposes a new accelerated time-stepping strategy that combines information from both full complex models and inexpensive surrogate models. The approach is based on the multirate infinitesimal general-structure additive Runge-Kutta (MRI-GARK) framework.
arXiv

A Meta-learning Formulation of the Autoencoder Problem for Non-linear Dimensionality Reduction

Arash Sarshar and Adrian Sandu

arXiv preprint, 2022

Abs arXiv PDF

We show that the canonical formulation of autoencoders suffers from several deficiencies that can hinder their performance. Using a meta-learning approach, we reformulate the autoencoder problem as a bi-level optimization procedure that explicitly solves the dimensionality reduction task.
JCP

A unified formulation of splitting-based implicit time integration schemes

Adrian Sandu, Steven Roberts, and Arash Sarshar

Journal of Computational Physics, 2022

Abs DOI arXiv HTML

This work proposes a unified formulation of splitting time integration schemes in the framework of general-structure additive Runge-Kutta (GARK) methods.

2021

JCAM

Alternating directions implicit integration in a general linear method framework

Arash Sarshar, Steven Roberts, and Adrian Sandu

Journal of Computational and Applied Mathematics, 2021

Abs DOI HTML PDF

Alternating Directions Implicit (ADI) integration is an operator splitting approach to solve parabolic and elliptic partial differential equations in multiple dimensions based on solving sequentially a set of related one-dimensional equations. Classical ADI methods have order at most two, due to the splitting errors. This work proposes a new ADI approach based on the partitioned General Linear Methods framework, allowing construction of high order ADI methods.
Preprint

Linearly-Implicit General Linear Methods

Arash Sarshar and Adrian Sandu

Virginia Tech Technical Report, 2021

Abs PDF

Linearly implicit Runge-Kutta methods provide a fitting balance between implicit treatment of stiff systems and computational costs. We extend the class of linearly implicit Runge-Kutta methods to include multi-stage and multi-step methods. We discuss the order condition to achieve high stage order and overall accuracy while admitting arbitrary Jacobians.
CAMC

Parallel implicit-explicit general linear methods

Arash Sarshar, Steven Roberts, and Adrian Sandu

Communications on Applied Mathematics and Computation, 2021

Abs DOI arXiv HTML

This work proposes a unified formulation of splitting time integration schemes in the framework of general-structure additive Runge-Kutta (GARK) methods. We develop implicit-implicit (IMIM) GARK schemes and show that classical splitting methods belong to the IMIM GARK family.
JCAM

Analytical Jacobian-vector products for the matrix-free time integration of partial differential equations

Arash Sarshar, Steven Roberts, and Adrian Sandu

Journal of Computational and Applied Mathematics, 2021

Abs DOI HTML PDF

Traditional time discretization methods use a single timestep for the entire system of interest and can perform poorly when the dynamics of the system exhibits a wide range of time scales. This work extends the MRI-GARK framework by introducing coupled implicit methods to solve stiff multiscale systems.
JSC

Implicit multirate GARK methods

Adrian Sandu, Michael Günther, Steven Roberts, and 1 more author

Journal of Scientific Computing, 2021

Abs DOI arXiv HTML

This work considers multirate generalized-structure additively partitioned Runge-Kutta (MrGARK) methods for solving stiff systems of ordinary differential equations (ODEs) with multiple time scales.

2020

SISC

Coupled multirate infinitesimal GARK schemes for stiff systems with multiple time scales

Adrian Sandu, Michael Günther, Steven Roberts, and 1 more author

SIAM Journal on Scientific Computing, 2020

Abs DOI arXiv HTML

This work extends the MRI-GARK framework by introducing coupled implicit methods to solve stiff multiscale systems. The coupled approach has the potential to considerably improve the overall stability of the scheme.

2019

SISC

Design of High-Order Decoupled Multirate GARK Schemes

Arash Sarshar, Steven Roberts, and Adrian Sandu

SIAM Journal on Scientific Computing, 2019

Abs DOI arXiv HTML Video Slides

Multirate time integration methods apply different step sizes to resolve different components of the system based on the local activity levels. This local selection of step sizes allows increased computational efficiency while achieving the desired solution accuracy. This work focuses on the design of practical high-order multirate methods using the theoretical framework of generalized additive Runge-Kutta (MrGARK) methods.

2017

CF

A numerical investigation of matrix-free implicit time-stepping methods for large CFD simulations

Paul Tranquilli, Arash Sarshar, and Adrian Sandu

Computers & Fluids, 2017

Abs DOI arXiv HTML

This paper is concerned with development and testing of advanced time-stepping methods for large unsteady CFD problems. We compare explicit methods with matrix-free implementations of implicit, linearly-implicit, as well as Rosenbrock-Krylov methods.

2013

EuCAP

Design of a stacked stub-loaded patch element for X-band reflectarray antenna with true time delay

Arash Sarshar, A Khodabandeh, and N Komjani

In 2013 European Conference on Antennas and Propagation (EuCAP), 2013

Abs HTML

A multilayer reflectarray element consisting of a stacked stub-loaded patch is proposed for X-band linear polarization reflectarray antenna.