Papers by Tom Bertalan

This dissertation discusses coarse-graining methods and applications for simulations of large heterogeneous populations of neurons. These simulations are structured as large coupled sets of ordinary differential equations describing the state evolution for many qualitatively similar, but quantitatively distinct, individual units. In full generality, the direct coupling between these units is not all-to-all, but is mediated through a directed network. With sufficiently strong coupling and weak heterogeneity across the population, a common outcome for such simulations is synchronization. Here, the states of individual units, while not identical, can be neatly approximated by a smooth function of some latent independent variable. It is this smooth structure that we seek to exploit in this dissertation for both didactic and computational purposes. Briefly put, the polynomial chaos expansion (PCE) methods used in this dissertation are reminiscent of Fourier expansions, recast in a setting...
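To make the PCE idea concrete, here is a minimal sketch (synthetic data, not from the dissertation): heterogeneous node states x_i are fit by a low-order Legendre expansion in each node's intrinsic parameter omega_i, so that a handful of coefficients summarizes the whole population.

```python
# Sketch: fit heterogeneous node states x_i by a low-order Legendre (PCE-style)
# expansion in each node's intrinsic parameter omega_i. Toy data, illustrative.
import numpy as np

rng = np.random.default_rng(0)
omega = np.sort(rng.uniform(-1.0, 1.0, 200))     # intrinsic parameter per node
x = np.tanh(2.0 * omega) + 0.01 * rng.normal(size=omega.shape)  # synchronized states

order = 5
coeffs = np.polynomial.legendre.legfit(omega, x, order)  # x(omega) ~ sum_k c_k P_k(omega)
x_pce = np.polynomial.legendre.legval(omega, coeffs)
print("max reconstruction error:", np.abs(x - x_pce).max())

# Coarse-graining step: evolve the six coefficients c_k instead of all 200
# node states; the smooth dependence of x on omega is what makes this accurate.
```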
We propose an approach to learn effective evolution equations for large systems of interacting agents. This is demonstrated on two examples: a well-studied system of coupled normal-form oscillators and a biologically motivated example of coupled Hodgkin-Huxley-like neurons. For such systems there is no obvious space coordinate in which to learn effective evolution laws in the form of partial differential equations. In our approach, we accomplish this by first learning embedding coordinates from the time series data of the system using manifold learning. In these emergent coordinates, we then show how one can learn, using neural networks, effective partial differential equations that not only reproduce the dynamics of the oscillator ensemble, but also capture the collective bifurcations when system parameters vary. The proposed approach thus integrates the automatic, data-driven extraction of emergent space coordinates parametrizing the agent dynamics with machine...
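A hedged sketch of the first step, using a basic diffusion-map construction on toy oscillator time series; the kernel-scale (median) heuristic and the synthetic data are illustrative choices, not the paper's exact pipeline.

```python
# Sketch: diffusion maps on toy oscillator time series to find one emergent
# "space" coordinate. The median kernel-scale heuristic is an illustrative choice.
import numpy as np

rng = np.random.default_rng(1)
phase = rng.uniform(0, 2 * np.pi, 100)[:, None]   # one hidden parameter per agent
t = np.linspace(0, 10, 400)[None, :]
X = np.sin(t + phase)                              # (n_agents, n_times) time series

D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)  # pairwise distances
eps = np.median(D) ** 2
K = np.exp(-(D ** 2) / eps)

d = K.sum(axis=1)
A = K / np.sqrt(np.outer(d, d))        # symmetric conjugate of the Markov matrix
vals, vecs = np.linalg.eigh(A)         # eigenvalues in ascending order
phi = vecs / np.sqrt(d)[:, None]       # eigenvectors of the Markov matrix
emergent_coord = phi[:, -2]            # first nontrivial diffusion coordinate

# 'emergent_coord' orders the agents smoothly (here, by their hidden phase);
# a neural network can then learn a PDE u_t = f(u, u_s, u_ss, ...) with this
# coordinate s playing the role of the independent spatial variable.
```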
We identify effective stochastic differential equations (SDEs) for coarse observables of fine-grained particle- or agent-based simulations; these SDEs then provide coarse surrogate models of the fine-scale dynamics. We approximate the drift and diffusivity functions in these effective SDEs through neural networks, which can be thought of as effective stochastic ResNets. The loss function is inspired by, and embodies, the structure of established stochastic numerical integrators (here, Euler-Maruyama and Milstein); our approximations can thus benefit from the error analysis of these underlying numerical schemes. They also lend themselves naturally to “physics-informed” gray-box identification when approximate coarse models, such as mean-field equations, are available. Our approach does not require long trajectories, works on scattered snapshot data, and is designed to naturally handle different time steps per snapshot. We consider both the case where the coarse collective observables are known...
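The following sketch illustrates the Euler-Maruyama variant of such a loss in PyTorch, under illustrative assumptions (a one-dimensional observable, small MLPs, synthetic Ornstein-Uhlenbeck-like snapshot pairs); the Milstein variant and the gray-box extension are not shown.

```python
# Sketch of the Euler-Maruyama-based loss (network sizes, toy data, and the
# exp-parametrized diffusivity are illustrative choices, not the paper's exact setup).
import torch
import torch.nn as nn

def mlp(width=32):
    return nn.Sequential(nn.Linear(1, width), nn.Tanh(), nn.Linear(width, 1))

f, g = mlp(), mlp()   # drift network and log-diffusivity network

def em_nll(x0, x1, h):
    # negative log-likelihood of x1 under the Gaussian implied by one EM step
    mean = x0 + f(x0) * h
    var = torch.exp(g(x0)) ** 2 * h + 1e-8
    return ((x1 - mean) ** 2 / (2 * var) + 0.5 * torch.log(var)).mean()

# scattered snapshot pairs (x0, x1) a time h apart -- no long trajectories needed
x0 = torch.randn(256, 1)
h = 0.01 * torch.ones(256, 1)
x1 = x0 - x0 * h + 0.5 * torch.sqrt(h) * torch.randn_like(x0)  # OU-like toy data

opt = torch.optim.Adam([*f.parameters(), *g.parameters()], lr=1e-3)
for _ in range(200):
    opt.zero_grad()
    loss = em_nll(x0, x1, h)
    loss.backward()
    opt.step()
```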
Finding accurate reduced descriptions for large, complex, dynamically evolving networks is a crucial enabler of their simulation, analysis, and, ultimately, design. Here we propose and illustrate a systematic and powerful approach to obtaining good collective coarse-grained observables -- variables successfully summarizing the detailed state of such networks. Finding such variables can naturally lead to successful reduced dynamic models for the networks. The main premise enabling our approach is the assumption that the behavior of a node in the network depends (after a short initial transient) on the node identity: a set of descriptors that quantify the node properties, whether intrinsic (e.g. parameters in the node evolution equations) or structural (imparted to the node by its connectivity in the particular network structure). The approach creates a natural link with modeling and "computational enabling technology" developed in the context of Uncertainty Quantification...
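A minimal sketch of the premise on synthetic data: node states are regressed on a polynomial in the node identity (here, an intrinsic parameter p and a structural descriptor, the node degree k); all names and data are illustrative.

```python
# Sketch of the premise: after transients, a node's state is (approximately)
# a smooth function of its "identity" -- an intrinsic parameter p and a
# structural descriptor, its degree k. All data below are synthetic.
import numpy as np

rng = np.random.default_rng(2)
n = 500
p = rng.uniform(0, 1, n)                  # intrinsic heterogeneity
k = rng.integers(5, 50, n).astype(float)  # degrees from some network
x = 0.5 * p + 0.02 * k + 0.3 * p * k / 50 + 0.01 * rng.normal(size=n)

# quadratic polynomial in the identity (p, k): the few coefficients are the
# collective coarse-grained observables summarizing the detailed network state
A = np.column_stack([np.ones(n), p, k, p * k, p**2, k**2])
coeffs, *_ = np.linalg.lstsq(A, x, rcond=None)
resid = x - A @ coeffs
print("identity explains the states up to residual std:", resid.std())
```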
We propose, and illustrate via a neural network example, two different approaches to coarse-graining large heterogeneous networks. Both approaches are inspired by, and use tools developed in, methods for uncertainty quantification in systems with multiple uncertain parameters; in our case, the parameters are heterogeneously distributed on the network nodes. The approach shows promise in accelerating large-scale network simulations as well as coarse-grained fixed-point, periodic-solution, and stability analysis. We also demonstrate that the approach can successfully deal with structural as well as intrinsic heterogeneities.
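One way such a coarse-graining enables coarse fixed-point analysis is through an equation-free lift-evolve-restrict loop; the sketch below is illustrative (toy mean-field dynamics, Legendre lifting), not the paper's specific construction.

```python
# Sketch of coarse fixed-point analysis: represent the ensemble by a few
# expansion coefficients c, "lift" to node states, evolve the full network
# briefly, "restrict" back, and solve Phi_T(c) - c = 0 matrix-free.
import numpy as np
from scipy.integrate import solve_ivp
from scipy.optimize import newton_krylov

omega = np.linspace(-1, 1, 200)                     # node heterogeneity
basis = np.polynomial.legendre.legvander(omega, 3)  # lifting basis, order 3

def lift(c):     return basis @ c
def restrict(x): return np.linalg.lstsq(basis, x, rcond=None)[0]

def full_rhs(t, x):          # toy heterogeneous network with mean-field coupling
    return -x + np.tanh(omega + x.mean())

def coarse_map(c, T=1.0):    # lift -> evolve -> restrict
    sol = solve_ivp(full_rhs, (0, T), lift(c), rtol=1e-8)
    return restrict(sol.y[:, -1])

c_star = newton_krylov(lambda c: coarse_map(c) - c, np.zeros(4))
print("coarse fixed-point coefficients:", c_star)
```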
We present an approach, based on learning an intrinsic data manifold, for the initialization of the internal state values of LSTM recurrent neural networks, ensuring consistency with the initial observed input data. Exploiting the generalized synchronization concept, we argue that the converged, "mature" internal states constitute a function on this learned manifold. The dimension of this manifold then dictates the length of observed input time series data required for consistent initialization. We illustrate our approach through a partially observed chemical model system, where initializing the internal LSTM states in this fashion yields visibly improved performance. Finally, we show that learning this data manifold enables the transformation of partially observed dynamics into fully observed ones, facilitating alternative identification paths for nonlinear dynamical systems.
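A hedged sketch of the underlying idea: run the LSTM long enough that its internal states mature (generalized synchronization), harvest (observation window, internal state) pairs, and fit a map from windows to states for later consistent initialization. The linear fit and the toy sine observable below are illustrative stand-ins for the manifold-based construction.

```python
# Sketch: harvest (observation window -> mature internal state) pairs from a
# long run, then fit a map for consistent initialization. The LSTM here is
# untrained and the sine observable is a toy stand-in.
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=1, hidden_size=16, batch_first=True)
series = torch.sin(torch.linspace(0, 60, 2000)).reshape(1, -1, 1)

window = 20            # length of observed history used for initialization
pairs_in, pairs_out = [], []
h = None
with torch.no_grad():
    for t in range(1, series.shape[1]):
        _, h = lstm(series[:, t - 1 : t, :], h)       # one step at a time
        if t > 500 and t >= window:                   # states are "mature" now
            pairs_in.append(series[0, t - window : t, 0])
            pairs_out.append(torch.cat([h[0].flatten(), h[1].flatten()]))

X, Y = torch.stack(pairs_in), torch.stack(pairs_out)
W = torch.linalg.lstsq(X, Y).solution   # linear stand-in for the learned map
init_state = X[-1] @ W                  # internal state consistent with the window
```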
We propose a deep-learning-based method for obtaining standardized data coordinates from scientific measurements. Data observations are modeled as samples from an unknown, non-linear deformation of an underlying Riemannian manifold, which is parametrized by a few normalized latent variables. By leveraging a repeated-measurement sampling strategy, we present a method for learning an embedding in R^d that is isometric to the latent variables of the manifold. These data coordinates, being invariant under smooth changes of variables, enable matching between different instrumental observations of the same phenomenon. Our embedding is obtained using a LOcal Conformal Autoencoder (LOCA), an algorithm that constructs an embedding to rectify deformations by using a local z-scoring procedure while preserving relevant geometric information. We demonstrate the isometric embedding properties of LOCA on various model settings and observe that it exhibits promising interpolation and extrapolation capabilities...
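A sketch of the local z-scoring loss at LOCA's core, under illustrative assumptions (two-dimensional latent space, a hand-picked toy deformation, isotropic measurement bursts of known scale sigma):

```python
# Sketch of LOCA's local z-scoring idea: each "burst" of repeated measurements
# is an isotropic cloud of known scale sigma in latent space; the encoder is
# trained so each embedded burst is again isotropic at that scale, which
# (together with reconstruction) rectifies the unknown deformation.
import torch
import torch.nn as nn

d = 2
enc = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, d))
dec = nn.Sequential(nn.Linear(d, 64), nn.Tanh(), nn.Linear(64, 2))
sigma = 0.05

def bursts(n=128, k=32):                   # latent centers plus isotropic clouds,
    z = torch.rand(n, 1, 2) + sigma * torch.randn(n, k, 2)
    return torch.stack([z[..., 0] * torch.exp(z[..., 1]), z[..., 1]], dim=-1)

def loca_loss(y):
    e = enc(y.reshape(-1, 2)).reshape(y.shape[0], -1, d)
    mu = e.mean(dim=1, keepdim=True)
    cov = (e - mu).transpose(1, 2) @ (e - mu) / (e.shape[1] - 1)
    white = ((cov / sigma**2 - torch.eye(d)) ** 2).sum(dim=(1, 2)).mean()
    recon = ((dec(enc(y.reshape(-1, 2))) - y.reshape(-1, 2)) ** 2).mean()
    return white + recon

opt = torch.optim.Adam([*enc.parameters(), *dec.parameters()], lr=1e-3)
for _ in range(500):
    opt.zero_grad(); loss = loca_loss(bursts()); loss.backward(); opt.step()
```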
Large collections of coupled, heterogeneous agents can manifest complex dynamical behavior presenting difficulties for simulation and analysis. However, if the collective dynamics lie on a low-dimensional manifold, then the original agent-based model may be approximated with a simplified surrogate model on and near the low-dimensional space where the dynamics live. Analytically identifying such simplified models can be challenging or impossible; here we present a data-driven coarse-graining methodology for discovering such reduced models. We consider two types of reduced models: globally based models, which predict an agent's dynamics using information from the whole ensemble, and locally based models, which use information from just a subset of agents close to a given agent (close in heterogeneity space, not physical space). For both approaches, we are able to learn laws governing the behavior of the reduced...
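The contrast between the two model types can be sketched as follows on synthetic data (the linear fits and toy dynamics are illustrative placeholders for the learned laws):

```python
# Sketch: a global reduced model sees ensemble summary statistics, while a
# local reduced model sees only the k nearest agents in heterogeneity space.
import numpy as np
from sklearn.neighbors import NearestNeighbors
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(3)
omega = np.sort(rng.uniform(0.8, 1.2, 300))      # heterogeneity per agent
x = np.cos(3.0 * omega) + 0.01 * rng.normal(size=300)
dxdt = -x + np.tanh(x.mean())                    # toy coupled dynamics

# global model: features are an agent's own state plus ensemble averages
Xg = np.column_stack([x, np.full_like(x, x.mean()), np.full_like(x, (x**2).mean())])
global_model = LinearRegression().fit(Xg, dxdt)

# local model: features are the states of the k nearest agents in omega
k = 7
nbr = NearestNeighbors(n_neighbors=k).fit(omega[:, None])
idx = nbr.kneighbors(omega[:, None], return_distance=False)
local_model = LinearRegression().fit(x[idx], dxdt)
print(global_model.score(Xg, dxdt), local_model.score(x[idx], dxdt))
```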
We study the meta-learning of numerical algorithms for scientific computing, which combines the mathematically driven, handcrafted design of general algorithm structure with a data-driven adaptation to specific classes of tasks. This represents a departure from classical approaches in numerical analysis, which typically do not feature such learning-based adaptations. As a case study, we develop a machine learning approach that automatically learns effective solvers for initial value problems in the form of ordinary differential equations (ODEs), based on the Runge-Kutta (RK) integrator architecture. By combining neural network approximations and meta-learning, we show that we can obtain high-order integrators for targeted families of differential equations without the need for computing integrator coefficients by hand. Moreover, we demonstrate that in certain cases we can obtain superior performance to classical RK methods. This can be attributed to certain properties of the ODE...
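A minimal sketch of the idea: fix the explicit RK structure (here Kutta's three-stage tableau) and learn only the quadrature weights b against a target family of ODEs; the linear test family and training details are illustrative, not the paper's setup.

```python
# Sketch: an explicit Runge-Kutta step with learnable weights b, trained so a
# single step matches the exact solution on a toy family x' = lam * x.
import torch

s = 3
c = [0.0, 0.5, 1.0]                       # fixed nodes (for documentation)
a = [[], [0.5], [-1.0, 2.0]]              # fixed stage couplings (Kutta's tableau)
b = torch.nn.Parameter(torch.full((s,), 1.0 / s))   # learned quadrature weights

def rk_step(f, x, h):
    k = []
    for i in range(s):
        xi = x + h * sum(a[i][j] * k[j] for j in range(i))
        k.append(f(xi))
    return x + h * sum(bi * ki for bi, ki in zip(b, k))

opt = torch.optim.Adam([b], lr=1e-2)
h = 0.1
for _ in range(500):
    lam = -5.0 * torch.rand(64, 1)        # target family: x' = lam x, lam in (-5, 0)
    x0 = torch.randn(64, 1)
    pred = rk_step(lambda x: lam * x, x0, h)
    exact = torch.exp(lam * h) * x0
    opt.zero_grad()
    loss = ((pred - exact) ** 2).mean()
    loss.backward()
    opt.step()
print("learned b:", b.detach())           # compare the classical b = [1/6, 2/3, 1/6]
```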
We propose to test, and when possible establish, an equivalence between two different artificial neural networks by attempting to construct a data-driven transformation between them, using manifold-learning techniques. In particular, we employ diffusion maps with a Mahalanobis-like metric. If the construction succeeds, the two networks can be thought of as belonging to the same equivalence class. We first discuss transformation functions between only the outputs of the two networks; we then also consider transformations that take into account outputs (activations) of a number of internal neurons from each network. In general, Whitney's theorem dictates the number of measurements from one of the networks required to reconstruct each and every feature of the second network. The construction of the transformation function relies on a consistent, intrinsic representation of the network input space. We illustrate our algorithm by matching neural network pairs trained to learn (a) obs...
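At the output level, the test can be sketched as fitting a transformation g with g(f1(x)) ≈ f2(x) on shared inputs; the stand-in networks below are untrained and purely illustrative, and the Mahalanobis-metric diffusion-map representation of the input space is not shown.

```python
# Sketch of the matching question at the output level: a small residual after
# fitting g suggests the two networks belong to the same equivalence class.
import torch
import torch.nn as nn

def net(seed):
    torch.manual_seed(seed)
    return nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 1))

f1, f2 = net(0), net(1)      # stand-ins for two independently trained networks
g = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 1))

x = torch.linspace(-2, 2, 512).unsqueeze(1)
opt = torch.optim.Adam(g.parameters(), lr=1e-3)
for _ in range(1000):
    opt.zero_grad()
    loss = ((g(f1(x)) - f2(x)) ** 2).mean()
    loss.backward()
    opt.step()
print("matching residual:", loss.item())
```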
We extract data-driven, intrinsic spatial coordinates from observations of the dynamics of large systems of coupled heterogeneous agents. These coordinates then serve as an emergent space in which to learn predictive models in the form of partial differential equations (PDEs) for the collective description of the coupled-agent system. They play the role of the independent spatial variables in these PDEs (as opposed to the dependent, possibly also data-driven, state variables). This leads to an alternative description of the dynamics, local in these emergent coordinates, thus facilitating an alternative modeling path for complex coupled-agent systems. We illustrate this approach on a system where each agent is a limit cycle oscillator (a so-called Stuart-Landau oscillator); the agents are heterogeneous (they each have a different intrinsic frequency ω) and are coupled through the ensemble average of their respective variables. After fast initial transients, we show that the collective...
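For reference, a minimal simulation of the example system (parameter values are illustrative):

```python
# Sketch: N heterogeneous Stuart-Landau oscillators coupled through their
# ensemble average; snapshots after the fast transients feed the manifold
# learning that extracts the emergent spatial coordinate.
import numpy as np
from scipy.integrate import solve_ivp

N, K = 256, 0.6
omega = np.linspace(0.7, 1.3, N)          # distinct intrinsic frequencies

def rhs(t, z):                            # dz/dt = (1 + i*omega) z - |z|^2 z + coupling
    return (1.0 + 1j * omega) * z - np.abs(z) ** 2 * z + K * (z.mean() - z)

rng = np.random.default_rng(4)
z0 = np.exp(2j * np.pi * rng.random(N))   # random initial phases on the unit circle
sol = solve_ivp(rhs, (0, 200), z0, t_eval=np.linspace(100, 200, 500))
snapshots = sol.y.T                       # (time, agent) complex states, post-transient
```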