Dynamic Programming Matrix Multiplication

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

CUDA-L2 is a system that combines large language models (LLMs) and reinforcement learning (RL) to automatically optimize Half-precision General Matrix Multiply (HGEMM) CUDA kernels. CUDA-L2 ...

C&EN

Hierarchical Reinforcement Learning with Dynamic Meta Agent for Adaptive Cut Selection in Integer Programming with Applications to Sensor Network Design

Department of Chemical Engineering, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India Indian Institute of Technology Delhi-Abu Dhabi, Khalifa City B 20010, Abu Dhabi, UAE ...

IEEE

Decentralized Sparse Matrix Multiplication Under Byzantine Attacks

Abstract: Distributed computations, such as distributed matrix multiplication, can be vulnerable to significant security issues, notably Byzantine attacks. These attacks may target either worker nodes ...

GitHub

Slangpy Math library matrix multiplication incomplete

2. mul(x: slangpy.math.float2x2, y: slangpy.math.float2) -> slangpy.math.float2 3. mul(x: slangpy.math.float2, y: slangpy.math.float2x2) -> slangpy.math.float2 4. mul ...

Frontiers

Optimal portfolio selection in jump-uncertain stochastic markets via maximum principle and dynamic programming

1 Department of Statistics and Mathematics, Bindura University of Science Education, Bindura, Zimbabwe 2 Department of Mathematics, University of Botswana, Gaborone, Botswana This study develops a ...

IEEE

Robust Controllability of Boolean Control Networks via Dynamic Programming

Abstract: This article presents a novel dynamic programming approach to determine the robust controllability of Boolean control networks (BCNs) subject to stochastic disturbances. By applying ...

Frontiers

Numerical simulation of the dynamic evolution of effective stress and pore water pressure within a rock matrix

The dynamic evolution of effective stress and pore water pressure is a key scientific issue of study in the field of seepage-stress coupling in geotechnical engineering. To address the imbalance in ...

Scientific Research Publishing

Puterman, M.L. (2014) Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons.

ABSTRACT: Offline reinforcement learning (RL) focuses on learning policies using static datasets without further exploration. With the introduction of distributional reinforcement learning into ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results