CUDA-L2 is a system that combines large language models (LLMs) and reinforcement learning (RL) to automatically optimize Half-precision General Matrix Multiply (HGEMM) CUDA kernels. CUDA-L2 ...
Department of Chemical Engineering, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India Indian Institute of Technology Delhi-Abu Dhabi, Khalifa City B 20010, Abu Dhabi, UAE ...
Abstract: Distributed computations, such as distributed matrix multiplication, can be vulnerable to significant security issues, notably Byzantine attacks. These attacks may target either worker nodes ...
2. mul(x: slangpy.math.float2x2, y: slangpy.math.float2) -> slangpy.math.float2 3. mul(x: slangpy.math.float2, y: slangpy.math.float2x2) -> slangpy.math.float2 4. mul ...
1 Department of Statistics and Mathematics, Bindura University of Science Education, Bindura, Zimbabwe 2 Department of Mathematics, University of Botswana, Gaborone, Botswana This study develops a ...
Abstract: This article presents a novel dynamic programming approach to determine the robust controllability of Boolean control networks (BCNs) subject to stochastic disturbances. By applying ...
The dynamic evolution of effective stress and pore water pressure is a key scientific issue of study in the field of seepage-stress coupling in geotechnical engineering. To address the imbalance in ...
ABSTRACT: Offline reinforcement learning (RL) focuses on learning policies using static datasets without further exploration. With the introduction of distributional reinforcement learning into ...