Tim's Arxiv FrontPage Generated on 2024-05-03. This frontpage is generated by scraping new papers on Arxiv and using an embedding model to find papers matching topics I'm interested in. Currently, the false positive rate is fairly high. The repo is here. Forked and customized from this project
	Artificial General Intelligence
2024-05-02	Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space The ability to autonomously assemble structures is crucial for the development of future space infrastructure.However, the unpredictable conditions of space pose significant challenges for robotic systems, necessitating the development of advanced learning techniques to enable autonomous assembly.In this study, we present a novel approach for learning autonomous peg-in-hole assembly in the context of space robotics.Our focus is on enhancing the generalization and adaptability of autonomous systems through deep reinforcement learning. 0.847By integrating procedural generation and domain randomization, we train agents in a highly parallelized simulation environment across a spectrum of diverse scenarios with the aim of acquiring a robust policy.The proposed approach is evaluated using three distinct reinforcement learning algorithms to investigate the trade-offs among various paradigms.We demonstrate the adaptability of our agents to novel scenarios and assembly sequences while emphasizing the potential of leveraging advanced simulation techniques for robot learning in space.Our findings set the stage for future advancements in intelligent robotic systems capable of supporting ambitious space missions and infrastructure development beyond Earth. link
2024-05-02	Neural-Parareal: Dynamically Training Neural Operators as Coarse Solvers for Time-Parallelisation of Fusion MHD Simulations The fusion research facility ITER is currently being assembled to demonstrate that fusion can be used for industrial energy production, while several other programmes across the world are also moving forward, such as EU-DEMO, CFETR, SPARC and STEP.The high engineering complexity of a tokamak makes it an extremely challenging device to optimise, and test-based optimisation would be too slow and too costly.Instead, digital design and optimisation must be favored, which requires strongly-coupled suites of High-Performance Computing calculations.In this context, having surrogate models to provide quick estimates with uncertainty quantification is essential to explore and optimise new design options.Furthermore, these surrogates can in turn be used to accelerate simulations in the first place.This is the case of Parareal, a time-parallelisation method that can speed-up large HPC simulations, where the coarse-solver can be replaced by a surrogate.A novel framework, Neural-Parareal, is developed to integrate the training of neural operators dynamically as more data becomes available.For a given input-parameter domain, as more simulations are being run with Parareal, the large amount of data generated by the algorithm is used to train new surrogate models to be used as coarse-solvers for future Parareal simulations, leading to progressively more accurate coarse-solvers, and thus higher speed-up.It is found that such neural network surrogates can be much more effective than traditional coarse-solver in providing a speed-up with Parareal.This study is a demonstration of the convergence of HPC and AI which simply has to become common practice in the world of digital engineering design. 0.828 link
2024-05-02	GAIA: A General AI Assistant for Intelligent Accelerator Operations Large-scale machines like particle accelerators are usually run by a team of experienced operators.In case of a particle accelerator, these operators possess suitable background knowledge on both accelerator physics and the technology comprising the machine.Due to the complexity of the machine, particular subsystems of the machine are taken care of by experts, who the operators can turn to.In this work the reasoning and action (ReAct) prompting paradigm is used to couple an open-weights large language model (LLM) with a high-level machine control system framework and other tools, e.g. the electronic logbook or machine design documentation.By doing so, a multi-expert retrieval augmented generation (RAG) system is implemented, which assists operators in knowledge retrieval tasks, interacts with the machine directly if needed, or writes high level control system scripts. 0.825This consolidation of expert knowledge and machine interaction can simplify and speed up machine operation tasks for both new and experienced human operators. link
2024-05-02	Student Reflections on Self-Initiated GenAI Use in HCI Education This study explores students' self-initiated use of Generative Artificial Intelligence (GenAI) tools in an interactive systems design class. 0.872Through 12 group interviews, students revealed the dual nature of GenAI in (1) stimulating creativity and (2) speeding up design iterations, alongside concerns over its potential to cause shallow learning and reliance. 0.842GenAI's benefits were pronounced in the execution phase of design, aiding rapid prototyping and ideation, while its use in initial insight generation posed risks to depth and reflective practice.This reflection highlights the complex role of GenAI in Human-Computer Interaction education, emphasizing the need for balanced integration to leverage its advantages without compromising fundamental learning outcomes. 0.835 link
	Complex Systems
2024-05-02	Liénard Type Nonlinear Oscillators and Quantum Solvability Li\'{e}nard-type nonlinear oscillators with linear and nonlinear damping terms exhibit diverse dynamical behavior in both the classical and quantum regimes.In this paper, we consider examples of various one-dimensional Li\'{e}nard type-I and type-II oscillators.The associated Euler-Lagrange equations are divided into groups based on the characteristics of the damping and forcing terms.The Li\'{e}nard type-I oscillators often display localized solutions, isochronous and non-isochronous oscillations and are also precisely solvable in quantum mechanics in general, where the ordering parameters play an important role.These include Mathews-Lakshmanan and Higgs oscillators.However, the classical solutions of some of the nonlinear oscillators are expressed in terms of elliptic functions and have been found to be quasi-exactly solvable in the quantum region.The three-dimensional generalizations of these classical systems add more degrees of freedom, which show complex dynamics. 0.843Their quantum equivalents are also explored in this article.The isotonic generalizations of the non-isochronous nonlinear oscillators have also been solved both classically and quantum mechanically to advance the studies.The modified Emden equation categorized as Li\'{e}nard type-II exhibits isochronous oscillations at the classical level.This property makes it a valuable tool for studying the underlying nonlinear dynamics.The study on the quantum counterpart of the system provides a deeper understanding of the behavior in the quantum realm as a typical PT-symmetric system. link
2024-05-02	A Survey on Semantic Communication Networks: Architecture, Security, and Privacy Semantic communication, emerging as a breakthrough beyond the classical Shannon paradigm, aims to convey the essential meaning of source data rather than merely focusing on precise yet content-agnostic bit transmission.By interconnecting diverse intelligent agents (e.g., autonomous vehicles and VR devices) via semantic communications, the semantic communication networks (SemComNet) supports semantic-oriented transmission, efficient spectrum utilization, and flexible networking among collaborative agents.Consequently, SemComNet stands out for enabling ever-increasing intelligent applications, such as autonomous driving and Metaverse.However, being built on a variety of cutting-edge technologies including AI and knowledge graphs, SemComNet introduces diverse brand-new and unexpected threats, which pose obstacles to its widespread development.Besides, due to the intrinsic characteristics of SemComNet in terms of heterogeneous components, autonomous intelligence, and large-scale structure, a series of critical challenges emerge in securing SemComNet. 0.821In this paper, we provide a comprehensive and up-to-date survey of SemComNet from its fundamentals, security, and privacy aspects.Specifically, we first introduce a novel three-layer architecture of SemComNet for multi-agent interaction, which comprises the control layer, semantic transmission layer, and cognitive sensing layer.Then, we discuss its working modes and enabling technologies.Afterward, based on the layered architecture of SemComNet, we outline a taxonomy of security and privacy threats, while discussing state-of-the-art defense approaches.Finally, we present future research directions, clarifying the path toward building intelligent, robust, and green SemComNet.To our knowledge, this survey is the first to comprehensively cover the fundamentals of SemComNet, alongside a detailed analysis of its security and privacy issues. link
2024-05-02	GRBoondi: A code for evolving Generalized Proca theories on arbitrary backgrounds While numerical simulations offer unparalleled precision and robustness in studying complex physical systems, their execution is often hindered by complexity, costliness, and time consumption due to the intricate equations involved. 0.82This challenge is already encountered in General Relativity, where non-flat spacetimes exacerbate the computational burden.This complexity is further intensified when dealing with additional degrees of freedom. 0.82To address this challenge head-on, we introduce GRBoondi, a groundbreaking fixed-background numerical relativity code designed to provide a unified interface for numerically solving Generalized Proca theories.GRBoondi grants users the ability to make arbitrary modifications to the Proca equations of motion on any background, providing a robust and versatile tool for exploring diverse classes of Generalized Proca theories.This letter serves as part of the submission of GRBoondi to the Journal of Open Source Software.For access to the code, please visit https://github.com/ShaunFell/GRBoondi.git. link
2024-05-02	Information propagation in Gaussian processes on multilayer networks Complex systems with multiple processes evolving on different temporal scales are naturally described by multilayer networks, where each layer represents a different timescale. 0.864In this work, we show how the multilayer structure shapes the generation and propagation of information between layers.We derive a general decomposition of the multilayer probability for continuous stochastic processes described by Fokker-Planck operators.In particular, we focus on Gaussian processes, for which this solution can be obtained analytically.By explicitly computing the mutual information between the layers, we derive the fundamental principles that govern how information is propagated by the topology of the multilayer network.In particular, we unravel how edges between nodes in different layers affect their functional couplings.We find that interactions from fast to slow layers alone do not generate information, leaving the layers statistically independent even if they affect their dynamical evolution.On the other hand, interactions from slow to fast nodes lead to non-zero mutual information, which can then be propagated along specific paths of interactions between layers.We employ our results to study the interplay between information and instability, identifying the critical layers that drive information when pushed to the edge of stability.Our work generalizes previous results obtained in the context of discrete stochastic processes, allowing us to understand how the multilayer nature of complex systems affects their functional structure. 0.827 link
	Decision Making Under Uncertainty
2024-05-02	Uncertainty-aware self-training with expectation maximization basis transformation Self-training is a powerful approach to deep learning.The key process is to find a pseudo-label for modeling.However, previous self-training algorithms suffer from the over-confidence issue brought by the hard labels, even some confidence-related regularizers cannot comprehensively catch the uncertainty.Therefore, we propose a new self-training framework to combine uncertainty information of both model and dataset.Specifically, we propose to use Expectation-Maximization (EM) to smooth the labels and comprehensively estimate the uncertainty information.We further design a basis extraction network to estimate the initial basis from the dataset.The obtained basis with uncertainty can be filtered based on uncertainty information. 0.825It can then be transformed into the real hard label to iteratively update the model and basis in the retraining process.Experiments on image classification and semantic segmentation show the advantages of our methods among confidence-aware self-training algorithms with 1-3 percentage improvement on different datasets. link
2024-05-02	Uncertainty for Active Learning on Graphs Uncertainty Sampling is an Active Learning strategy that aims to improve the data efficiency of machine learning models by iteratively acquiring labels of data points with the highest uncertainty. 0.827While it has proven effective for independent data its applicability to graphs remains under-explored.We propose the first extensive study of Uncertainty Sampling for node classification: (1) We benchmark Uncertainty Sampling beyond predictive uncertainty and highlight a significant performance gap to other Active Learning strategies.(2) We develop ground-truth Bayesian uncertainty estimates in terms of the data generating process and prove their effectiveness in guiding Uncertainty Sampling toward optimal queries.We confirm our results on synthetic data and design an approximate approach that consistently outperforms other uncertainty estimators on real datasets.(3) Based on this analysis, we relate pitfalls in modeling uncertainty to existing methods.Our analysis enables and informs the development of principled uncertainty estimation on graphs. link
	Reinforcement Learning
2024-05-02	Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space The ability to autonomously assemble structures is crucial for the development of future space infrastructure.However, the unpredictable conditions of space pose significant challenges for robotic systems, necessitating the development of advanced learning techniques to enable autonomous assembly.In this study, we present a novel approach for learning autonomous peg-in-hole assembly in the context of space robotics.Our focus is on enhancing the generalization and adaptability of autonomous systems through deep reinforcement learning. 0.851By integrating procedural generation and domain randomization, we train agents in a highly parallelized simulation environment across a spectrum of diverse scenarios with the aim of acquiring a robust policy.The proposed approach is evaluated using three distinct reinforcement learning algorithms to investigate the trade-offs among various paradigms. 0.836We demonstrate the adaptability of our agents to novel scenarios and assembly sequences while emphasizing the potential of leveraging advanced simulation techniques for robot learning in space.Our findings set the stage for future advancements in intelligent robotic systems capable of supporting ambitious space missions and infrastructure development beyond Earth. link
2024-05-02	Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies Reinforcement learning policies are typically represented by black-box neural networks, which are non-interpretable and not well-suited for safety-critical domains. 0.83To address both of these issues, we propose constrained normalizing flow policies as interpretable and safe-by-construction policy models.We achieve safety for reinforcement learning problems with instantaneous safety constraints, for which we can exploit domain knowledge by analytically constructing a normalizing flow that ensures constraint satisfaction.The normalizing flow corresponds to an interpretable sequence of transformations on action samples, each ensuring alignment with respect to a particular constraint.Our experiments reveal benefits beyond interpretability in an easier learning objective and maintained constraint satisfaction throughout the entire learning process.Our approach leverages constraints over reward engineering while offering enhanced interpretability, safety, and direct means of providing domain knowledge to the agent without relying on complex reward functions. 0.828 link
2024-05-02	Reinforcement Learning for Edit-Based Non-Autoregressive Neural Machine Translation Non-autoregressive (NAR) language models are known for their low latency in neural machine translation (NMT).However, a performance gap exists between NAR and autoregressive models due to the large decoding space and difficulty in capturing dependency between target words accurately.Compounding this, preparing appropriate training data for NAR models is a non-trivial task, often exacerbating exposure bias.To address these challenges, we apply reinforcement learning (RL) to Levenshtein Transformer, a representative edit-based NAR model, demonstrating that RL with self-generated data can enhance the performance of edit-based NAR models. 0.832We explore two RL approaches: stepwise reward maximization and episodic reward maximization. 0.837We discuss the respective pros and cons of these two approaches and empirically verify them.Moreover, we experimentally investigate the impact of temperature setting on performance, confirming the importance of proper temperature setting for NAR models' training. link
2024-05-02	Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning The existing Motion Imitation models typically require expert data obtained through MoCap devices, but the vast amount of training data needed is difficult to acquire, necessitating substantial investments of financial resources, manpower, and time.This project combines 3D human pose estimation with reinforcement learning, proposing a novel model that simplifies Motion Imitation into a prediction problem of joint angle values in reinforcement learning.This significantly reduces the reliance on vast amounts of training data, enabling the agent to learn an imitation policy from just a few seconds of video and exhibit strong generalization capabilities.It can quickly apply the learned policy to imitate human arm motions in unfamiliar videos.The model first extracts skeletal motions of human arms from a given video using 3D human pose estimation.These extracted arm motions are then morphologically retargeted onto a robotic manipulator.Subsequently, the retargeted motions are used to generate reference motions.Finally, these reference motions are used to formulate a reinforcement learning problem, enabling the agent to learn a policy for imitating human arm motions. 0.826This project excels at imitation tasks and demonstrates robust transferability, accurately imitating human arm motions from other unfamiliar videos.This project provides a lightweight, convenient, efficient, and accurate Motion Imitation model.While simplifying the complex process of Motion Imitation, it achieves notably outstanding performance. link
2024-05-02	Learning Force Control for Legged Manipulation Controlling contact forces during interactions is critical for locomotion and manipulation tasks.While sim-to-real reinforcement learning (RL) has succeeded in many contact-rich problems, current RL methods achieve forceful interactions implicitly without explicitly regulating forces. 0.831We propose a method for training RL policies for direct force control without requiring access to force sensing.We showcase our method on a whole-body control platform of a quadruped robot with an arm.Such force control enables us to perform gravity compensation and impedance control, unlocking compliant whole-body manipulation.The learned whole-body controller with variable compliance makes it intuitive for humans to teleoperate the robot by only commanding the manipulator, and the robot's body adjusts automatically to achieve the desired position and force.Consequently, a human teleoperator can easily demonstrate a wide variety of loco-manipulation tasks.To the best of our knowledge, we provide the first deployment of learned whole-body force control in legged manipulators, paving the way for more versatile and adaptable legged robots. link
	Trajectory Optimization
2024-05-02	MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving This paper introduces a trajectory prediction model tailored for autonomous driving, focusing on capturing complex interactions in dynamic traffic scenarios without reliance on high-definition maps. 0.825The model, termed MFTraj, harnesses historical trajectory data combined with a novel dynamic geometric graph-based behavior-aware module.At its core, an adaptive structure-aware interactive graph convolutional network captures both positional and behavioral features of road users, preserving spatial-temporal intricacies.Enhanced by a linear attention mechanism, the model achieves computational efficiency and reduced parameter overhead.Evaluations on the Argoverse, NGSIM, HighD, and MoCAD datasets underscore MFTraj's robustness and adaptability, outperforming numerous benchmarks even in data-challenged scenarios without the need for additional information such as HD maps or vectorized maps.Importantly, it maintains competitive performance even in scenarios with substantial missing data, on par with most existing state-of-the-art models.The results and methodology suggest a significant advancement in autonomous driving trajectory prediction, paving the way for safer and more efficient autonomous systems. link
2024-05-02	Non-iterative Optimization of Trajectory and Radio Resource for Aerial Network We address a joint trajectory planning, user association, resource allocation, and power control problem to maximize proportional fairness in the aerial IoT network, considering practical end-to-end quality-of-service (QoS) and communication schedules.Though the problem is rather ancient, apart from the fact that the previous approaches have never considered user- and time-specific QoS, we point out a prevalent mistake in coordinate optimization approaches adopted by the majority of the literature.Coordinate optimization approaches, which repetitively optimize radio resources for a fixed trajectory and vice versa, generally converge to local optima when all variables are differentiable. 0.858However, these methods often stagnate at a non-stationary point, significantly degrading the network utility in mixed-integer problems such as joint trajectory and radio resource optimization.We detour this problem by converting the formulated problem into the Markov decision process (MDP).Exploiting the beneficial characteristics of the MDP, we design a non-iterative framework that cooperatively optimizes trajectory and radio resources without initial trajectory choice.The proposed framework can incorporate various trajectory planning algorithms such as the genetic algorithm, tree search, and reinforcement learning. 0.839Extensive comparisons with diverse baselines verify that the proposed framework significantly outperforms the state-of-the-art method, nearly achieving the global optimum.Our implementation code is available at https://github.com/hslyu/dbspf. link
	Active Inference
2024-05-02	Customizing Text-to-Image Models with a Single Image Pair Art reinterpretation is the practice of creating a variation of a reference work, making a paired artwork that exhibits a distinct artistic style.We ask if such an image pair can be used to customize a generative model to capture the demonstrated stylistic difference.We propose Pair Customization, a new customization method that learns stylistic difference from a single image pair and then applies the acquired style to the generation process.Unlike existing methods that learn to mimic a single concept from a collection of images, our method captures the stylistic difference between paired images.This allows us to apply a stylistic change without overfitting to the specific image content in the examples.To address this new task, we employ a joint optimization method that explicitly separates the style and content into distinct LoRA weight spaces.We optimize these style and content weights to reproduce the style and content images while encouraging their orthogonality.During inference, we modify the diffusion process via a new style guidance based on our learned weights. 0.821Both qualitative and quantitative experiments show that our method can effectively learn style while avoiding overfitting to image content, highlighting the potential of modeling such stylistic differences from a single image pair. link

Tim's Arxiv FrontPage

Artificial General Intelligence

Complex Systems

Decision Making Under Uncertainty

Reinforcement Learning

Trajectory Optimization

Active Inference