menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

ML News

source image

Arxiv

6d

read

222

img
dot

Image Credit: Arxiv

Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem

  • In this work, a generalized restless multi-arm bandit problem with risk-awareness is addressed.
  • Indexability conditions for risk-aware objective are established and a solution based on Whittle index is provided.
  • A Thompson sampling approach is proposed for the learning problem with unknown transition probabilities, achieving bounded regret.
  • Numerical experiments illustrate the efficacy of the method in reducing risk exposure in various applications.

Read Full Article

like

13 Likes

source image

Arxiv

6d

read

172

img
dot

Image Credit: Arxiv

K-Means Clustering With Incomplete Data with the Use of Mahalanobis Distances

  • Effectively applying the K-means algorithm to clustering tasks with incomplete features remains an important research area.
  • Recent work has shown that unifying K-means clustering and imputation into one single objective function yields superior results.
  • In this work, a unified K-means algorithm that incorporates Mahalanobis distances is proposed.
  • Extensive experiments demonstrate that the proposed algorithm consistently outperforms existing approaches in handling incomplete data.

Read Full Article

like

10 Likes

source image

Arxiv

6d

read

308

img
dot

Image Credit: Arxiv

MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory

  • Significant advances have been made in developing general-purpose embodied AI in environments like Minecraft through the adoption of LLM-augmented hierarchical approaches.
  • Low-level controllers frequently become performance bottlenecks due to repeated failures, which is primarily caused by the absence of an episodic memory system.
  • MrSteve (Memory Recall Steve) is a novel low-level controller equipped with Place Event Memory (PEM) that captures what, where, and when information from episodes, addressing the limitations of previous models.
  • The proposed Exploration Strategy and Memory-Augmented Task Solving Framework improves task-solving and exploration efficiency compared to existing methods.

Read Full Article

like

18 Likes

source image

Arxiv

6d

read

292

img
dot

Image Credit: Arxiv

Embedding Byzantine Fault Tolerance into Federated Learning via Consistency Scoring

  • Federated learning (FL) enables training a shared model without transmitting private data to a central server.
  • FL is vulnerable to Byzantine attacks from compromised edge devices, degrading model performance.
  • The proposed plugin integrates into existing FL techniques to achieve Byzantine resilience.
  • By generating virtual data samples and evaluating model consistency scores, compromised devices can be filtered out, maintaining the benefits of FL.

Read Full Article

like

17 Likes

source image

Arxiv

6d

read

45

img
dot

Image Credit: Arxiv

Space to Policy: Scalable Brick Kiln Detection and Automatic Compliance Monitoring with Geospatial Data

  • Air pollution kills 7 million people annually.
  • The brick kiln sector contributes to economic development but also accounts for 8-14% of air pollution in India.
  • Researchers have developed a scalable machine-learning pipeline to detect and classify brick kilns using satellite imagery.
  • Automated compliance analysis based on government policies has been performed, and the study emphasizes the need for inclusive policies balancing environmental sustainability and the livelihoods of workers.

Read Full Article

like

2 Likes

source image

Arxiv

6d

read

74

img
dot

Image Credit: Arxiv

Achieving Group Fairness through Independence in Predictive Process Monitoring

  • Predictive process monitoring focuses on forecasting future states of ongoing process executions.
  • This work addresses group fairness in predictive process monitoring by investigating independence and ensuring predictions are unaffected by sensitive group membership.
  • The study explores independence through metrics such as demographic parity and threshold-independent distribution-based alternatives.
  • The effectiveness of fairness metrics and composite loss functions is validated through a controlled experimental setup.

Read Full Article

like

4 Likes

source image

Arxiv

6d

read

12

img
dot

Image Credit: Arxiv

Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction

  • Efficient Fine-Tuning of Single-Cell Foundation Models Enables Zero-Shot Molecular Perturbation Prediction
  • Researchers present a study on predicting transcriptional responses to novel drugs for biomedical research and drug discovery.
  • They leverage single-cell foundation models pre-trained on a wide range of single cells, allowing molecular conditioning while preserving biological representation.
  • The approach enables efficient fine-tuning and zero-shot generalization to unseen cell lines, demonstrating state-of-the-art results in model performance.

Read Full Article

like

Like

source image

Arxiv

6d

read

20

img
dot

Image Credit: Arxiv

Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

  • Graph Neural Networks (GNNs) are widely used in graph data mining tasks.
  • Traditional GNNs face limitations in the receptive field during message passing processes.
  • A new model called Tokenphormer utilizes fine-grained token-based representation learning to capture local and structural information.
  • Tokenphormer achieves state-of-the-art performance on node classification tasks.

Read Full Article

like

1 Like

source image

Arxiv

6d

read

185

img
dot

Image Credit: Arxiv

Real-Time Decision-Making for Digital Twin in Additive Manufacturing with Model Predictive Control using Time-Series Deep Neural Networks

  • This paper presents a Model Predictive Control (MPC) framework using a deep neural network for real-time decision-making in additive manufacturing.
  • The framework, named Time-Series Dense Encoder (TiDE), can predict future states within the prediction horizon in one shot, accelerating the MPC process.
  • Using Directed Energy Deposition (DED) additive manufacturing as a case study, the MPC achieves precise temperature tracking and melt pool depth control.
  • Compared to a PID controller, the MPC provides smoother laser power profiles with competitive or better melt pool temperature control performance.

Read Full Article

like

11 Likes

source image

Arxiv

6d

read

78

img
dot

Image Credit: Arxiv

Logarithmic Regret for Nonlinear Control

  • Researchers have addressed the problem of learning to control an unknown nonlinear dynamical system through sequential interactions.
  • They focus on achieving fast sequential learning in continuous control problems where the system dynamics depend smoothly on unknown parameters.
  • The study demonstrates that fast sequential learning is attainable if the optimal control policy is persistently exciting.
  • Additionally, they derive a regret bound which grows with the square root of the number of interactions for non-persistently exciting optimal policies.

Read Full Article

like

4 Likes

source image

Arxiv

6d

read

24

img
dot

Image Credit: Arxiv

No Equations Needed: Learning System Dynamics Without Relying on Closed-Form ODEs

  • Data-driven modeling of dynamical systems is a crucial area of machine learning.
  • A new approach called direct semantic modeling is proposed, which predicts the behavior of a dynamical system directly from data.
  • This approach bypasses the need for complex post-hoc analysis and enhances the transparency and flexibility of the resulting models.
  • The direct semantic modeling approach simplifies the modeling pipeline and allows for intuitive inductive biases and direct editing of the model's behavior.

Read Full Article

like

1 Like

source image

Arxiv

6d

read

242

img
dot

Image Credit: Arxiv

Wasserstein Gradient Flows for Moreau Envelopes of f-Divergences in Reproducing Kernel Hilbert Spaces

  • Researchers propose a regularization approach for f-divergences using Moreau envelopes in reproducing kernel Hilbert spaces.
  • The regularization is achieved by employing a squared maximum mean discrepancy (MMD) associated with a characteristic kernel K.
  • The authors analyze the gradients of MMD-regularized f-divergences and study Wasserstein gradient flows based on their findings.
  • Proof-of-concept numerical examples are provided for flows starting from empirical measures.

Read Full Article

like

14 Likes

source image

Arxiv

6d

read

152

img
dot

Image Credit: Arxiv

Optimal Rates and Saturation for Noiseless Kernel Ridge Regression

  • Kernel ridge regression (KRR) is a fundamental method for learning functions from finite samples.
  • A comprehensive study of KRR in the noiseless regime reveals optimal convergence rates determined by eigenvalue decay and target function's smoothness.
  • KRR exhibits extra-smoothness compared to typical functions in the native reproducing kernel Hilbert space (RKHS).
  • A novel error bound for noisy KRR achieves minimax optimality in both noiseless and noisy regimes.

Read Full Article

like

9 Likes

source image

Arxiv

6d

read

176

img
dot

Image Credit: Arxiv

Understanding Optimal Feature Transfer via a Fine-Grained Bias-Variance Analysis

  • Researchers have conducted a study on transfer learning to optimize downstream performance.
  • They introduced a simple linear model that utilizes a pretrained feature transform.
  • The researchers derived the exact asymptotics of the downstream risk and its fine-grained bias-variance decomposition.
  • The study revealed that the optimal featurization is naturally sparse and undergoes a phase transition from hard selection to soft selection of relevant features.

Read Full Article

like

10 Likes

source image

Arxiv

6d

read

314

img
dot

Image Credit: Arxiv

Faster Linear Systems and Matrix Norm Approximation via Multi-level Sketched Preconditioning

  • Researchers have developed a new class of preconditioned iterative methods for solving linear systems, resulting in faster runtimes for various linear algebraic problems.
  • The methods involve constructing a low-rank Nyström approximation to the matrix using sparse random matrix sketching.
  • The approximation is then used to create a preconditioner, which is inverted quickly using additional levels of random sketching and preconditioning.
  • The research proves the convergence of the methods is dependent on the average condition number of the matrix, which improves as the rank of the Nyström approximation increases.

Read Full Article

like

18 Likes

For uninterrupted reading, download the app