这是indexloc提供的服务,不要输入任何密码
Skip to main content

Showing 1–50 of 149,322 results for author: D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.17730  [pdf, ps, other

    cs.AI

    Online Submission and Evaluation System Design for Competition Operations

    Authors: Zhe Chen, Daniel Harabor, Ryan Hechnenberger, Nathan R. Sturtevant

    Abstract: Research communities have developed benchmark datasets across domains to compare the performance of algorithms and techniques However, tracking the progress in these research areas is not easy, as publications appear in different venues at the same time, and many of them claim to represent the state-of-the-art. To address this, research communities often organise periodic competitions to evaluate… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: This work was presented at the Workshop on the International Planning Competition (WIPC 2024)

  2. arXiv:2507.17728  [pdf, ps, other

    cs.CL

    Megrez2 Technical Report

    Authors: Boxun Li, Yadong Li, Zhiyuan Li, Congyi Liu, Weilin Liu, Guowei Niu, Zheyue Tan, Haiyang Xu, Zhuyu Yao, Tao Yuan, Dong Zhou, Yueqing Zhuang, Bo Zhao, Guohao Dai, Yu Wang

    Abstract: We present Megrez2, a novel lightweight and high-performance language model architecture optimized for device native deployment. Megrez2 introduces a novel cross-layer expert sharing mechanism, which significantly reduces total parameter count by reusing expert modules across adjacent transformer layers while maintaining most of the model's capacity. It also incorporates pre-gated routing, enablin… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  3. Application of new conformal cooling layouts to the green injection molding of complex slender polymeric parts with high dimensional specifications

    Authors: Abelardo Torres Alba, Jorge Manuel Mercado Colmenero, Juan de Dios Caballero Garcia, Cristina Martin Donate

    Abstract: Eliminating warpage in injection molded polymeric parts is one of the most important problems in the injection molding industry today. This situation is critical in geometries that are particularly susceptible to warping due to their geometric features, and this occurs with topologies of great length and slenderness with high changes in thickness. These features are, in these special geometries, i… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Journal ref: (2023). Application of new conformal cooling layouts to the green injection molding of complex slender polymeric parts with high dimensional specifications. Polymers, 15(3), 558

  4. arXiv:2507.17718  [pdf, ps, other

    cs.CL cs.AI cs.HC

    AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer

    Authors: Danny D. Leybzon, Shreyas Tirumala, Nishant Jain, Summer Gillen, Michael Jackson, Cameron McPhee, Jennifer Schmidt

    Abstract: With the rise of voice-enabled artificial intelligence (AI) systems, quantitative survey researchers have access to a new data-collection mode: AI telephone surveying. By using AI to conduct phone interviews, researchers can scale quantitative studies while balancing the dual goals of human-like interactivity and methodological rigor. Unlike earlier efforts that used interactive voice response (IV… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  5. arXiv:2507.17717  [pdf, ps, other

    cs.CL cs.AI

    From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

    Authors: Karen Zhou, John Giorgi, Pranav Mani, Peng Xu, Davis Liang, Chenhao Tan

    Abstract: AI-generated clinical notes are increasingly used in healthcare, but evaluating their quality remains a challenge due to high subjectivity and limited scalability of expert review. Existing automated metrics often fail to align with real-world physician preferences. To address this, we propose a pipeline that systematically distills real user feedback into structured checklists for note evaluation… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  6. arXiv:2507.17692  [pdf, ps, other

    cs.LG cs.CV

    Joint Asymmetric Loss for Learning with Noisy Labels

    Authors: Jialiang Wang, Xianming Liu, Xiong Zhou, Gangfeng Hu, Deming Zhai, Junjun Jiang, Xiangyang Ji

    Abstract: Learning with noisy labels is a crucial task for training accurate deep neural networks. To mitigate label noise, prior studies have proposed various robust loss functions, particularly symmetric losses. Nevertheless, symmetric losses usually suffer from the underfitting issue due to the overly strict constraint. To address this problem, the Active Passive Loss (APL) jointly optimizes an active an… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Accepted by ICCV 2025

  7. arXiv:2507.17691  [pdf, ps, other

    cs.SE cs.AI cs.CR cs.LG cs.PL

    CASCADE: LLM-Powered JavaScript Deobfuscator at Google

    Authors: Shan Jiang, Pranoy Kovuri, David Tao, Zhixun Tan

    Abstract: Software obfuscation, particularly prevalent in JavaScript, hinders code comprehension and analysis, posing significant challenges to software testing, static analysis, and malware detection. This paper introduces CASCADE, a novel hybrid approach that integrates the advanced coding capabilities of Gemini with the deterministic transformation capabilities of a compiler Intermediate Representation (… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  8. arXiv:2507.17688  [pdf, ps, other

    cs.HC cs.LG

    Mindfulness Meditation and Respiration: Accelerometer-Based Respiration Rate and Mindfulness Progress Estimation to Enhance App Engagement and Mindfulness Skills

    Authors: Mohammad Nur Hossain Khan, David creswell, Jordan Albert, Patrick O'Connell, Shawn Fallon, Mathew Polowitz, Xuhai "orson" Xu, Bashima islam

    Abstract: Mindfulness training is widely recognized for its benefits in reducing depression, anxiety, and loneliness. With the rise of smartphone-based mindfulness apps, digital meditation has become more accessible, but sustaining long-term user engagement remains a challenge. This paper explores whether respiration biosignal feedback and mindfulness skill estimation enhance system usability and skill deve… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Accepted in Proc. ACM Interact. Mob. Wearable Ubiquitous Technology (IMWUT)

  9. arXiv:2507.17682  [pdf, ps, other

    cs.SD cs.CV cs.MM eess.AS

    Audio-Vision Contrastive Learning for Phonological Class Recognition

    Authors: Daiqi Liu, Tomás Arias-Vergara, Jana Hutter, Andreas Maier, Paula Andrea Pérez-Toro

    Abstract: Accurate classification of articulatory-phonological features plays a vital role in understanding human speech production and developing robust speech technologies, particularly in clinical contexts where targeted phonemic analysis and therapy can improve disease diagnosis accuracy and personalized rehabilitation. In this work, we propose a multimodal deep learning framework that combines real-tim… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: conference to TSD 2025

  10. arXiv:2507.17678  [pdf, ps, other

    eess.IV cs.CV

    MCM: Mamba-based Cardiac Motion Tracking using Sequential Images in MRI

    Authors: Jiahui Yin, Xinxing Cheng, Jinming Duan, Yan Pang, Declan O'Regan, Hadrien Reynaud, Qingjie Meng

    Abstract: Myocardial motion tracking is important for assessing cardiac function and diagnosing cardiovascular diseases, for which cine cardiac magnetic resonance (CMR) has been established as the gold standard imaging modality. Many existing methods learn motion from single image pairs consisting of a reference frame and a randomly selected target frame from the cardiac cycle. However, these methods overlo… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Medical Image Computing and Computer-Assisted Intervention (MICCAI), Reconstruction and Imaging Motion Estimation Workshop (RIME), 2025

  11. arXiv:2507.17668  [pdf, ps, other

    cs.LG cs.AI

    How Should We Meta-Learn Reinforcement Learning Algorithms?

    Authors: Alexander David Goldie, Zilin Wang, Jakob Nicolaus Foerster, Shimon Whiteson

    Abstract: The process of meta-learning algorithms from data, instead of relying on manual design, is growing in popularity as a paradigm for improving the performance of machine learning systems. Meta-learning shows particular promise for reinforcement learning (RL), where algorithms are often adapted from supervised or unsupervised learning despite their suboptimality for RL. However, until now there has b… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Accepted paper at Reinforcement Learning Conference (RLC) 2025

  12. arXiv:2507.17665  [pdf, ps, other

    cs.CV cs.RO

    Perspective-Invariant 3D Object Detection

    Authors: Ao Liang, Lingdong Kong, Dongyue Lu, Youquan Liu, Jian Fang, Huaici Zhao, Wei Tsang Ooi

    Abstract: With the rise of robotics, LiDAR-based 3D object detection has garnered significant attention in both academia and industry. However, existing datasets and methods predominantly focus on vehicle-mounted platforms, leaving other autonomous platforms underexplored. To bridge this gap, we introduce Pi3DET, the first benchmark featuring LiDAR data and 3D bounding box annotations collected from multipl… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: ICCV 2025; 46 pages, 18 figures, 22 tables; Project Page at https://pi3det.github.io

  13. arXiv:2507.17664  [pdf, ps, other

    cs.CV cs.RO

    Talk2Event: Grounded Understanding of Dynamic Scenes from Event Cameras

    Authors: Lingdong Kong, Dongyue Lu, Ao Liang, Rong Li, Yuhao Dong, Tianshuai Hu, Lai Xing Ng, Wei Tsang Ooi, Benoit R. Cottereau

    Abstract: Event cameras offer microsecond-level latency and robustness to motion blur, making them ideal for understanding dynamic environments. Yet, connecting these asynchronous streams to human language remains an open challenge. We introduce Talk2Event, the first large-scale benchmark for language-driven object grounding in event-based perception. Built from real-world driving data, we provide over 30,0… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Preprint; 42 pages, 17 figures, 16 tables; Project Page at https://talk2event.github.io

  14. arXiv:2507.17649  [pdf, ps, other

    cs.RO

    Event Detection for Active Lower Limb Prosthesis

    Authors: J. D. Clark, P. Ellison

    Abstract: Accurate event detection is key to the successful design of semi-passive and powered prosthetics. Kinematically, the natural knee is complex, with translation and rotation components that have a substantial impact on gait characteristics. When simplified to a pin joint, some of this behaviour is lost. This study investigates the role of cruciate ligament stretch in event detection. A bicondylar kn… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  15. arXiv:2507.17647  [pdf, ps, other

    cs.DB

    SHINE: A Scalable HNSW Index in Disaggregated Memory

    Authors: Manuel Widmoser, Daniel Kocher, Nikolaus Augsten

    Abstract: Approximate nearest neighbor (ANN) search is a fundamental problem in computer science for which in-memory graph-based methods, such as Hierarchical Navigable Small World (HNSW), perform exceptionally well. To scale beyond billions of high-dimensional vectors, the index must be distributed. The disaggregated memory architecture physically separates compute and memory into two distinct hardware uni… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  16. arXiv:2507.17614  [pdf, ps, other

    quant-ph cs.DC

    Comparing performance of variational quantum algorithm simulations on HPC systems

    Authors: Marco De Pascale, Tobias Valentin Bauer, Yaknan John Gambo, Mario Hernández Vera, Stefan Huber, Burak Mete, Amit Jamadagni, Amine Bentellis, Marita Oliv, Luigi Iapichino, Jeanette Miriam Lorenz

    Abstract: Variational quantum algorithms are of special importance in the research on quantum computing applications because of their applicability to current Noisy Intermediate-Scale Quantum (NISQ) devices. The main building blocks of these algorithms (among them, the definition of the Hamiltonian and of the ansatz, the optimizer) define a relatively large parameter space, making the comparison of results… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  17. arXiv:2507.17613  [pdf, ps, other

    cs.CV

    InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling

    Authors: Xiaoxue Chen, Bhargav Chandaka, Chih-Hao Lin, Ya-Qin Zhang, David Forsyth, Hao Zhao, Shenlong Wang

    Abstract: We present InvRGB+L, a novel inverse rendering model that reconstructs large, relightable, and dynamic scenes from a single RGB+LiDAR sequence. Conventional inverse graphics methods rely primarily on RGB observations and use LiDAR mainly for geometric information, often resulting in suboptimal material estimates due to visible light interference. We find that LiDAR's intensity values-captured with… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Accepted to ICCV 2025

  18. arXiv:2507.17588  [pdf, ps, other

    cs.CV cs.CL

    Dual-branch Prompting for Multimodal Machine Translation

    Authors: Jie Wang, Zhendong Yang, Liansong Zong, Xiaobo Zhang, Dexian Wang, Ji Zhang

    Abstract: Multimodal Machine Translation (MMT) typically enhances text-only translation by incorporating aligned visual features. Despite the remarkable progress, state-of-the-art MMT approaches often rely on paired image-text inputs at inference and are sensitive to irrelevant visual noise, which limits their robustness and practical applicability. To address these issues, we propose D2P-MMT, a diffusion-b… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  19. arXiv:2507.17585  [pdf, ps, other

    cs.CV cs.RO

    From Scan to Action: Leveraging Realistic Scans for Embodied Scene Understanding

    Authors: Anna-Maria Halacheva, Jan-Nico Zaech, Sombit Dey, Luc Van Gool, Danda Pani Paudel

    Abstract: Real-world 3D scene-level scans offer realism and can enable better real-world generalizability for downstream applications. However, challenges such as data volume, diverse annotation formats, and tool compatibility limit their use. This paper demonstrates a methodology to effectively leverage these scans and their annotations. We propose a unified annotation integration using USD, with applicati… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Accepted at the OpenSUN3D Workshop, CVPR 2025. This workshop paper is not included in the official CVPR proceedings

  20. arXiv:2507.17561  [pdf, ps, other

    cs.RO

    Robot-mediated physical Human-Human Interaction in Neurorehabilitation: a position paper

    Authors: Lorenzo Vianello, Matthew Short, Julia Manczurowsky, Emek Barış Küçüktabak, Francesco Di Tommaso, Alessia Noccaro, Laura Bandini, Shoshana Clark, Alaina Fiorenza, Francesca Lunardini, Alberto Canton, Marta Gandolla, Alessandra L. G. Pedrocchi, Emilia Ambrosini, Manuel Murie-Fernandez, Carmen B. Roman, Jesus Tornero, Natacha Leon, Andrew Sawers, Jim Patton, Domenico Formica, Nevio Luigi Tagliamonte, Georg Rauter, Kilian Baur, Fabian Just , et al. (3 additional authors not shown)

    Abstract: Neurorehabilitation conventionally relies on the interaction between a patient and a physical therapist. Robotic systems can improve and enrich the physical feedback provided to patients after neurological injury, but they under-utilize the adaptability and clinical expertise of trained therapists. In this position paper, we advocate for a novel approach that integrates the therapist's clinical ex… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  21. arXiv:2507.17539  [pdf, ps, other

    cs.AI cs.CV eess.IV

    Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning

    Authors: Xinyao Liu, Diping Song

    Abstract: Multimodal large language models (MLLMs) demonstrate significant potential in the field of medical diagnosis. However, they face critical challenges in specialized domains such as ophthalmology, particularly the fragmentation of annotation granularity and inconsistencies in clinical reasoning logic, which hinder precise cross-modal understanding. This paper introduces FundusExpert, an ophthalmolog… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  22. arXiv:2507.17518  [pdf, ps, other

    cs.CR cs.AI cs.CY cs.HC cs.SE

    Enabling Cyber Security Education through Digital Twins and Generative AI

    Authors: Vita Santa Barletta, Vito Bavaro, Miriana Calvano, Antonio Curci, Antonio Piccinno, Davide Pio Posa

    Abstract: Digital Twins (DTs) are gaining prominence in cybersecurity for their ability to replicate complex IT (Information Technology), OT (Operational Technology), and IoT (Internet of Things) infrastructures, allowing for real time monitoring, threat analysis, and system simulation. This study investigates how integrating DTs with penetration testing tools and Large Language Models (LLMs) can enhance cy… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  23. arXiv:2507.17513  [pdf, ps, other

    cs.LG cs.AI

    HOTA: Hamiltonian framework for Optimal Transport Advection

    Authors: Nazar Buzun, Daniil Shlenskii, Maxim Bobrin, Dmitry V. Dylov

    Abstract: Optimal transport (OT) has become a natural framework for guiding the probability flows. Yet, the majority of recent generative models assume trivial geometry (e.g., Euclidean) and rely on strong density-estimation assumptions, yielding trajectories that do not respect the true principles of optimality in the underlying manifold. We present Hamiltonian Optimal Transport Advection (HOTA), a Hamilto… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  24. arXiv:2507.17509  [pdf, ps, other

    cond-mat.dis-nn cs.LG

    Graph Neural Network Approach to Predicting Magnetization in Quasi-One-Dimensional Ising Systems

    Authors: V. Slavin, O. Kryvchikov, D. Laptev

    Abstract: We present a graph-based deep learning framework for predicting the magnetic properties of quasi-one-dimensional Ising spin systems. The lattice geometry is encoded as a graph and processed by a graph neural network (GNN) followed by fully connected layers. The model is trained on Monte Carlo simulation data and accurately reproduces key features of the magnetization curve, including plateaus, cri… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: 18 pages, 4 figures

  25. arXiv:2507.17494  [pdf, ps, other

    stat.ML cs.AI cs.LG eess.SP

    To Trust or Not to Trust: On Calibration in ML-based Resource Allocation for Wireless Networks

    Authors: Rashika Raina, Nidhi Simmons, David E. Simmons, Michel Daoud Yacoub, Trung Q. Duong

    Abstract: In next-generation communications and networks, machine learning (ML) models are expected to deliver not only accurate predictions but also well-calibrated confidence scores that reflect the true likelihood of correct decisions. This paper studies the calibration performance of an ML-based outage predictor within a single-user, multi-resource allocation framework. We first establish key theoretica… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  26. arXiv:2507.17476  [pdf, ps, other

    cs.CL cs.AI

    MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs

    Authors: Alexander R. Fabbri, Diego Mares, Jorge Flores, Meher Mankikar, Ernesto Hernandez, Dean Lee, Bing Liu, Chen Xing

    Abstract: Although recent Large Language Models (LLMs) have shown rapid improvement on reasoning benchmarks in English, the evaluation of such LLMs' multilingual reasoning capability across diverse languages and cultural contexts remains limited. Existing multilingual reasoning benchmarks are typically constructed by translating existing English reasoning benchmarks, biasing these benchmarks towards reasoni… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  27. arXiv:2507.17470  [pdf, ps, other

    quant-ph cs.AI cs.LG

    Demonstration of Efficient Predictive Surrogates for Large-scale Quantum Processors

    Authors: Wei-You Liao, Yuxuan Du, Xinbiao Wang, Tian-Ci Tian, Yong Luo, Bo Du, Dacheng Tao, He-Liang Huang

    Abstract: The ongoing development of quantum processors is driving breakthroughs in scientific discovery. Despite this progress, the formidable cost of fabricating large-scale quantum processors means they will remain rare for the foreseeable future, limiting their widespread application. To address this bottleneck, we introduce the concept of predictive surrogates, which are classical learning models desig… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: 53 pages, 15 figures, comments are welcome

  28. arXiv:2507.17455  [pdf, ps, other

    cs.CV cs.RO

    VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization

    Authors: Sania Waheed, Na Min An, Michael Milford, Sarvapali D. Ramchurn, Shoaib Ehsan

    Abstract: Geo-localization from a single image at planet scale (essentially an advanced or extreme version of the kidnapped robot problem) is a fundamental and challenging task in applications such as navigation, autonomous driving and disaster response due to the vast diversity of locations, environmental conditions, and scene variations. Traditional retrieval-based methods for geo-localization struggle wi… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  29. Efficient Neural Network Verification via Order Leading Exploration of Branch-and-Bound Trees

    Authors: Guanqin Zhang, Kota Fukuda, Zhenya Zhang, H. M. N. Dilum Bandara, Shiping Chen, Jianjun Zhao, Yulei Sui

    Abstract: The vulnerability of neural networks to adversarial perturbations has necessitated formal verification techniques that can rigorously certify the quality of neural networks. As the state-of-the-art, branch and bound (BaB) is a "divide-and-conquer" strategy that applies off-the-shelf verifiers to sub-problems for which they perform better. While BaB can identify the sub-problems that are necessary… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: This is an extended version of the ECOOP 2025 paper, with a comparison with DATE 2025 (Figure 7 of RQ1 in Section 5.2), as well as an in-depth discussion of OOPSLA 2025 in the related work (Section 6)

  30. arXiv:2507.17406  [pdf, ps, other

    cs.CV

    Physics-based Human Pose Estimation from a Single Moving RGB Camera

    Authors: Ayce Idil Aytekin, Chuqiao Li, Diogo Luvizon, Rishabh Dabral, Martin Oswald, Marc Habermann, Christian Theobalt

    Abstract: Most monocular and physics-based human pose tracking methods, while achieving state-of-the-art results, suffer from artifacts when the scene does not have a strictly flat ground plane or when the camera is moving. Moreover, these methods are often evaluated on in-the-wild real world videos without ground-truth data or on synthetic datasets, which fail to model the real world light transport, camer… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  31. The Wilhelm Tell Dataset of Affordance Demonstrations

    Authors: Rachel Ringe, Mihai Pomarlan, Nikolaos Tsiogkas, Stefano De Giorgis, Maria Hedblom, Rainer Malaka

    Abstract: Affordances - i.e. possibilities for action that an environment or objects in it provide - are important for robots operating in human environments to perceive. Existing approaches train such capabilities on annotated static images or shapes. This work presents a novel dataset for affordance learning of common household tasks. Unlike previous approaches, our dataset consists of video sequences dem… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: \c{opyright} 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: 2025 20th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Melbourne, Australia, 2025, pp. 1078-1082

  32. arXiv:2507.17391  [pdf, ps, other

    cs.DS

    Residual Prophet Inequalities

    Authors: Jose Correa, Sebastian Perez-Salazar, Dana Pizarro, Bruno Ziliotto

    Abstract: We introduce a variant of the classic prophet inequality, called \emph{residual prophet inequality} (RPI). In the RPI problem, we consider a finite sequence of $n$ nonnegative independent random values with known distributions, and a known integer $0\leq k\leq n-1$. Before the gambler observes the sequence, the top $k$ values are removed, whereas the remaining $n-k$ values are streamed sequentiall… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    MSC Class: 68W27; 60G40

  33. arXiv:2507.17389  [pdf, ps, other

    cs.SE cs.AI

    Investigating Training Data Detection in AI Coders

    Authors: Tianlin Li, Yunxiang Wei, Zhiming Li, Aishan Liu, Qing Guo, Xianglong Liu, Dongning Sun, Yang Liu

    Abstract: Recent advances in code large language models (CodeLLMs) have made them indispensable tools in modern software engineering. However, these models occasionally produce outputs that contain proprietary or sensitive code snippets, raising concerns about potential non-compliant use of training data, and posing risks to privacy and intellectual property. To ensure responsible and compliant deployment o… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  34. arXiv:2507.17379  [pdf, ps, other

    cs.RO

    Language-Conditioned Open-Vocabulary Mobile Manipulation with Pretrained Models

    Authors: Shen Tan, Dong Zhou, Xiangyu Shao, Junqiao Wang, Guanghui Sun

    Abstract: Open-vocabulary mobile manipulation (OVMM) that involves the handling of novel and unseen objects across different workspaces remains a significant challenge for real-world robotic applications. In this paper, we propose a novel Language-conditioned Open-Vocabulary Mobile Manipulation framework, named LOVMM, incorporating the large language model (LLM) and vision-language model (VLM) to tackle var… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: IJCAI 2025

  35. arXiv:2507.17354  [pdf, ps, other

    cs.FL

    Realisability and Complementability of Multiparty Session Types

    Authors: Cinzia Di Giusto, Etienne Lozes, Pascal Urso

    Abstract: Multiparty session types (MPST) are a type-based approach for specifying message-passing distributed systems. They rely on the notion of global type specifying the global behaviour and local types, which are the projections of the global behaviour onto each local participant. An essential property of global types is realisability, i.e., whether the composition of the local behaviours conforms to t… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  36. arXiv:2507.17348  [pdf, ps, other

    cs.LG

    TOC-UCO: a comprehensive repository of tabular ordinal classification datasets

    Authors: Rafael Ayllón-Gavilán, David Guijo-Rubio, Antonio Manuel Gómez-Orellana, David Guijo-Rubio, Francisco Bérchez-Moreno, Víctor Manuel Vargas-Yun, Pedro A. Gutiérrez

    Abstract: An ordinal classification (OC) problem corresponds to a special type of classification characterised by the presence of a natural order relationship among the classes. This type of problem can be found in a number of real-world applications, motivating the design and development of many ordinal methodologies over the last years. However, it is important to highlight that the development of the OC… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: 25 single column pages, 5 figures, 7 tables

  37. arXiv:2507.17346  [pdf, ps, other

    cs.LG

    DeCo-SGD: Joint Optimization of Delay Staleness and Gradient Compression Ratio for Distributed SGD

    Authors: Rongwei Lu, Jingyan Jiang, Chunyang Li, Haotian Dong, Xingguang Wei, Delin Cai, Zhi Wang

    Abstract: Distributed machine learning in high end-to-end latency and low, varying bandwidth network environments undergoes severe throughput degradation. Due to its low communication requirements, distributed SGD (D-SGD) remains the mainstream optimizer in such challenging networks, but it still suffers from significant throughput reduction. To mitigate these limitations, existing approaches typically empl… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  38. arXiv:2507.17338  [pdf, ps, other

    cs.RO

    Mobile Manipulation with Active Inference for Long-Horizon Rearrangement Tasks

    Authors: Corrado Pezzato, Ozan Çatal, Toon Van de Maele, Riddhi J. Pitliya, Tim Verbelen

    Abstract: Despite growing interest in active inference for robotic control, its application to complex, long-horizon tasks remains untested. We address this gap by introducing a fully hierarchical active inference architecture for goal-directed behavior in realistic robotic settings. Our model combines a high-level active inference model that selects among discrete skills realized via a whole-body active in… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  39. arXiv:2507.17332  [pdf, ps, other

    cs.CV

    PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image

    Authors: Hyeongjin Nam, Donghwan Kim, Gyeongsik Moon, Kyoung Mu Lee

    Abstract: The misaligned human texture across different human parts is one of the main limitations of existing 3D human reconstruction methods. Each human part, such as a jacket or pants, should maintain a distinct texture without blending into others. The structural coherence of human parts serves as a crucial cue to infer human textures in the invisible regions of a single image. However, most existing 3D… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Published at ICCV 2025, 22 pages including the supplementary material

  40. arXiv:2507.17326  [pdf, ps, other

    cs.SD eess.AS

    Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task

    Authors: Milena Davudova, Ziyuan Cai, Valentina Giunchiglia, Dragos C. Gruia, Giulia Sanguedolce, Adam Hampshire, Fatemeh Geranmayeh

    Abstract: Detailed assessment of language impairment following stroke remains a cognitively complex and clinician-intensive task, limiting timely and scalable diagnosis. Automatic Speech Recognition (ASR) foundation models offer a promising pathway to augment human evaluation through intelligent systems, but their effectiveness in the context of speech and language impairment remains uncertain. In this stud… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  41. arXiv:2507.17316  [pdf, ps, other

    stat.ML cs.LG

    Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

    Authors: Dirk van der Hoeven, Julia Olkhovskaia, Tim van Erven

    Abstract: We consider the problem of estimating a discrete distribution $p$ with support of size $K$ and provide both upper and lower bounds with high probability in KL divergence. We prove that in the worst case, for any estimator $\widehat{p}$, with probability at least $δ$, $\text{KL}(p \| \widehat{p}) \geq C\max\{K,\ln(K)\ln(1/δ) \}/n $, where $n$ is the sample size and $C > 0$ is a constant. We introdu… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  42. arXiv:2507.17301  [pdf, ps, other

    cs.DC

    Efficient Column-Wise N:M Pruning on RISC-V CPU

    Authors: Chi-Wei Chu, Ding-Yong Hong, Jan-Jan Wu

    Abstract: In deep learning frameworks, weight pruning is a widely used technique for improving computational efficiency by reducing the size of large models. This is especially critical for convolutional operators, which often act as performance bottlenecks in convolutional neural networks (CNNs). However, the effectiveness of pruning heavily depends on how it is implemented, as different methods can signif… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  43. arXiv:2507.17291  [pdf, ps, other

    cs.LO cs.AI

    Integrating Belief Domains into Probabilistic Logic Programs

    Authors: Damiano Azzolini, Fabrizio Riguzzi, Theresa Swift

    Abstract: Probabilistic Logic Programming (PLP) under the Distribution Semantics is a leading approach to practical reasoning under uncertainty. An advantage of the Distribution Semantics is its suitability for implementation as a Prolog or Python library, available through two well-maintained implementations, namely ProbLog and cplint/PITA. However, current formulations of the Distribution Semantics use po… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP)

  44. arXiv:2507.17289  [pdf, ps, other

    cs.AI

    Compliance Brain Assistant: Conversational Agentic AI for Assisting Compliance Tasks in Enterprise Environments

    Authors: Shitong Zhu, Chenhao Fang, Derek Larson, Neel Reddy Pochareddy, Rajeev Rao, Sophie Zeng, Yanqing Peng, Wendy Summer, Alex Goncalves, Arya Pudota, Herve Robert

    Abstract: This paper presents Compliance Brain Assistant (CBA), a conversational, agentic AI assistant designed to boost the efficiency of daily compliance tasks for personnel in enterprise environments. To strike a good balance between response quality and latency, we design a user query router that can intelligently choose between (i) FastTrack mode: to handle simple requests that only need additional rel… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  45. arXiv:2507.17281  [pdf, ps, other

    cs.CV

    Fully Automated SAM for Single-source Domain Generalization in Medical Image Segmentation

    Authors: Huanli Zhuo, Leilei Ma, Haifeng Zhao, Shiwei Zhou, Dengdi Sun, Yanping Fu

    Abstract: Although SAM-based single-source domain generalization models for medical image segmentation can mitigate the impact of domain shift on the model in cross-domain scenarios, these models still face two major challenges. First, the segmentation of SAM is highly dependent on domain-specific expert-annotated prompts, which prevents SAM from achieving fully automated medical image segmentation and ther… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: This manuscript has been accepted for presentation at the IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC 2025) and is copyrighted by IEEE

  46. arXiv:2507.17264  [pdf, ps, other

    cs.SE cs.AI cs.HC

    Understanding Prompt Programming Tasks and Questions

    Authors: Jenny T. Liang, Chenyang Yang, Agnia Sergeyuk, Travis D. Breaux, Brad A. Myers

    Abstract: Prompting foundation models (FMs) like large language models (LLMs) have enabled new AI-powered software features (e.g., text summarization) that previously were only possible by fine-tuning FMs. Now, developers are embedding prompts in software, known as prompt programs. The process of prompt programming requires the developer to make many changes to their prompt. Yet, the questions developers as… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  47. arXiv:2507.17259  [pdf, ps, other

    cs.CR cs.CL

    Tab-MIA: A Benchmark Dataset for Membership Inference Attacks on Tabular Data in LLMs

    Authors: Eyal German, Sagiv Antebi, Daniel Samira, Asaf Shabtai, Yuval Elovici

    Abstract: Large language models (LLMs) are increasingly trained on tabular data, which, unlike unstructured text, often contains personally identifiable information (PII) in a highly structured and explicit format. As a result, privacy risks arise, since sensitive records can be inadvertently retained by the model and exposed through data extraction or membership inference attacks (MIAs). While existing MIA… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  48. arXiv:2507.17248  [pdf, ps, other

    cs.HC cs.AI cs.GR

    Reality Proxy: Fluid Interactions with Real-World Objects in MR via Abstract Representations

    Authors: Xiaoan Liu, Difan Jia, Xianhao Carton Liu, Mar Gonzalez-Franco, Chen Zhu-Tian

    Abstract: Interacting with real-world objects in Mixed Reality (MR) often proves difficult when they are crowded, distant, or partially occluded, hindering straightforward selection and manipulation. We observe that these difficulties stem from performing interaction directly on physical objects, where input is tightly coupled to their physical constraints. Our key insight is to decouple interaction from th… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

    Comments: 16 pages, 9 figures. Accepted for publication in UIST'25 (The 38th Annual ACM Symposium on User Interface Software and Technology), Busan, Republic of Korea, 28 Sep - 1 Oct 2025

    ACM Class: H.5.2; I.3.6

  49. arXiv:2507.17245  [pdf, ps, other

    cs.LG cs.AI

    DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs

    Authors: Haolin Jin, Mengbai Xiao, Yuan Yuan, Xiao Zhang, Dongxiao Yu, Guanghui Zhang, Haoliang Wang

    Abstract: The Transformer architecture has revolutionized deep learning, delivering the state-of-the-art performance in areas such as natural language processing, computer vision, and time series prediction. However, its core component, self-attention, has the quadratic time complexity relative to input sequence length, which hinders the scalability of Transformers. The exsiting approaches on optimizing sel… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.

  50. arXiv:2507.17235  [pdf, ps, other

    cs.SE quant-ph

    On the Feasibility of Quantum Unit Testing

    Authors: Andriy Miranskyy, José Campos, Anila Mjeda, Lei Zhang, Ignacio García Rodríguez de Guzmán

    Abstract: The increasing complexity of quantum software presents significant challenges for software verification and validation, particularly in the context of unit testing. This work presents a comprehensive study on quantum-centric unit tests, comparing traditional statistical approaches with tests specifically designed for quantum circuits. These include tests that run only on a classical computer, such… ▽ More

    Submitted 23 July, 2025; originally announced July 2025.