这是indexloc提供的服务,不要输入任何密码
Skip to main content

Showing 1–50 of 231 results for author: Patel, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.17012  [pdf, ps, other

    cs.AI cs.CE

    Towards Autonomous Sustainability Assessment via Multimodal AI Agents

    Authors: Zhihan Zhang, Alexander Metzger, Yuxuan Mei, Felix Hähnlein, Zachary Englhardt, Tingyu Cheng, Gregory D. Abowd, Shwetak Patel, Adriana Schulz, Vikram Iyer

    Abstract: Interest in sustainability information has surged in recent years. However, the data required for a life cycle assessment (LCA) that maps the materials and processes from product manufacturing to disposal into environmental impacts (EI) are often unavailable. Here we reimagine conventional LCA by introducing multimodal AI agents that emulate interactions between LCA experts and stakeholders like p… ▽ More

    Submitted 22 July, 2025; originally announced July 2025.

  2. arXiv:2507.11889  [pdf, ps, other

    cs.RO

    NemeSys: An Online Underwater Explorer with Goal-Driven Adaptive Autonomy

    Authors: Adnan Abdullah, Alankrit Gupta, Vaishnav Ramesh, Shivali Patel, Md Jahidul Islam

    Abstract: Adaptive mission control and dynamic parameter reconfiguration are essential for autonomous underwater vehicles (AUVs) operating in GPS-denied, communication-limited marine environments. However, most current AUV platforms execute static, pre-programmed missions or rely on tethered connections and high-latency acoustic channels for mid-mission updates, significantly limiting their adaptability and… ▽ More

    Submitted 16 July, 2025; originally announced July 2025.

    Comments: 10 pages, V1

  3. arXiv:2507.00990  [pdf, ps, other

    cs.RO cs.AI cs.CV

    Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations

    Authors: Shivansh Patel, Shraddhaa Mohan, Hanlin Mai, Unnat Jain, Svetlana Lazebnik, Yunzhu Li

    Abstract: This work introduces Robots Imitating Generated Videos (RIGVid), a system that enables robots to perform complex manipulation tasks--such as pouring, wiping, and mixing--purely by imitating AI-generated videos, without requiring any physical demonstrations or robot-specific training. Given a language command and an initial scene image, a video diffusion model generates potential demonstration vide… ▽ More

    Submitted 4 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: Project Page: https://rigvid-robot.github.io/

  4. arXiv:2506.23414  [pdf, ps, other

    cs.CV

    A High-Throughput Platform to Bench Test Smartphone-Based Heart Rate Measurements Derived From Video

    Authors: Ming-Zher Poh, Jonathan Wang, Jonathan Hsu, Lawrence Cai, Eric Teasley, James A. Taylor, Jameson K. Rogers, Anupam Pathak, Shwetak Patel

    Abstract: Smartphone-based heart rate (HR) monitoring apps using finger-over-camera photoplethysmography (PPG) face significant challenges in performance evaluation and device compatibility due to device variability and fragmentation. Manual testing is impractical, and standardized methods are lacking. This paper presents a novel, high-throughput bench-testing platform to address this critical need. We desi… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  5. From Unstructured Communication to Intelligent RAG: Multi-Agent Automation for Supply Chain Knowledge Bases

    Authors: Yao Zhang, Zaixi Shang, Silpan Patel, Mikel Zuniga

    Abstract: Supply chain operations generate vast amounts of operational data; however, critical knowledge such as system usage practices, troubleshooting workflows, and resolution techniques often remains buried within unstructured communications like support tickets, emails, and chat logs. While RAG systems aim to leverage such communications as a knowledge base, their effectiveness is limited by raw data c… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: Accepted In Proceedings of the 1st Workshop on AI for Supply Chain: Today and Future @ 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD 25), August 3, 2025, Toronto, ON, Canada. ACM, New York, NY, USA, 14 pages, 2 figures

  6. arXiv:2506.09108  [pdf, ps, other

    cs.LG cs.AI cs.CL

    SensorLM: Learning the Language of Wearable Sensors

    Authors: Yuwei Zhang, Kumar Ayush, Siyuan Qiao, A. Ali Heydari, Girish Narayanswamy, Maxwell A. Xu, Ahmed A. Metwally, Shawn Xu, Jake Garrison, Xuhai Xu, Tim Althoff, Yun Liu, Pushmeet Kohli, Jiening Zhan, Mark Malhotra, Shwetak Patel, Cecilia Mascolo, Xin Liu, Daniel McDuff, Yuzhe Yang

    Abstract: We present SensorLM, a family of sensor-language foundation models that enable wearable sensor data understanding with natural language. Despite its pervasive nature, aligning and interpreting sensor data with language remains challenging due to the lack of paired, richly annotated sensor-text descriptions in uncurated, real-world wearable data. We introduce a hierarchical caption generation pipel… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  7. arXiv:2506.08249  [pdf, other

    cs.DB cs.CL

    RADAR: Benchmarking Language Models on Imperfect Tabular Data

    Authors: Ken Gu, Zhihan Zhang, Kate Lin, Yuwei Zhang, Akshay Paruchuri, Hong Yu, Mehran Kazemi, Kumar Ayush, A. Ali Heydari, Maxwell A. Xu, Girish Narayanswamy, Yun Liu, Ming-Zher Poh, Yuzhe Yang, Mark Malhotra, Shwetak Patel, Hamid Palangi, Xuhai Xu, Daniel McDuff, Tim Althoff, Xin Liu

    Abstract: Language models (LMs) are increasingly being deployed to perform autonomous data analyses. However, their data awareness -- the ability to recognize, reason over, and appropriately handle data artifacts such as missing values, outliers, and logical inconsistencies -- remains underexplored. These artifacts are especially common in real-world tabular data and, if mishandled, can significantly compro… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  8. arXiv:2506.07621  [pdf, ps, other

    cs.CL cs.AI cs.LG

    LoRMA: Low-Rank Multiplicative Adaptation for LLMs

    Authors: Harsh Bihany, Shubham Patel, Ashutosh Modi

    Abstract: Large Language Models have shown remarkable capabilities in the NLP domain. Their effectiveness can mainly be attributed to their ability to adapt to an array of downstream tasks. However, generally, full fine-tuning is a computationally expensive job. To mitigate this, many techniques have been developed that prime efficiency, a prominent one being Low-Rank Adaptation (LoRA). However, LoRA and it… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Accepted at ACL Findings 2025; 21 pages (9 main paper + 5 pages references + 7 pages appendix)

  9. arXiv:2506.05321  [pdf, other

    cs.LG

    LSM-2: Learning from Incomplete Wearable Sensor Data

    Authors: Maxwell A. Xu, Girish Narayanswamy, Kumar Ayush, Dimitris Spathis, Shun Liao, Shyam A. Tailor, Ahmed Metwally, A. Ali Heydari, Yuwei Zhang, Jake Garrison, Samy Abdel-Ghaffar, Xuhai Xu, Ken Gu, Jacob Sunshine, Ming-Zher Poh, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Mark Malhotra, Shwetak Patel, Yuzhe Yang, James M. Rehg, Xin Liu, Daniel McDuff

    Abstract: Foundation models, a cornerstone of recent advancements in machine learning, have predominantly thrived on complete and well-structured data. Wearable sensor data frequently suffers from significant missingness, posing a substantial challenge for self-supervised learning (SSL) models that typically assume complete data inputs. This paper introduces the second generation of Large Sensor Model (LSM-… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Xu and Narayanswamy are co-first authors. McDuff and Liu are co-last authors

  10. arXiv:2506.04515  [pdf, ps, other

    q-bio.QM cs.AI cs.LG

    The Latent Space Hypothesis: Toward Universal Medical Representation Learning

    Authors: Salil Patel

    Abstract: Medical data range from genomic sequences and retinal photographs to structured laboratory results and unstructured clinical narratives. Although these modalities appear disparate, many encode convergent information about a single underlying physiological state. The Latent Space Hypothesis frames each observation as a projection of a unified, hierarchically organized manifold -- much like shadows… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 51 pages, 12 figures. A position paper examining the latent space hypothesis - the proposition that diverse medical data can be represented in shared latent spaces reflecting fundamental biological processes. The paper discusses theoretical foundations, reviews supporting evidence, and considers potential implications for medical AI and representation learning

  11. arXiv:2506.04467  [pdf

    physics.med-ph cs.AI

    Diffusion Transformer-based Universal Dose Denoising for Pencil Beam Scanning Proton Therapy

    Authors: Yuzhen Ding, Jason Holmes, Hongying Feng, Martin Bues, Lisa A. McGee, Jean-Claude M. Rwigema, Nathan Y. Yu, Terence S. Sio, Sameer R. Keole, William W. Wong, Steven E. Schild, Jonathan B. Ashman, Sujay A. Vora, Daniel J. Ma, Samir H. Patel, Wei Liu

    Abstract: Purpose: Intensity-modulated proton therapy (IMPT) offers precise tumor coverage while sparing organs at risk (OARs) in head and neck (H&N) cancer. However, its sensitivity to anatomical changes requires frequent adaptation through online adaptive radiation therapy (oART), which depends on fast, accurate dose calculation via Monte Carlo (MC) simulations. Reducing particle count accelerates MC but… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  12. arXiv:2505.18839  [pdf, ps, other

    cs.DS

    DNF Learning via Locally Mixing Random Walks

    Authors: Josh Alman, Shivam Nadimpalli, Shyamal Patel, Rocco A. Servedio

    Abstract: We give two results on PAC learning DNF formulas using membership queries in the challenging "distribution-free" learning framework, where learning algorithms must succeed for an arbitrary and unknown distribution over $\{0,1\}^n$. (1) We first give a quasi-polynomial time "list-decoding" algorithm for learning a single term of an unknown DNF formula. More precisely, for any target $s$-term DNF… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  13. arXiv:2505.06201  [pdf, other

    cs.IT

    Decoding Algorithms for Two-dimensional Constacyclic Codes over $\mathbb{F}_q$

    Authors: Vidya Sagar, Shikha Patel, Shayan Srinivasa Garani

    Abstract: We derive the spectral domain properties of two-dimensional (2-D) $(λ_1, λ_2)$-constacyclic codes over $\mathbb{F}_q$ using the 2-D finite field Fourier transform (FFFT). Based on the spectral nulls of 2-D $(λ_1, λ_2)$-constacyclic codes, we characterize the structure of 2-D constacyclic coded arrays. The proposed 2-D construction has flexible code rates and works for any code areas, be it odd or… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 26 pages, 1 figure

  14. arXiv:2505.06046  [pdf, other

    cs.CL cs.LG

    Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information

    Authors: Joshua Harris, Fan Grayson, Felix Feldman, Timothy Laurence, Toby Nonnenmacher, Oliver Higgins, Leo Loman, Selina Patel, Thomas Finnie, Samuel Collins, Michael Borowitz

    Abstract: As Large Language Models (LLMs) become widely accessible, a detailed understanding of their knowledge within specific domains becomes necessary for successful real world use. This is particularly critical in public health, where failure to retrieve relevant, accurate, and current information could significantly impact UK residents. However, currently little is known about LLM knowledge of UK Gover… ▽ More

    Submitted 15 May, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

    Comments: 24 pages, 10 pages main text

    MSC Class: 68T50

  15. arXiv:2505.03784  [pdf, ps, other

    cs.LG

    Insulin Resistance Prediction From Wearables and Routine Blood Biomarkers

    Authors: Ahmed A. Metwally, A. Ali Heydari, Daniel McDuff, Alexandru Solot, Zeinab Esmaeilpour, Anthony Z Faranesh, Menglian Zhou, David B. Savage, Conor Heneghan, Shwetak Patel, Cathy Speed, Javier L. Prieto

    Abstract: Insulin resistance, a precursor to type 2 diabetes, is characterized by impaired insulin action in tissues. Current methods for measuring insulin resistance, while effective, are expensive, inaccessible, not widely available and hinder opportunities for early intervention. In this study, we remotely recruited the largest dataset to date across the US to study insulin resistance (N=1,165 participan… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

  16. arXiv:2504.20086  [pdf, ps, other

    cs.CL cs.AI cs.CY cs.LG

    Understanding and Mitigating Risks of Generative AI in Financial Services

    Authors: Sebastian Gehrmann, Claire Huang, Xian Teng, Sergei Yurovski, Iyanuoluwa Shode, Chirag S. Patel, Arjun Bhorkar, Naveen Thomas, John Doucette, David Rosenberg, Mark Dredze, David Rabinowitz

    Abstract: To responsibly develop Generative AI (GenAI) products, it is critical to define the scope of acceptable inputs and outputs. What constitutes a "safe" response is an actively debated question. Academic work puts an outsized focus on evaluating models by themselves for general purpose aspects such as toxicity, bias, and fairness, especially in conversational applications being used by a broad audien… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted to FAccT 2025

  17. arXiv:2504.16791  [pdf, other

    astro-ph.IM astro-ph.CO cs.AI

    Radiometer Calibration using Machine Learning

    Authors: S. A. K. Leeney, H. T. J. Bevins, E. de Lera Acedo, W. J. Handley, C. Kirkham, R. S. Patel, J. Zhu, D. Molnar, J. Cumner, D. Anstey, K. Artuc, G. Bernardi, M. Bucher, S. Carey, J. Cavillot, R. Chiello, W. Croukamp, D. I. L. de Villiers, J. A. Ely, A. Fialkov, T. Gessey-Jones, G. Kulkarni, A. Magro, P. D. Meerburg, S. Mittal , et al. (13 additional authors not shown)

    Abstract: Radiometers are crucial instruments in radio astronomy, forming the primary component of nearly all radio telescopes. They measure the intensity of electromagnetic radiation, converting this radiation into electrical signals. A radiometer's primary components are an antenna and a Low Noise Amplifier (LNA), which is the core of the ``receiver'' chain. Instrumental effects introduced by the receiver… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: Under peer review for publication in Nature Scientific Reports as part of the Radio Astronomy collection

  18. arXiv:2504.16065  [pdf, ps, other

    cs.DS

    A Mysterious Connection Between Tolerant Junta Testing and Agnostically Learning Conjunctions

    Authors: Xi Chen, Shyamal Patel, Rocco A. Servedio

    Abstract: The main conceptual contribution of this paper is identifying a previously unnoticed connection between two central problems in computational learning theory and property testing: agnostically learning conjunctions and tolerantly testing juntas. Inspired by this connection, the main technical contribution is a pair of improved algorithms for these two problems. In more detail, - We give a dist… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  19. arXiv:2503.23339  [pdf, other

    cs.AI cs.CL cs.HC

    A Scalable Framework for Evaluating Health Language Models

    Authors: Neil Mallinar, A. Ali Heydari, Xin Liu, Anthony Z. Faranesh, Brent Winslow, Nova Hammerquist, Benjamin Graef, Cathy Speed, Mark Malhotra, Shwetak Patel, Javier L. Prieto, Daniel McDuff, Ahmed A. Metwally

    Abstract: Large language models (LLMs) have emerged as powerful tools for analyzing complex datasets. Recent studies demonstrate their potential to generate useful, personalized responses when provided with patient-specific health information that encompasses lifestyle, biomarkers, and context. As LLM-driven health applications are increasingly adopted, rigorous and efficient one-sided evaluation methodolog… ▽ More

    Submitted 1 April, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

  20. arXiv:2503.19328  [pdf, ps, other

    cs.CL cs.AI

    Substance over Style: Evaluating Proactive Conversational Coaching Agents

    Authors: Vidya Srinivas, Xuhai Xu, Xin Liu, Kumar Ayush, Isaac Galatzer-Levy, Shwetak Patel, Daniel McDuff, Tim Althoff

    Abstract: While NLP research has made strides in conversational tasks, many approaches focus on single-turn responses with well-defined objectives or evaluation criteria. In contrast, coaching presents unique challenges with initially undefined goals that evolve through multi-turn interactions, subjective evaluation criteria, mixed-initiative dialogue. In this work, we describe and implement five multi-turn… ▽ More

    Submitted 8 July, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted to ACL 2025

  21. arXiv:2503.14893  [pdf, other

    cs.HC

    Incorporating Sustainability in Electronics Design: Obstacles and Opportunities

    Authors: Zachary Englhardt, Felix Hähnlein, Yuxuan Mei, Tong Lin, Connor Masahiro Sun, Zhihan Zhang, Adriana Schulz, Shwetak Patel, Vikram Iyer

    Abstract: Life cycle assessment (LCA) is a methodology for holistically measuring the environmental impact of a product from initial manufacturing to end-of-life disposal. However, the extent to which LCA informs the design of computing devices remains unclear. To understand how this information is collected and applied, we interviewed 17 industry professionals with experience in LCA or electronics design,… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  22. arXiv:2503.03783  [pdf, other

    q-bio.TO cs.AI cs.ET cs.HC cs.LG

    Passive Heart Rate Monitoring During Smartphone Use in Everyday Life

    Authors: Shun Liao, Paolo Di Achille, Jiang Wu, Silviu Borac, Jonathan Wang, Xin Liu, Eric Teasley, Lawrence Cai, Yuzhe Yang, Yun Liu, Daniel McDuff, Hao-Wei Su, Brent Winslow, Anupam Pathak, Shwetak Patel, James A. Taylor, Jameson K. Rogers, Ming-Zher Poh

    Abstract: Resting heart rate (RHR) is an important biomarker of cardiovascular health and mortality, but tracking it longitudinally generally requires a wearable device, limiting its availability. We present PHRM, a deep learning system for passive heart rate (HR) and RHR measurements during everyday smartphone use, using facial video-based photoplethysmography. Our system was developed using 225,773 videos… ▽ More

    Submitted 21 March, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: Updated author list

  23. arXiv:2503.00890  [pdf, other

    cs.CV cs.AI

    Estimating Blood Pressure with a Camera: An Exploratory Study of Ambulatory Patients with Cardiovascular Disease

    Authors: Theodore Curran, Chengqian Ma, Xin Liu, Daniel McDuff, Girish Narayanswamy, George Stergiou, Shwetak Patel, Eugene Yang

    Abstract: Hypertension is a leading cause of morbidity and mortality worldwide. The ability to diagnose and treat hypertension in the ambulatory population is hindered by limited access and poor adherence to current methods of monitoring blood pressure (BP), specifically, cuff-based devices. Remote photoplethysmography (rPPG) evaluates an individual's pulse waveform through a standard camera without physica… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

  24. arXiv:2502.08643  [pdf, other

    cs.RO cs.AI cs.CV

    A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards

    Authors: Shivansh Patel, Xinchen Yin, Wenlong Huang, Shubham Garg, Hooshang Nayyeri, Li Fei-Fei, Svetlana Lazebnik, Yunzhu Li

    Abstract: Task specification for robotic manipulation in open-world environments is challenging, requiring flexible and adaptive objectives that align with human intentions and can evolve through iterative feedback. We introduce Iterative Keypoint Reward (IKER), a visually grounded, Python-based reward function that serves as a dynamic task specification. Our framework leverages VLMs to generate and refine… ▽ More

    Submitted 18 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: ICRA 2025, Project Page: https://iker-robot.github.io/

  25. arXiv:2502.03540  [pdf, other

    cs.LG cs.AI

    Path Planning for Masked Diffusion Model Sampling

    Authors: Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel, Jarrid Rector-Brooks, Sherwood Yao, Avishek Joey Bose, Alexander Tong, Pranam Chatterjee

    Abstract: Any order generation of discrete data using masked diffusion models (MDMs) offers a compelling alternative to traditional autoregressive models, especially in domains that lack a natural causal ordering of data. However, current popular MDMs depart from their successful continuous diffusion model counterparts with simplified masked inference wherein unmasked tokens cannot be iteratively refined --… ▽ More

    Submitted 27 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  26. arXiv:2501.17286  [pdf

    physics.med-ph cs.AI cs.CL

    Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology

    Authors: Peilong Wang, Zhengliang Liu, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Xiang Li, Quanzheng Li, Brady S. Laughlin, Diego Santos Toesca, Sujay A. Vora, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

    Abstract: Background: The radiation oncology clinical practice involves many steps relying on the dynamic interplay of abundant text data. Large language models have displayed remarkable capabilities in processing complex text information. But their direct applications in specific fields like radiation oncology remain underexplored. Purpose: This study aims to investigate whether fine-tuning LLMs with dom… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  27. arXiv:2501.16680  [pdf, ps, other

    cs.CR cs.DS

    Differentially Private Set Representations

    Authors: Sarvar Patel, Giuseppe Persiano, Joon Young Seo, Kevin Yeo

    Abstract: We study the problem of differentially private (DP) mechanisms for representing sets of size $k$ from a large universe. Our first construction creates $(ε,δ)$-DP representations with error probability of $1/(e^ε+ 1)$ using space at most $1.05 k ε\cdot \log(e)$ bits where the time to construct a representation is $O(k \log(1/δ))$ while decoding time is $O(\log(1/δ))$. We also present a second algor… ▽ More

    Submitted 21 July, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

    Comments: Appears at NeurIPS 2024

  28. arXiv:2501.16309  [pdf, other

    physics.med-ph cs.AI

    Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology

    Authors: Meiyun Cao, Shaw Hu, Jason Sharp, Edward Clouser, Jason Holmes, Linda L. Lam, Xiaoning Ding, Diego Santos Toesca, Wendy S. Lindholm, Samir H. Patel, Sujay A. Vora, Peilong Wang, Wei Liu

    Abstract: Purpose: This study aims to use a large language model (LLM) to automate the generation of summaries from the CT simulation orders and evaluate its performance. Materials and Methods: A total of 607 CT simulation orders for patients were collected from the Aria database at our institution. A locally hosted Llama 3.1 405B model, accessed via the Application Programming Interface (API) service, wa… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  29. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  30. arXiv:2501.03277  [pdf, other

    cs.CL

    HonkaiChat: Companions from Anime that feel alive!

    Authors: Yueze Liu, Yichi Zhang, Shaan Om Patel, Zhaoyang Zhu, Shilong Guo

    Abstract: Modern conversational agents, including anime-themed chatbots, are frequently reactive and personality-driven but fail to capture the dynamic nature of human interactions. We propose an event-driven dialogue framework to address these limitations by embedding dynamic events in conversation prompts and fine-tuning models on character-specific data. Evaluations on GPT-4 and comparisons with industry… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 5 pages, 4 figures. This is a preprint. Not yet submitted to a journal or conference. More iterated versions to be updated

  31. arXiv:2412.11964  [pdf, other

    stat.ML cs.LG

    BetaExplainer: A Probabilistic Method to Explain Graph Neural Networks

    Authors: Whitney Sloneker, Shalin Patel, Michael Wang, Lorin Crawford, Ritambhara Singh

    Abstract: Graph neural networks (GNNs) are powerful tools for conducting inference on graph data but are often seen as "black boxes" due to difficulty in extracting meaningful subnetworks driving predictive performance. Many interpretable GNN methods exist, but they cannot quantify uncertainty in edge weights and suffer in predictive accuracy when applied to challenging graph structures. In this work, we pr… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  32. arXiv:2412.09915  [pdf, other

    cs.IT

    Two-dimensional Constacyclic Codes over $\mathbb{F}_q$

    Authors: Vidya Sagar, Shikha Patel, Shayan Srinivasa Garani

    Abstract: We consider two-dimensional $(λ_1, λ_2)$-constacyclic codes over $\mathbb{F}_{q}$ of area $M N$, where $q$ is some power of prime $p$ with $\gcd(M,p)=1$ and $\gcd(N,p)=1$. With the help of common zero (CZ) set, we characterize 2-D constacyclic codes. Further, we provide an algorithm to construct an ideal basis of these codes by using their essential common zero (ECZ) sets. We describe the dual of… ▽ More

    Submitted 1 May, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: 25 pages, 1 figure

  33. arXiv:2412.08984  [pdf, other

    q-bio.QM cs.LG

    Predicting Emergency Department Visits for Patients with Type II Diabetes

    Authors: Javad M Alizadeh, Jay S Patel, Gabriel Tajeu, Yuzhou Chen, Ilene L Hollin, Mukesh K Patel, Junchao Fei, Huanmei Wu

    Abstract: Over 30 million Americans are affected by Type II diabetes (T2D), a treatable condition with significant health risks. This study aims to develop and validate predictive models using machine learning (ML) techniques to estimate emergency department (ED) visits among patients with T2D. Data for these patients was obtained from the HealthShare Exchange (HSX), focusing on demographic details, diagnos… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: This manuscript has been accepted and presented at AI-PHSS 2024: The 2024 International Workshop on AI Applications in Public Health and Social Services in conjunction with the 22nd International Conference of Artificial Intelligence in Medicine (AIME 2024)

  34. arXiv:2412.06936  [pdf, other

    cs.CY cs.AI cs.LG

    Creating a Cooperative AI Policymaking Platform through Open Source Collaboration

    Authors: Aiden Lewington, Alekhya Vittalam, Anshumaan Singh, Anuja Uppuluri, Arjun Ashok, Ashrith Mandayam Athmaram, Austin Milt, Benjamin Smith, Charlie Weinberger, Chatanya Sarin, Christoph Bergmeir, Cliff Chang, Daivik Patel, Daniel Li, David Bell, Defu Cao, Donghwa Shin, Edward Kang, Edwin Zhang, Enhui Li, Felix Chen, Gabe Smithline, Haipeng Chen, Henry Gasztowtt, Hoon Shin , et al. (26 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) present significant risks and opportunities, requiring improved governance to mitigate societal harms and promote equitable benefits. Current incentive structures and regulatory delays may hinder responsible AI development and deployment, particularly in light of the transformative potential of large language models (LLMs). To address these challenges, we p… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  35. arXiv:2411.18855  [pdf, other

    cs.CV cs.LG cs.MM

    Improving Accuracy and Generalization for Efficient Visual Tracking

    Authors: Ram Zaveri, Shivang Patel, Yu Gu, Gianfranco Doretto

    Abstract: Efficient visual trackers overfit to their training distributions and lack generalization abilities, resulting in them performing well on their respective in-distribution (ID) test sets and not as well on out-of-distribution (OOD) sequences, imposing limitations to their deployment in-the-wild under constrained resources. We introduce SiamABC, a highly efficient Siamese tracker that significantly… ▽ More

    Submitted 6 February, 2025; v1 submitted 27 November, 2024; originally announced November 2024.

    Comments: WACV 2025

  36. arXiv:2411.07553  [pdf, ps, other

    cs.DS

    A Simple Algorithm for Dynamic Carpooling with Recourse

    Authors: Yuval Efron, Shyamal Patel, Cliff Stein

    Abstract: We give an algorithm for the fully-dynamic carpooling problem with recourse: Edges arrive and depart online from a graph $G$ with $n$ nodes according to an adaptive adversary. Our goal is to maintain an orientation $H$ of $G$ that keeps the discrepancy, defined as $\max_{v \in V} |\text{deg}_H^+(v) - \text{deg}_H^-(v)|$, small at all times. We present a simple algorithm and analysis for this probl… ▽ More

    Submitted 22 November, 2024; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: To appear in SOSA 2025

  37. arXiv:2411.04303  [pdf

    cs.CY stat.AP

    Analysis of Droughts and Their Intensities in California from 2000 to 2020

    Authors: Ujjwal, Shikha C. Patel, Bansari K. Shah, Nicholas Ogbonna, Huthaifa I Ashqar

    Abstract: Drought has been perceived as a persistent threat globally and the complex mechanism of various factors contributing to its emergence makes it more troublesome to understand. Droughts and their severity trends have been a point of concern in the USA as well, since the economic impact of droughts has been substantial, especially in parts that contribute majorly to US agriculture. California is the… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  38. arXiv:2410.13638  [pdf, other

    cs.LG cs.AI cs.HC

    Scaling Wearable Foundation Models

    Authors: Girish Narayanswamy, Xin Liu, Kumar Ayush, Yuzhe Yang, Xuhai Xu, Shun Liao, Jake Garrison, Shyam Tailor, Jake Sunshine, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Jiening Zhan, Mark Malhotra, Shwetak Patel, Samy Abdel-Ghaffar, Daniel McDuff

    Abstract: Wearable sensors have become ubiquitous thanks to a variety of health tracking features. The resulting continuous and longitudinal measurements from everyday life generate large volumes of data; however, making sense of these observations for scientific and actionable insights is non-trivial. Inspired by the empirical success of generative modeling, where large neural networks learn powerful repre… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  39. arXiv:2410.12935  [pdf, other

    quant-ph cond-mat.stat-mech cs.LG math.OC

    Quantum Boltzmann machine learning of ground-state energies

    Authors: Dhrumil Patel, Daniel Koch, Saahil Patel, Mark M. Wilde

    Abstract: Estimating the ground-state energy of Hamiltonians is a fundamental task for which it is believed that quantum computers can be helpful. Several approaches have been proposed toward this goal, including algorithms based on quantum phase estimation and hybrid quantum-classical optimizers involving parameterized quantum circuits, the latter falling under the umbrella of the variational quantum eigen… ▽ More

    Submitted 30 October, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: v2: 7 pages of main text, 29 pages of supplementary material, 5 figures

    Report number: AFRL-2024-0949

  40. arXiv:2410.11176  [pdf

    cs.CV cs.AI

    Improving Bias in Facial Attribute Classification: A Combined Impact of KL Divergence induced Loss Function and Dual Attention

    Authors: Shweta Patel, Dakshina Ranjan Kisku

    Abstract: Ensuring that AI-based facial recognition systems produce fair predictions and work equally well across all demographic groups is crucial. Earlier systems often exhibited demographic bias, particularly in gender and racial classification, with lower accuracy for women and individuals with darker skin tones. To tackle this issue and promote fairness in facial recognition, researchers have introduce… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 15 pages, 9 figures, 5 tables

    MSC Class: 68T06 ACM Class: I.2.10

  41. arXiv:2410.01657  [pdf, other

    cs.DC cs.LG physics.comp-ph

    Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling

    Authors: Shivam Barwey, Riccardo Balin, Bethany Lusch, Saumil Patel, Ramesh Balakrishnan, Pinaki Pal, Romit Maulik, Venkatram Vishwanath

    Abstract: This work develops a distributed graph neural network (GNN) methodology for mesh-based modeling applications using a consistent neural message passing layer. As the name implies, the focus is on enabling scalable operations that satisfy physical consistency via halo nodes at sub-graph boundaries. Here, consistency refers to the fact that a GNN trained and evaluated on one rank (one large graph) is… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  42. arXiv:2410.00016  [pdf, other

    cs.CY

    A Dataset of the Operating Station Heat Rate for 806 Indian Coal Plant Units using Machine Learning

    Authors: Yifu Ding, Jansen Wong, Serena Patel, Dharik Mallapragada, Guiyan Zang, Robert Stoner

    Abstract: India aims to achieve net-zero emissions by 2070 and has set an ambitious target of 500 GW of renewable power generation capacity by 2030. Coal plants currently contribute to more than 60\% of India's electricity generation in 2022. Upgrading and decarbonizing high-emission coal plants became a pressing energy issue. A key technical parameter for coal plants is the operating station heat rate (SHR… ▽ More

    Submitted 14 September, 2024; originally announced October 2024.

  43. arXiv:2409.18642  [pdf

    cs.CR cs.AI cs.CV

    Enhanced Convolution Neural Network with Optimized Pooling and Hyperparameter Tuning for Network Intrusion Detection

    Authors: Ayush Kumar Sharma, Sourav Patel, Supriya Bharat Wakchaure, Abirami S

    Abstract: Network Intrusion Detection Systems (NIDS) are essential for protecting computer networks from malicious activities, including Denial of Service (DoS), Probing, User-to-Root (U2R), and Remote-to-Local (R2L) attacks. Without effective NIDS, networks are vulnerable to significant security breaches and data loss. Machine learning techniques provide a promising approach to enhance NIDS by automating t… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: 7 Pages , 2 figures , 4 Tables , Conference paper

  44. arXiv:2409.18301  [pdf, other

    cs.CV cs.AI cs.LG

    Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection

    Authors: Lalith Bharadwaj Baru, Rohit Boddeda, Shilhora Akshay Patel, Sai Mohan Gajapaka

    Abstract: The evolution of digital image manipulation, particularly with the advancement of deep generative models, significantly challenges existing deepfake detection methods, especially when the origin of the deepfake is obscure. To tackle the increasing complexity of these forgeries, we propose \textbf{Wavelet-CLIP}, a deepfake detection framework that integrates wavelet transforms with features derived… ▽ More

    Submitted 7 January, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: 9 Pages, 2 Figures, 3 Tables

  45. arXiv:2409.18290  [pdf, other

    cs.AI cs.CY

    Retrospective Comparative Analysis of Prostate Cancer In-Basket Messages: Responses from Closed-Domain LLM vs. Clinical Teams

    Authors: Yuexing Hao, Jason M. Holmes, Jared Hobson, Alexandra Bennett, Daniel K. Ebner, David M. Routman, Satomi Shiraishi, Samir H. Patel, Nathan Y. Yu, Chris L. Hallemeier, Brooke E. Ball, Mark R. Waddle, Wei Liu

    Abstract: In-basket message interactions play a crucial role in physician-patient communication, occurring during all phases (pre-, during, and post) of a patient's care journey. However, responding to these patients' inquiries has become a significant burden on healthcare workflows, consuming considerable time for clinical care teams. To address this, we introduce RadOnc-GPT, a specialized Large Language M… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  46. arXiv:2409.13571  [pdf, other

    cs.MA cs.AI

    Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling

    Authors: Jaeyeon Jang, Diego Klabjan, Han Liu, Nital S. Patel, Xiuqi Li, Balakrishnan Ananthanarayanan, Husam Dauod, Tzung-Han Juang

    Abstract: Real-time dynamic scheduling is a crucial but notoriously challenging task in modern manufacturing processes due to its high decision complexity. Recently, reinforcement learning (RL) has been gaining attention as an impactful technique to handle this challenge. However, classical RL methods typically rely on human-made dispatching rules, which are not suitable for large-scale factory-wide schedul… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  47. arXiv:2409.10325  [pdf, other

    cs.DC cs.AR cs.ET physics.data-an

    PASS: An Asynchronous Probabilistic Processor for Next Generation Intelligence

    Authors: Saavan Patel, Philip Canoza, Adhiraj Datar, Steven Lu, Chirag Garg, Sayeef Salahuddin

    Abstract: New computing paradigms are required to solve the most challenging computational problems where no exact polynomial time solution exists.Probabilistic Ising Accelerators has gained promise on these problems with the ability to model complex probability distributions and find ground states of intractable problems. In this context, we have demonstrated the Parallel Asynchronous Stochastic Sampler (P… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 13 page main text, 5 main figures, 21 pages supplementary and methods, 7 supplementary figures, 2 supplementary tables

  48. arXiv:2409.07769  [pdf, other

    physics.flu-dyn cs.CE cs.LG physics.comp-ph

    Mesh-based Super-Resolution of Fluid Flows with Multiscale Graph Neural Networks

    Authors: Shivam Barwey, Pinaki Pal, Saumil Patel, Riccardo Balin, Bethany Lusch, Venkatram Vishwanath, Romit Maulik, Ramesh Balakrishnan

    Abstract: A graph neural network (GNN) approach is introduced in this work which enables mesh-based three-dimensional super-resolution of fluid flows. In this framework, the GNN is designed to operate not on the full mesh-based field at once, but on localized meshes of elements (or cells) directly. To facilitate mesh-based GNN representations in a manner similar to spectral (or finite) element discretizatio… ▽ More

    Submitted 1 May, 2025; v1 submitted 12 September, 2024; originally announced September 2024.

  49. arXiv:2408.05692  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation

    Authors: Koushik Biswas, Ridal Pal, Shaswat Patel, Debesh Jha, Meghana Karri, Amit Reza, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci

    Abstract: Accurately segmenting different organs from medical images is a critical prerequisite for computer-assisted diagnosis and intervention planning. This study proposes a deep learning-based approach for segmenting various organs from CT and MRI scans and classifying diseases. Our study introduces a novel technique integrating momentum within residual blocks for enhanced training dynamics in medical i… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: 8 pages

  50. arXiv:2408.04948  [pdf, other

    cs.CL cs.LG q-fin.ST stat.AP stat.ML

    HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction

    Authors: Bhaskarjit Sarmah, Benika Hall, Rohan Rao, Sunil Patel, Stefano Pasquali, Dhagash Mehta

    Abstract: Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 9 pages, 2 figures, 5 tables