这是indexloc提供的服务,不要输入任何密码
Skip to main content

Showing 1–50 of 122 results for author: Shah, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04133  [pdf, ps, other

    math.OC cs.DS cs.LG

    Online Convex Optimization with Switching Cost with Only One Single Gradient Evaluation

    Authors: Harsh Shah, Purna Chandrasekhar, Rahul Vaze

    Abstract: Online convex optimization with switching cost is considered under the frugal information setting where at time $t$, before action $x_t$ is taken, only a single function evaluation and a single gradient is available at the previously chosen action $x_{t-1}$ for either the current cost function $f_t$ or the most recent cost function $f_{t-1}$. When the switching cost is linear, online algorithms wi… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: 9 pages, 2 figures

  2. arXiv:2506.17779  [pdf, ps, other

    cs.LG cs.AI

    Toward Autonomous UI Exploration: The UIExplorer Benchmark

    Authors: Andrei Cristian Nica, Akshaya Vishnu Kudlu Shanbhogue, Harshil Shah, Aleix Cambray, Tudor Berariu, Lucas Maystre, David Barber

    Abstract: Autonomous agents must know how to explore user interfaces (UIs) for reliable task solving, yet systematic evaluation of this crucial phase is lacking. We introduce UIExplore-Bench, the first benchmark explicitly dedicated to UI exploration. The benchmark evaluates agents with either Structured mode (granting access to layout information like DOM trees) or Screen mode (relying on GUI-only observat… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  3. arXiv:2506.13170  [pdf, ps, other

    cs.CR

    Dual Protection Ring: User Profiling Via Differential Privacy and Service Dissemination Through Private Information Retrieval

    Authors: Imdad Ullah, Najm Hassan, Tariq Ahamed Ahangar, Zawar Hussain Shah, Mehregan Mahdavi, Andrew Levula

    Abstract: User profiling is crucial in providing personalised services, as it relies on analysing user behaviour and preferences to deliver targeted services. This approach enhances user experience and promotes heightened engagement. Nevertheless, user profiling also gives rise to noteworthy privacy considerations due to the extensive tracking and monitoring of personal data, potentially leading to surveill… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.07494  [pdf, ps, other

    cs.SD cs.CY eess.AS

    Towards Energy-Efficient and Low-Latency Voice-Controlled Smart Homes: A Proposal for Offline Speech Recognition and IoT Integration

    Authors: Peng Huang, Imdad Ullah, Xiaotong Wei, Tariq Ahamed Ahanger, Najm Hassan, Zawar Hussain Shah

    Abstract: The smart home systems, based on AI speech recognition and IoT technology, enable people to control devices through verbal commands and make people's lives more efficient. However, existing AI speech recognition services are primarily deployed on cloud platforms on the Internet. When users issue a command, speech recognition devices like ``Amazon Echo'' will post a recording through numerous netwo… ▽ More

    Submitted 11 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  5. arXiv:2506.06574  [pdf, ps, other

    cs.AI cs.MA

    The Optimization Paradox in Clinical AI Multi-Agent Systems

    Authors: Suhana Bedi, Iddah Mlauzi, Daniel Shin, Sanmi Koyejo, Nigam H. Shah

    Abstract: Multi-agent artificial intelligence systems are increasingly deployed in clinical settings, yet the relationship between component-level optimization and system-wide performance remains poorly understood. We evaluated this relationship using 2,400 real patient cases from the MIMIC-CDM dataset across four abdominal pathologies (appendicitis, pancreatitis, cholecystitis, diverticulitis), decomposing… ▽ More

    Submitted 11 June, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

  6. arXiv:2506.05836  [pdf, ps, other

    cs.SE

    Analysis of cost-efficiency of serverless approaches

    Authors: Nakhat Syeda, Harsh Shah, Rajvinder Singh, Suraj Jaju, Sumedha Kumar, Gourav Chhabra, Maria Spichkova

    Abstract: In this paper, we present a survey of research studies related to the cost-effectiveness of serverless approach and corresponding cost savings. We conducted a systematic literature review using Google Scholar search engine, covering the period from 2010 to 2024. We identified 34 related studies, from which we extracted 17 parameters that might influence the relative cost savings of applying the se… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  7. arXiv:2505.23802  [pdf, ps, other

    cs.CL cs.AI

    MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks

    Authors: Suhana Bedi, Hejie Cui, Miguel Fuentes, Alyssa Unell, Michael Wornow, Juan M. Banda, Nikesh Kotecha, Timothy Keyes, Yifan Mai, Mert Oez, Hao Qiu, Shrey Jain, Leonardo Schettini, Mehr Kashyap, Jason Alan Fries, Akshay Swaminathan, Philip Chung, Fateme Nateghi, Asad Aali, Ashwin Nayak, Shivam Vedak, Sneha S. Jain, Birju Patel, Oluseyi Fayanju, Shreya Shah , et al. (56 additional authors not shown)

    Abstract: While large language models (LLMs) achieve near-perfect scores on medical licensing exams, these evaluations inadequately reflect the complexity and diversity of real-world clinical practice. We introduce MedHELM, an extensible evaluation framework for assessing LLM performance for medical tasks with three key contributions. First, a clinician-validated taxonomy spanning 5 categories, 22 subcatego… ▽ More

    Submitted 2 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  8. arXiv:2504.16353  [pdf

    cs.CL cs.AI

    Transformer-Based Extraction of Statutory Definitions from the U.S. Code

    Authors: Arpana Hosabettu, Harsh Shah

    Abstract: Automatic extraction of definitions from legal texts is critical for enhancing the comprehension and clarity of complex legal corpora such as the United States Code (U.S.C.). We present an advanced NLP system leveraging transformer-based architectures to automatically extract defined terms, their definitions, and their scope from the U.S.C. We address the challenges of automatically identifying le… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 7 pages, to be published in IEEE AIIoT 2025

  9. arXiv:2504.16062  [pdf, other

    cs.RO cs.CV

    ForesightNav: Learning Scene Imagination for Efficient Exploration

    Authors: Hardik Shah, Jiaxu Xing, Nico Messikommer, Boyang Sun, Marc Pollefeys, Davide Scaramuzza

    Abstract: Understanding how humans leverage prior knowledge to navigate unseen environments while making exploratory decisions is essential for developing autonomous robots with similar abilities. In this work, we propose ForesightNav, a novel exploration strategy inspired by human imagination and reasoning. Our approach equips robotic agents with the capability to predict contextual information, such as oc… ▽ More

    Submitted 5 May, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, 2025

  10. arXiv:2501.12370  [pdf, ps, other

    cs.LG cs.AI

    Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

    Authors: Samira Abnar, Harshay Shah, Dan Busbridge, Alaaeldin Mohamed Elnouby Ali, Josh Susskind, Vimal Thilak

    Abstract: Scaling the capacity of language models has consistently proven to be a reliable approach for improving performance and unlocking new capabilities. Capacity can be primarily defined by two dimensions: the number of model parameters and the compute per example. While scaling typically involves increasing both, the precise interplay between these factors and their combined contribution to overall ca… ▽ More

    Submitted 2 July, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

  11. arXiv:2501.00031  [pdf, other

    cs.CL

    Distilling Large Language Models for Efficient Clinical Information Extraction

    Authors: Karthik S. Vedula, Annika Gupta, Akshay Swaminathan, Ivan Lopez, Suhana Bedi, Nigam H. Shah

    Abstract: Large language models (LLMs) excel at clinical information extraction but their computational demands limit practical deployment. Knowledge distillation--the process of transferring knowledge from larger to smaller models--offers a potential solution. We evaluate the performance of distilled BERT models, which are approximately 1,000 times smaller than modern LLMs, for clinical named entity recogn… ▽ More

    Submitted 20 December, 2024; originally announced January 2025.

    Comments: 19 pages, 1 figure, 10 tables

    MSC Class: 68T50 ACM Class: I.2.7

  12. arXiv:2412.16178  [pdf, other

    cs.LG cs.AI cs.CE

    Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs

    Authors: Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez, Ethan Steinberg, Jason Alan Fries, Christopher Re, Sanmi Koyejo, Nigam H. Shah

    Abstract: Foundation Models (FMs) trained on Electronic Health Records (EHRs) have achieved state-of-the-art results on numerous clinical prediction tasks. However, most existing EHR FMs have context windows of <1k tokens. This prevents them from modeling full patient EHRs which can exceed 10k's of events. Recent advancements in subquadratic long-context architectures (e.g., Mamba) offer a promising solutio… ▽ More

    Submitted 18 March, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

  13. arXiv:2412.10986  [pdf, other

    cs.CE

    On Scalable Design for User-Centric Multi-Modal Shared E-Mobility Systems using MILP and Modified Dijkstra's Algorithm

    Authors: Maqsood Hussain Shah, Ji Li, Mingming Liu

    Abstract: In the rapidly evolving landscape of urban transportation, shared e-mobility services have emerged as a sustainable solution to meet growing demand for flexible, eco-friendly travel. However, the existing literature lacks a comprehensive multi-modal optimization framework with focus on user preferences and real-world constraints. This paper presents a multi-modal optimization framework for shared… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: The paper has been accepted by the 2025 IEEE Symposium Series on Computational Intelligence

  14. arXiv:2412.06222  [pdf, other

    cs.GT econ.TH

    Blotto on the Ballot: A Ballot Stuffing Blotto Game

    Authors: Harsh Shah, Jayakrishnan Nair, D Manjunath, Narayan Mandayam

    Abstract: We consider the following Colonel Blotto game between parties $P_1$ and $P_A.$ $P_1$ deploys a non negative number of troops across $J$ battlefields, while $P_A$ chooses $K,$ $K < J,$ battlefields to remove all of $P_1$'s troops from the chosen battlefields. $P_1$ has the objective of maximizing the number of surviving troops while $P_A$ wants to minimize it. Drawing an analogy with ballot stuffin… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 11 pages, 5 figures

  15. arXiv:2411.09361  [pdf, other

    cs.CV cs.LG

    Time-to-Event Pretraining for 3D Medical Imaging

    Authors: Zepeng Huo, Jason Alan Fries, Alejandro Lozano, Jeya Maria Jose Valanarasu, Ethan Steinberg, Louis Blankemeier, Akshay S. Chaudhari, Curtis Langlotz, Nigam H. Shah

    Abstract: With the rise of medical foundation models and the growing availability of imaging data, scalable pretraining techniques offer a promising way to identify imaging biomarkers predictive of future disease risk. While current self-supervised methods for 3D medical imaging models capture local structural features like organ morphology, they fail to link pixel biomarkers with long-term health outcomes… ▽ More

    Submitted 19 March, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 34 pages, 19 figures

  16. arXiv:2410.09455  [pdf, other

    cs.CL cs.AI cs.LG

    VERITAS-NLI : Validation and Extraction of Reliable Information Through Automated Scraping and Natural Language Inference

    Authors: Arjun Shah, Hetansh Shah, Vedica Bafna, Charmi Khandor, Sindhu Nair

    Abstract: In today's day and age where information is rapidly spread through online platforms, the rise of fake news poses an alarming threat to the integrity of public discourse, societal trust, and reputed news sources. Classical machine learning and Transformer-based models have been extensively studied for the task of fake news detection, however they are hampered by their reliance on training data and… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: Preprint, 15 pages, 7 figures

    ACM Class: I.2.1; I.2.7

  17. arXiv:2409.14689  [pdf, other

    cs.IR cs.LG

    EDGE-Rec: Efficient and Data-Guided Edge Diffusion For Recommender Systems Graphs

    Authors: Utkarsh Priyam, Hemit Shah, Edoardo Botta

    Abstract: Most recommender systems research focuses on binary historical user-item interaction encodings to predict future interactions. User features, item features, and interaction strengths remain largely under-utilized in this space or only indirectly utilized, despite proving largely effective in large-scale production recommendation systems. We propose a new attention mechanism, loosely based on the p… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: 6 pages, 13 figures

  18. arXiv:2409.09095  [pdf, other

    cs.LG cs.DB

    meds_reader: A fast and efficient EHR processing library

    Authors: Ethan Steinberg, Michael Wornow, Suhana Bedi, Jason Alan Fries, Matthew B. A. McDermott, Nigam H. Shah

    Abstract: The growing demand for machine learning in healthcare requires processing increasingly large electronic health record (EHR) datasets, but existing pipelines are not computationally efficient or scalable. In this paper, we introduce meds_reader, an optimized Python package for efficient EHR data processing that is designed to take advantage of many intrinsic properties of EHR data for improved spee… ▽ More

    Submitted 14 November, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

    Comments: Findings paper presented at Machine Learning for Health (ML4H) symposium 2024, December 15-16, 2024, Vancouver, Canada, 8 pages

  19. arXiv:2409.00729  [pdf, other

    cs.LG cs.CL

    ContextCite: Attributing Model Generation to Context

    Authors: Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry

    Abstract: How do language models use information provided as context when generating a response? Can we infer whether a particular generated statement is actually grounded in the context, a misinterpretation, or fabricated? To help answer these questions, we introduce the problem of context attribution: pinpointing the parts of the context (if any) that led a model to generate a particular statement. We the… ▽ More

    Submitted 13 September, 2024; v1 submitted 1 September, 2024; originally announced September 2024.

  20. arXiv:2408.07245  [pdf, other

    cs.LG

    q-exponential family for policy optimization

    Authors: Lingwei Zhu, Haseeb Shah, Han Wang, Yukie Nagai, Martha White

    Abstract: Policy optimization methods benefit from a simple and tractable policy parametrization, usually the Gaussian for continuous action spaces. In this paper, we consider a broader policy family that remains tractable: the $q$-exponential family. This family of policies is flexible, allowing the specification of both heavy-tailed policies ($q>1$) and light-tailed policies ($q<1$). This paper examines t… ▽ More

    Submitted 24 January, 2025; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: accepted by ICLR 2025

  21. arXiv:2407.00541  [pdf

    cs.CL cs.AI cs.IR

    Answering real-world clinical questions using large language model based systems

    Authors: Yen Sia Low, Michael L. Jackson, Rebecca J. Hyde, Robert E. Brown, Neil M. Sanghavi, Julian D. Baldwin, C. William Pike, Jananee Muralidharan, Gavin Hui, Natasha Alexander, Hadeel Hassan, Rahul V. Nene, Morgan Pike, Courtney J. Pokrzywa, Shivam Vedak, Adam Paul Yan, Dong-han Yao, Amy R. Zipursky, Christina Dinh, Philip Ballentine, Dan C. Derieg, Vladimir Polony, Rehan N. Chawdry, Jordan Davies, Brigham B. Hyde , et al. (2 additional authors not shown)

    Abstract: Evidence to guide healthcare decisions is often limited by a lack of relevant and trustworthy literature as well as difficulty in contextualizing existing research for a specific patient. Large language models (LLMs) could potentially address both challenges by either summarizing published literature or generating new studies based on real-world data (RWD). We evaluated the ability of five LLM-bas… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 28 pages (2 figures, 3 tables) inclusive of 8 pages of supplemental materials (4 supplemental figures and 4 supplemental tables)

  22. arXiv:2406.13264  [pdf, other

    cs.AI cs.LG cs.SE

    WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks

    Authors: Michael Wornow, Avanika Narayan, Ben Viggiano, Ishan S. Khare, Tathagat Verma, Tibor Thompson, Miguel Angel Fuentes Hernandez, Sudharsan Sundar, Chloe Trujillo, Krrish Chawla, Rongfei Lu, Justin Shen, Divya Nagaraj, Joshua Martinez, Vardhan Agrawal, Althea Hudson, Nigam H. Shah, Christopher Re

    Abstract: Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task - full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This f… ▽ More

    Submitted 10 October, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  23. arXiv:2406.06512  [pdf, other

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston , et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  24. arXiv:2405.03710  [pdf, other

    cs.SE cs.AI cs.LG

    Automating the Enterprise with Foundation Models

    Authors: Michael Wornow, Avanika Narayan, Krista Opsahl-Ong, Quinn McIntyre, Nigam H. Shah, Christopher Re

    Abstract: Automating enterprise workflows could unlock $4 trillion/year in productivity gains. Despite being of interest to the data management community for decades, the ultimate vision of end-to-end workflow automation has remained elusive. Current solutions rely on process mining and robotic process automation (RPA), in which a bot is hard-coded to follow a set of predefined rules for completing a workfl… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  25. arXiv:2404.11534  [pdf, other

    cs.LG cs.AI stat.ML

    Decomposing and Editing Predictions by Modeling Model Computation

    Authors: Harshay Shah, Andrew Ilyas, Aleksander Madry

    Abstract: How does the internal computation of a machine learning model transform inputs into predictions? In this paper, we introduce a task called component modeling that aims to address this question. The goal of component modeling is to decompose an ML model's prediction in terms of its components -- simple functions (e.g., convolution filters, attention heads) that are the "building blocks" of model co… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  26. arXiv:2403.18822  [pdf

    q-fin.TR cs.LG

    Enhancing Financial Data Visualization for Investment Decision-Making

    Authors: Nisarg Patel, Harmit Shah, Kishan Mewada

    Abstract: Navigating the intricate landscape of financial markets requires adept forecasting of stock price movements. This paper delves into the potential of Long Short-Term Memory (LSTM) networks for predicting stock dynamics, with a focus on discerning nuanced rise and fall patterns. Leveraging a dataset from the New York Stock Exchange (NYSE), the study incorporates multiple features to enhance LSTM's c… ▽ More

    Submitted 9 December, 2023; originally announced March 2024.

    Comments: 5 pages, 10 figures

  27. Data-driven Energy Consumption Modelling for Electric Micromobility using an Open Dataset

    Authors: Yue Ding, Sen Yan, Maqsood Hussain Shah, Hongyuan Fang, Ji Li, Mingming Liu

    Abstract: The escalating challenges of traffic congestion and environmental degradation underscore the critical importance of embracing E-Mobility solutions in urban spaces. In particular, micro E-Mobility tools such as E-scooters and E-bikes, play a pivotal role in this transition, offering sustainable alternatives for urban commuters. However, the energy consumption patterns for these tools are a critical… ▽ More

    Submitted 8 November, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 4 tables. This manuscript has been accepted by the IEEE ITEC 2024

  28. arXiv:2403.07964  [pdf, other

    cs.AI

    Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services

    Authors: Maqsood Hussain Shah, Yue Ding, Shaoshu Zhu, Yingqi Gu, Mingming Liu

    Abstract: With the rising concern over transportation emissions and pollution on a global scale, shared electric mobility services like E-cars, E-bikes, and E-scooters have emerged as promising solutions to mitigate these pressing challenges. However, existing shared E-mobility services exhibit critical design deficiencies, including insufficient service integration, imprecise energy consumption forecasting… ▽ More

    Submitted 1 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures

  29. arXiv:2403.07911  [pdf

    cs.CY cs.AI

    Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems

    Authors: Alison Callahan, Duncan McElfresh, Juan M. Banda, Gabrielle Bunney, Danton Char, Jonathan Chen, Conor K. Corbin, Debadutta Dash, Norman L. Downing, Sneha S. Jain, Nikesh Kotecha, Jonathan Masterson, Michelle M. Mello, Keith Morse, Srikar Nallan, Abby Pandya, Anurang Revri, Aditya Sharma, Christopher Sharp, Rahul Thapa, Michael Wornow, Alaa Youssef, Michael A. Pfeffer, Nigam H. Shah

    Abstract: The impact of using artificial intelligence (AI) to guide patient care or operational processes is an interplay of the AI model's output, the decision-making protocol based on that output, and the capacity of the stakeholders involved to take the necessary subsequent action. Estimating the effects of this interplay before deployment, and studying it in real time afterwards, are essential to bridge… ▽ More

    Submitted 14 March, 2024; v1 submitted 26 February, 2024; originally announced March 2024.

  30. GNSS Positioning using Cost Function Regulated Multilateration and Graph Neural Networks

    Authors: Amir Jalalirad, Davide Belli, Bence Major, Songwon Jee, Himanshu Shah, Will Morrison

    Abstract: In urban environments, where line-of-sight signals from GNSS satellites are frequently blocked by high-rise objects, GNSS receivers are subject to large errors in measuring satellite ranges. Heuristic methods are commonly used to estimate these errors and reduce the impact of noisy measurements on localization accuracy. In our work, we replace these error estimation heuristics with a deep learning… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Published in The Proceedings of the Institute of Navigation GNSS+ 2023

  31. arXiv:2402.05125  [pdf, other

    cs.CL cs.AI

    Zero-Shot Clinical Trial Patient Matching with LLMs

    Authors: Michael Wornow, Alejandro Lozano, Dev Dash, Jenelle Jindal, Kenneth W. Mahaffey, Nigam H. Shah

    Abstract: Matching patients to clinical trials is a key unsolved challenge in bringing new drugs to market. Today, identifying patients who meet a trial's eligibility criteria is highly manual, taking up to 1 hour per patient. Automated screening is challenging, however, as it requires understanding unstructured clinical text. Large language models (LLMs) offer a promising solution. In this work, we explore… ▽ More

    Submitted 10 April, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  32. Homogenization Effects of Large Language Models on Human Creative Ideation

    Authors: Barrett R. Anderson, Jash Hemant Shah, Max Kreminski

    Abstract: Large language models (LLMs) are now being used in a wide variety of contexts, including as creativity support tools (CSTs) intended to help their users come up with new ideas. But do LLMs actually support user creativity? We hypothesized that the use of an LLM as a CST might make the LLM's users feel more creative, and even broaden the range of ideas suggested by each individual user, but also ho… ▽ More

    Submitted 10 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to C&C 2024

  33. arXiv:2401.18040  [pdf

    cs.CL cs.AI

    Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability

    Authors: Navin Kamuni, Hardik Shah, Sathishkumar Chintala, Naveen Kunchakuri, Sujatha Alla Old Dominion

    Abstract: End-to-end multi-task dialogue systems are usually designed with separate modules for the dialogue pipeline. Among these, the policy module is essential for deciding what to do in response to user input. This policy is trained by reinforcement learning algorithms by taking advantage of an environment in which an agent receives feedback in the form of a reward signal. The current dialogue systems,… ▽ More

    Submitted 25 March, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: 6 pages, 1 figure, 18th IEEE International Conference on Semantic Computing

  34. arXiv:2401.03271  [pdf, other

    eess.IV cs.CV cs.IR

    Analysis and Validation of Image Search Engines in Histopathology

    Authors: Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, Nneka Comfere, Dennis Murphee, Aaron Mangold, Saba Yasir, Chady Meroueh, Lisa Boardman, Vijay H. Shah, Joaquin J. Garcia, H. R. Tizhoosh

    Abstract: Searching for similar images in archives of histology and histopathology images is a crucial task that may aid in patient matching for various purposes, ranging from triaging and diagnosis to prognosis and prediction. Whole slide images (WSIs) are highly detailed digital representations of tissue specimens mounted on glass slides. Matching WSI to WSI can serve as the critical method for patient ma… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Journal ref: IEEE Reviews in Biomedical Engineering, 2024

  35. Advancing Web Accessibility -- A guide to transitioning Design Systems from WCAG 2.0 to WCAG 2.1

    Authors: Hardik Shah

    Abstract: This research focuses on the critical process of upgrading a Design System from Web Content Accessibility Guidelines (WCAG) 2.0 to WCAG 2.1, which is an essential step in enhancing web accessibility. It emphasizes the importance of staying up to date on increasing accessibility requirements, as well as the critical function of Design Systems in supporting inclusion in digital environments. The art… ▽ More

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: 13 pages, 3 figures, 15th International Conference on Web services & Semantic Technology (WeST 2023)

  36. arXiv:2312.01624  [pdf, other

    cs.LG cs.AI

    GVFs in the Real World: Making Predictions Online for Water Treatment

    Authors: Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White

    Abstract: In this paper we investigate the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant. Developing such a prediction system is a critical step on the path to optimizing and automating water treatment. Before that, there are many questions to answer about the predictability of the data, suitable neural network architectures, how to overcome partial obse… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Published in Machine Learning (2023)

    Journal ref: Machine Learning (2023): 1-31

  37. Harnessing customized built-in elements -- Empowering Component-Based Software Engineering and Design Systems with HTML5 Web Components

    Authors: Hardik Shah

    Abstract: Customized built-in elements in HTML5 significantly transform web development. These elements enable developers to create unique HTML components tailored with specific design and purpose. Customized built-in elements enable developers to address the unique needs of web applications more quickly, supporting consistent user interfaces and experiences across diverse digital platforms. This study inve… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 13 pages, 5 figures, 15th International Conference on Web services & Semantic Technology (WeST 2023)

  38. arXiv:2311.12166  [pdf

    eess.SP cs.LG

    Creating Temporally Correlated High-Resolution Profiles of Load Injection Using Constrained Generative Adversarial Networks

    Authors: Hritik Gopal Shah, Behrouz Azimian, Anamitra Pal

    Abstract: Traditional smart meters, which measure energy usage every 15 minutes or more and report it at least a few hours later, lack the granularity needed for real-time decision-making. To address this practical problem, we introduce a new method using generative adversarial networks (GAN) that enforces temporal consistency on its high-resolution outputs via hard inequality constraints using convex optim… ▽ More

    Submitted 1 September, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 6 pages

  39. arXiv:2311.10798  [pdf, other

    cs.LG cs.AI cs.CV eess.IV

    INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis

    Authors: Shih-Cheng Huang, Zepeng Huo, Ethan Steinberg, Chia-Chun Chiang, Matthew P. Lungren, Curtis P. Langlotz, Serena Yeung, Nigam H. Shah, Jason A. Fries

    Abstract: Synthesizing information from multiple data sources plays a crucial role in the practice of modern medicine. Current applications of artificial intelligence in medicine often focus on single-modality data due to a lack of publicly available, multimodal medical datasets. To address this limitation, we introduce INSPECT, which contains de-identified longitudinal records from a large cohort of patien… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  40. arXiv:2311.10794  [pdf, other

    cs.CV

    Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

    Authors: Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

    Abstract: We introduce Style Tailoring, a recipe to finetune Latent Diffusion Models (LDMs) in a distinct domain with high visual quality, prompt alignment and scene diversity. We choose sticker image generation as the target domain, as the images significantly differ from photorealistic samples typically generated by large-scale LDMs. We start with a competent text-to-image model, like Emu, and show that r… ▽ More

    Submitted 3 October, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  41. arXiv:2311.02573  [pdf, other

    cs.DS cs.CV

    Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection

    Authors: Harsh Shah, Kashish Mittal, Ajit Rajwade

    Abstract: This work presents an adaptive group testing framework for the range-based high dimensional near neighbor search problem. Our method efficiently marks each item in a database as neighbor or non-neighbor of a query point, based on a cosine distance threshold without exhaustive search. Like other methods for large scale retrieval, our approach exploits the assumption that most of the items in the da… ▽ More

    Submitted 6 September, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: 28 pages (including Supplementary Material)

  42. arXiv:2310.04546  [pdf, other

    cs.CR

    Privacy-Preserving Financial Anomaly Detection via Federated Learning & Multi-Party Computation

    Authors: Sunpreet Arora, Andrew Beams, Panagiotis Chatzigiannis, Sebastian Meiser, Karan Patel, Srinivasan Raghuraman, Peter Rindal, Harshal Shah, Yizhen Wang, Yuhang Wu, Hao Yang, Mahdi Zamani

    Abstract: One of the main goals of financial institutions (FIs) today is combating fraud and financial crime. To this end, FIs use sophisticated machine-learning models trained using data collected from their customers. The output of machine learning models may be manually reviewed for critical use cases, e.g., determining the likelihood of a transaction being anomalous and the subsequent course of action.… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 12 pages

  43. arXiv:2310.02486  [pdf, other

    eess.IV cs.CV cs.LG

    OCU-Net: A Novel U-Net Architecture for Enhanced Oral Cancer Segmentation

    Authors: Ahmed Albishri, Syed Jawad Hussain Shah, Yugyung Lee, Rong Wang

    Abstract: Accurate detection of oral cancer is crucial for improving patient outcomes. However, the field faces two key challenges: the scarcity of deep learning-based image segmentation research specifically targeting oral cancer and the lack of annotated data. Our study proposes OCU-Net, a pioneering U-Net image segmentation architecture exclusively designed to detect oral cancer in hematoxylin and eosin… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  44. A Review on AI Algorithms for Energy Management in E-Mobility Services

    Authors: Sen Yan, Maqsood Hussain Shah, Ji Li, Noel O'Connor, Mingming Liu

    Abstract: E-mobility, or electric mobility, has emerged as a pivotal solution to address pressing environmental and sustainability concerns in the transportation sector. The depletion of fossil fuels, escalating greenhouse gas emissions, and the imperative to combat climate change underscore the significance of transitioning to electric vehicles (EVs). This paper seeks to explore the potential of artificial… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 tables, 1 figure

  45. arXiv:2308.14089  [pdf, other

    cs.CL cs.AI cs.LG

    MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

    Authors: Scott L. Fleming, Alejandro Lozano, William J. Haberkorn, Jenelle A. Jindal, Eduardo P. Reis, Rahul Thapa, Louis Blankemeier, Julian Z. Genkins, Ethan Steinberg, Ashwin Nayak, Birju S. Patel, Chia-Chun Chiang, Alison Callahan, Zepeng Huo, Sergios Gatidis, Scott J. Adams, Oluseyi Fayanju, Shreya J. Shah, Thomas Savage, Ethan Goh, Akshay S. Chaudhari, Nima Aghaeepour, Christopher Sharp, Michael A. Pfeffer, Percy Liang , et al. (5 additional authors not shown)

    Abstract: The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture… ▽ More

    Submitted 24 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  46. arXiv:2308.05127  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Data-Free Model Extraction Attacks in the Context of Object Detection

    Authors: Harshit Shah, Aravindhan G, Pavan Kulkarni, Yuvaraj Govidarajulu, Manojkumar Parmar

    Abstract: A significant number of machine learning models are vulnerable to model extraction attacks, which focus on stealing the models by using specially curated queries against the target model. This task is well accomplished by using part of the training data or a surrogate dataset to train a new model that mimics a target model in a white-box environment. In pragmatic situations, however, the target mo… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Submitted to The 14th International Conference on Computer Vision Systems (ICVS 2023), to be published in Springer, Lecture Notes in Computer Science

  47. arXiv:2307.08024  [pdf

    cs.AI

    Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review

    Authors: Chengmin Zhou, Chao Wang, Haseeb Hassan, Himat Shah, Bingding Huang, Pasi Fränti

    Abstract: Bayesian inference has many advantages in robotic motion planning over four perspectives: The uncertainty quantification of the policy, safety (risk-aware) and optimum guarantees of robot motions, data-efficiency in training of reinforcement learning, and reducing the sim2real gap when the robot is applied to real-world tasks. However, the application of Bayesian inference in robotic motion planni… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

  48. arXiv:2307.05463  [pdf, other

    cs.CV

    EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone

    Authors: Shraman Pramanick, Yale Song, Sayan Nag, Kevin Qinghong Lin, Hardik Shah, Mike Zheng Shou, Rama Chellappa, Pengchuan Zhang

    Abstract: Video-language pre-training (VLP) has become increasingly important due to its ability to generalize to various vision and language tasks. However, existing egocentric VLP frameworks utilize separate video and language encoders and learn task-specific cross-modal information only during fine-tuning, limiting the development of a unified system. In this work, we introduce the second generation of e… ▽ More

    Submitted 18 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Published in ICCV 2023

  49. arXiv:2307.02028  [pdf, other

    cs.LG cs.AI cs.CL

    EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models

    Authors: Michael Wornow, Rahul Thapa, Ethan Steinberg, Jason A. Fries, Nigam H. Shah

    Abstract: While the general machine learning (ML) community has benefited from public datasets, tasks, and models, the progress of ML in healthcare has been hampered by a lack of such shared assets. The success of foundation models creates new challenges for healthcare ML by requiring access to shared pretrained models to validate performance benefits. We help address these challenges through three contribu… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  50. arXiv:2306.05785  [pdf, other

    cs.LG

    End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates

    Authors: Anshul Nasery, Hardik Shah, Arun Sai Suggala, Prateek Jain

    Abstract: Neural network (NN) compression via techniques such as pruning, quantization requires setting compression hyperparameters (e.g., number of channels to be pruned, bitwidths for quantization) for each layer either manually or via neural architecture search (NAS) which can be computationally expensive. We address this problem by providing an end-to-end technique that optimizes for model's Floating Po… ▽ More

    Submitted 13 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.