这是indexloc提供的服务,不要输入任何密码
Skip to main content

Showing 1–50 of 94 results for author: Kannan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2510.03155  [pdf, ps, other

    q-bio.NC cs.AI cs.LG

    Stimulus-Voltage-Based Prediction of Action Potential Onset Timing: Classical vs. Quantum-Inspired Approaches

    Authors: Stevens Johnson, Varun Puram, Johnson Thomas, Acsah Konuparamban, Ashwin Kannan

    Abstract: Accurate modeling of neuronal action potential (AP) onset timing is crucial for understanding neural coding of danger signals. Traditional leaky integrate-and-fire (LIF) models, while widely used, exhibit high relative error in predicting AP onset latency, especially under strong or rapidly changing stimuli. Inspired by recent experimental findings and quantum theory, we present a quantum-inspired… ▽ More

    Submitted 3 October, 2025; originally announced October 2025.

  2. arXiv:2508.06687  [pdf

    cs.RO

    Optimal Planning and Machine Learning for Responsive Tracking and Enhanced Forecasting of Wildfires using a Spacecraft Constellation

    Authors: Sreeja Roy-Singh, Vinay Ravindra, Richard Levinson, Mahta Moghaddam, Jan Mandel, Adam Kochanski, Angel Farguell Caus, Kurtis Nelson, Samira Alkaee Taleghan, Archana Kannan, Amer Melebari

    Abstract: We propose a novel concept of operations using optimal planning methods and machine learning (ML) to collect spaceborne data that is unprecedented for monitoring wildfires, process it to create new or enhanced products in the context of wildfire danger or spread monitoring, and assimilate them to improve existing, wildfire decision support tools delivered to firefighters within latency appropriate… ▽ More

    Submitted 15 August, 2025; v1 submitted 8 August, 2025; originally announced August 2025.

    Journal ref: IJCAI 2025 - Artificial Intelligence and Sustainability Workshop

  3. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3410 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 16 October, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  4. arXiv:2507.03226  [pdf, ps, other

    cs.AI

    Efficient Knowledge Graph Construction and Retrieval from Unstructured Text for Large-Scale RAG Systems

    Authors: Congmin Min, Rhea Mathew, Joyce Pan, Sahil Bansal, Abbas Keshavarzi, Amar Viswanathan Kannan

    Abstract: We propose a scalable and cost-efficient framework for deploying Graph-based Retrieval Augmented Generation (GraphRAG) in enterprise environments. While GraphRAG has shown promise for multi-hop reasoning and structured retrieval, its adoption has been limited by the high computational cost of constructing knowledge graphs using large language models (LLMs) and the latency of graph-based retrieval.… ▽ More

    Submitted 7 August, 2025; v1 submitted 3 July, 2025; originally announced July 2025.

  5. arXiv:2506.22591  [pdf, ps, other

    cs.CV

    BrainMT: A Hybrid Mamba-Transformer Architecture for Modeling Long-Range Dependencies in Functional MRI Data

    Authors: Arunkumar Kannan, Martin A. Lindquist, Brian Caffo

    Abstract: Recent advances in deep learning have made it possible to predict phenotypic measures directly from functional magnetic resonance imaging (fMRI) brain volumes, sparking significant interest in the neuroimaging community. However, existing approaches, primarily based on convolutional neural networks or transformer architectures, often struggle to model the complex relationships inherent in fMRI dat… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Accepted at MICCAI 2025

  6. arXiv:2503.03784  [pdf, ps, other

    q-bio.NC cs.LG cs.NE

    Neural Models of Task Adaptation: A Tutorial on Spiking Networks for Executive Control

    Authors: Ashwin Viswanathan Kannan, Madhumitha Ganesan

    Abstract: Understanding cognitive flexibility and task-switching mechanisms in neural systems requires biologically plausible computational models. This tutorial presents a step-by-step approach to constructing a spiking neural network (SNN) that simulates task-switching dynamics within the cognitive control network. The model incorporates biologically realistic features, including lateral inhibition, adapt… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 6 pages

  7. arXiv:2503.03084  [pdf, other

    cs.LG cs.AI cs.DC cs.NE

    Hopfield Networks Meet Big Data: A Brain-Inspired Deep Learning Framework for Semantic Data Linking

    Authors: Ashwin Viswanathan Kannan, Johnson P Thomas, Abhimanyu Mukerji

    Abstract: The exponential rise in data generation has led to vast, heterogeneous datasets crucial for predictive analytics and decision-making. Ensuring data quality and semantic integrity remains a challenge. This paper presents a brain-inspired distributed cognitive framework that integrates deep learning with Hopfield networks to identify and link semantically related attributes across datasets. Modeled… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 7 pages

  8. arXiv:2502.15069  [pdf, other

    cs.CL cs.AI

    Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease

    Authors: Elliot Schumacher, Dhruv Naik, Anitha Kannan

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in disease diagnosis. However, their effectiveness in identifying rarer diseases, which are inherently more challenging to diagnose, remains an open question. Rare disease performance is critical with the increasing use of LLMs in healthcare settings. This is especially true if a primary care physician needs to make a rarer pro… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  9. arXiv:2502.05439  [pdf, other

    cs.AI cs.CE cs.CL cs.LG

    Agentic AI Systems Applied to tasks in Financial Services: Modeling and model risk management crews

    Authors: Izunna Okpala, Ashkan Golgoon, Arjun Ravi Kannan

    Abstract: The advent of large language models has ushered in a new era of agentic systems, where artificial intelligence programs exhibit remarkable autonomous decision-making capabilities across diverse domains. This paper explores agentic system workflows in the financial services industry. In particular, we build agentic crews with human-in-the-loop module that can effectively collaborate to perform comp… ▽ More

    Submitted 29 April, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    MSC Class: 68T01 (Primary) 68T05; 68N99; 68T05; 68T20; 68T50; 62H30; 65C20; 68P20 (Secondary) ACM Class: I.2.0; I.2.1; I.2.2; I.2.6; I.2.7; I.5.1; I.6.0; I.7.1

  10. arXiv:2411.14868  [pdf

    cs.CV

    Defective Edge Detection Using Cascaded Ensemble Canny Operator

    Authors: Anjali Nambiyar Rajkumar Kannan

    Abstract: Edge detection has been one of the most difficult challenges in computer vision because of the difficulty in identifying the borders and edges from the real-world images including objects of varying kinds and sizes. Methods based on ensemble learning, which use a combination of backbones and attention modules, outperformed more conventional approaches, such as Sobel and Canny edge detection. Never… ▽ More

    Submitted 24 December, 2024; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: 2 Pages and 2 Figures

  11. arXiv:2411.00287  [pdf, other

    cs.LG cs.AI cs.CE math.NA stat.ML

    MBExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings

    Authors: Ashkan Golgoon, Ryan Franks, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In many industrial applications, it is common that the graph embeddings generated from training GNNs are used in an ensemble model where the embeddings are combined with other tabular features (e.g., original node or edge features) in a downstream ML task. The tabular features may even arise naturally if, e.g., one tries to build a graph such that some of the node or edge features are stored in a… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    MSC Class: 68T01 ACM Class: I.2

    Journal ref: Mach Learn 114, 201 (2025)

  12. arXiv:2410.06225  [pdf, other

    cs.LG cs.AI

    A Timeline and Analysis for Representation Plasticity in Large Language Models

    Authors: Akshat Kannan

    Abstract: The ability to steer AI behavior is crucial to preventing its long term dangerous and catastrophic potential. Representation Engineering (RepE) has emerged as a novel, powerful method to steer internal model behaviors, such as "honesty", at a top-down level. Understanding the steering of representations should thus be placed at the forefront of alignment initiatives. Unfortunately, current efforts… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  13. arXiv:2410.02898  [pdf, other

    eess.SY cs.LG cs.RO

    Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients

    Authors: Gabriel Chenevert, Jingqi Li, Achyuta kannan, Sangjae Bae, Donggun Lee

    Abstract: Reach-Avoid-Stay (RAS) optimal control enables systems such as robots and air taxis to reach their targets, avoid obstacles, and stay near the target. However, current methods for RAS often struggle with handling complex, dynamic environments and scaling to high-dimensional systems. While reinforcement learning (RL)-based reachability analysis addresses these challenges, it has yet to tackle the R… ▽ More

    Submitted 7 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

  14. arXiv:2408.11967  [pdf, other

    cs.LG econ.EM stat.AP

    Valuing an Engagement Surface using a Large Scale Dynamic Causal Model

    Authors: Abhimanyu Mukerji, Sushant More, Ashwin Viswanathan Kannan, Lakshmi Ravi, Hua Chen, Naman Kohli, Chris Khawand, Dinesh Mandalapu

    Abstract: With recent rapid growth in online shopping, AI-powered Engagement Surfaces (ES) have become ubiquitous across retail services. These engagement surfaces perform an increasing range of functions, including recommending new products for purchase, reminding customers of their orders and providing delivery notifications. Understanding the causal effect of engagement surfaces on value driven for custo… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures. Accepted at Applied Data Science track of KDD 2024, Barcelona, Spain

  15. arXiv:2407.11215  [pdf, other

    cs.LG cs.AI cs.CE cs.CL math.NA

    Mechanistic interpretability of large language models with applications to the financial services industry

    Authors: Ashkan Golgoon, Khashayar Filom, Arjun Ravi Kannan

    Abstract: Large Language Models such as GPTs (Generative Pre-trained Transformers) exhibit remarkable capabilities across a broad spectrum of applications. Nevertheless, due to their intrinsic complexity, these models present substantial challenges in interpreting their internal decision-making processes. This lack of transparency poses critical challenges when it comes to their adaptation by financial inst… ▽ More

    Submitted 15 October, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 68T01 ACM Class: I.2.7

    Journal ref: 5th ACM International Conference on AI in Finance (ICAIF 2024)

  16. arXiv:2406.02778  [pdf, ps, other

    cs.LG

    MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

    Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

    Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. We theoretically show that in Paley-Wiener spaces on combinatorial graphs, the spectral graph… ▽ More

    Submitted 8 August, 2025; v1 submitted 4 June, 2024; originally announced June 2024.

  17. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  18. arXiv:2312.14976  [pdf, other

    cs.CV cs.CY

    Gaussian Harmony: Attaining Fairness in Diffusion-based Face Generation Models

    Authors: Basudha Pal, Arunkumar Kannan, Ram Prabhakar Kathirvel, Alice J. O'Toole, Rama Chellappa

    Abstract: Diffusion models have achieved great progress in face generation. However, these models amplify the bias in the generation process, leading to an imbalance in distribution of sensitive attributes such as age, gender and race. This paper proposes a novel solution to this problem by balancing the facial attributes of the generated images. We mitigate the bias by localizing the means of the facial at… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  19. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  20. arXiv:2311.18071  [pdf, other

    cs.CV

    Turn Down the Noise: Leveraging Diffusion Models for Test-time Adaptation via Pseudo-label Ensembling

    Authors: Mrigank Raman, Rohan Shah, Akash Kannan, Pranit Chawla

    Abstract: The goal of test-time adaptation is to adapt a source-pretrained model to a continuously changing target domain without relying on any source data. Typically, this is either done by updating the parameters of the model (model adaptation) using inputs from the target domain or by modifying the inputs themselves (input adaptation). However, methods that modify the model suffer from the issue of comp… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted to Workshop on Distribution Shifts: New Frontiers with Foundation Models at Neurips 2023

  21. arXiv:2311.08303  [pdf, other

    cs.CL cs.AI

    Extrinsically-Focused Evaluation of Omissions in Medical Summarization

    Authors: Elliot Schumacher, Daniel Rosenthal, Dhruv Naik, Varun Nair, Luladay Price, Geoffrey Tso, Anitha Kannan

    Abstract: Large language models (LLMs) have shown promise in safety-critical applications such as healthcare, yet the ability to quantify performance has lagged. An example of this challenge is in evaluating a summary of the patient's medical record. A resulting summary can enable the provider to get a high-level overview of the patient's health status quickly. Yet, a summary that omits important facts abou… ▽ More

    Submitted 11 November, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted to ML4H 2024

  22. arXiv:2310.19797  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DEFT: Dexterous Fine-Tuning for Real-World Hand Policies

    Authors: Aditya Kannan, Kenneth Shaw, Shikhar Bahl, Pragna Mannam, Deepak Pathak

    Abstract: Dexterity is often seen as a cornerstone of complex manipulation. Humans are able to perform a host of skills with their hands, from making food to operating tools. In this paper, we investigate these challenges, especially in the case of soft, deformable objects as well as complex, relatively long-horizon tasks. However, learning such behaviors from scratch can be data inefficient. To circumvent… ▽ More

    Submitted 12 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In CoRL 2023. Website at https://dexterous-finetuning.github.io/

  23. arXiv:2306.03652  [pdf, other

    cs.CL

    Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue

    Authors: Maksim Eremeev, Ilya Valmianski, Xavier Amatriain, Anitha Kannan

    Abstract: Factual correctness is often the limiting factor in practical applications of natural language generation in high-stakes domains such as healthcare. An essential requirement for maintaining factuality is the ability to deal with rare tokens. This paper focuses on rare tokens that appear in both the source and the reference sequences, and which, when missed during generation, decrease the factual c… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ACL 2023 (main conference)

  24. arXiv:2305.14394  [pdf, other

    cs.NE cs.AI cs.LG q-bio.NC

    Unsupervised Spiking Neural Network Model of Prefrontal Cortex to study Task Switching with Synaptic deficiency

    Authors: Ashwin Viswanathan Kannan, Goutam Mylavarapu, Johnson P Thomas

    Abstract: In this study, we build a computational model of Prefrontal Cortex (PFC) using Spiking Neural Networks (SNN) to understand how neurons adapt and respond to tasks switched under short and longer duration of stimulus changes. We also explore behavioral deficits arising out of the PFC lesions by simulating lesioned states in our Spiking architecture model. Although there are some computational models… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  25. arXiv:2305.05982  [pdf, other

    cs.CL cs.AI cs.LG

    Generating medically-accurate summaries of patient-provider dialogue: A multi-stage approach using large language models

    Authors: Varun Nair, Elliot Schumacher, Anitha Kannan

    Abstract: A medical provider's summary of a patient visit serves several critical purposes, including clinical decision-making, facilitating hand-offs between providers, and as a reference for the patient. An effective summary is required to be coherent and accurately capture all the medically relevant information in the dialogue, despite the complexity of patient-generated language. Even minor inaccuracies… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  26. arXiv:2304.14364  [pdf, other

    cs.CL cs.AI cs.LG

    CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants

    Authors: Albert Yu Sun, Varun Nair, Elliot Schumacher, Anitha Kannan

    Abstract: A wave of new task-based virtual assistants has been fueled by increasingly powerful large language models (LLMs), such as GPT-4 (OpenAI, 2023). A major challenge in deploying LLM-based virtual conversational assistants in real world settings is ensuring they operate within what is admissible for the task. To overcome this challenge, the designers of these virtual assistants rely on an independent… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: To appear in NAACL 2024

  27. arXiv:2304.01974  [pdf, other

    cs.CL cs.IR

    Dialogue-Contextualized Re-ranking for Medical History-Taking

    Authors: Jian Zhu, Ilya Valmianski, Anitha Kannan

    Abstract: AI-driven medical history-taking is an important component in symptom checking, automated patient intake, triage, and other AI virtual care applications. As history-taking is extremely varied, machine learning models require a significant amount of data to train. To overcome this challenge, existing systems are developed using indirect data or expert knowledge. This leads to a training-inference g… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Code and pre-trained S4 checkpoints will be available after publication

  28. arXiv:2303.17071  [pdf, other

    cs.CL cs.AI cs.LG

    DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents

    Authors: Varun Nair, Elliot Schumacher, Geoffrey Tso, Anitha Kannan

    Abstract: Large language models (LLMs) have emerged as valuable tools for many natural language understanding tasks. In safety-critical applications such as healthcare, the utility of these models is governed by their ability to generate outputs that are factually accurate and complete. In this work, we present dialog-enabled resolving agents (DERA). DERA is a paradigm made possible by the increased convers… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  29. arXiv:2303.10216  [pdf, other

    cs.LG math.PR

    Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features

    Authors: Konstandinos Kotsiopoulos, Alexey Miroshnikov, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 31 pages, 6 figures

  30. arXiv:2302.12862  [pdf, other

    cs.LG cs.DC

    FLINT: A Platform for Federated Learning Integration

    Authors: Ewen Wang, Ajay Kannan, Yuefeng Liang, Boyi Chen, Mosharaf Chowdhury

    Abstract: Cross-device federated learning (FL) has been well-studied from algorithmic, system scalability, and training speed perspectives. Nonetheless, moving from centralized training to cross-device FL for millions or billions of devices presents many risks, including performance loss, developer inertia, poor user experience, and unexpected application failures. In addition, the corresponding infrastruct… ▽ More

    Submitted 10 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Preprint for MLSys 2023

    MSC Class: F.2.2; I.2.7

  31. On marginal feature attributions of tree-based models

    Authors: Khashayar Filom, Alexey Miroshnikov, Konstandinos Kotsiopoulos, Arjun Ravi Kannan

    Abstract: Due to their power and ease of use, tree-based machine learning models, such as random forests and gradient-boosted tree ensembles, have become very popular. To interpret them, local feature attributions based on marginal expectations, e.g. marginal (interventional) Shapley, Owen or Banzhaf values, may be employed. Such methods are true to the model and implementation invariant, i.e. dependent onl… ▽ More

    Submitted 5 May, 2024; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Minor corrections. 30 pages+appendix (64 pages in total), 10 figures. To appear in Foundations of Data Science

    MSC Class: Primary: 68T01; 91A12; 91A80; 05A19; Secondary: 91A68; 91A06; 05C05

  32. arXiv:2210.02658  [pdf, other

    cs.CL

    Learning functional sections in medical conversations: iterative pseudo-labeling and human-in-the-loop approach

    Authors: Mengqian Wang, Ilya Valmianski, Xavier Amatriain, Anitha Kannan

    Abstract: Medical conversations between patients and medical professionals have implicit functional sections, such as "history taking", "summarization", "education", and "care plan." In this work, we are interested in learning to automatically extract these sections. A direct approach would require collecting large amounts of expert annotations for this task, which is inherently costly due to the contextual… ▽ More

    Submitted 7 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Changed the github link as it was invalid

  33. arXiv:2207.05817  [pdf, other

    cs.CL cs.AI cs.LG

    OSLAT: Open Set Label Attention Transformer for Medical Entity Retrieval and Span Extraction

    Authors: Raymond Li, Ilya Valmianski, Li Deng, Xavier Amatriain, Anitha Kannan

    Abstract: Medical entity span extraction and linking are critical steps for many healthcare NLP tasks. Most existing entity extraction methods either have a fixed vocabulary of medical entities or require span annotations. In this paper, we propose a method for linking an open set of entities that does not require any span annotations. Our method, Open Set Label Attention Transformer (OSLAT), uses the label… ▽ More

    Submitted 20 November, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 18 pages, 2 figures, Camera-Ready for ML4H 2022 (Proceedings Track)

  34. arXiv:2202.11043  [pdf, other

    stat.ML cs.CR cs.LG econ.EM

    Differentially Private Estimation of Heterogeneous Causal Effects

    Authors: Fengshi Niu, Harsha Nori, Brian Quistorff, Rich Caruana, Donald Ngwe, Aadharsh Kannan

    Abstract: Estimating heterogeneous treatment effects in domains such as healthcare or social science often involves sensitive data where protecting privacy is important. We introduce a general meta-algorithm for estimating conditional average treatment effects (CATE) with differential privacy (DP) guarantees. Our meta-algorithm can work with simple, single-stage CATE estimators such as S-learner and more co… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  35. arXiv:2111.11259  [pdf, other

    cs.LG math.PR

    Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

    Abstract: This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 29 pages, 32 figures

    MSC Class: 49Q22; 91A12; 68T01

  36. arXiv:2111.09381  [pdf, other

    cs.CL cs.AI cs.LG

    MEDCOD: A Medically-Accurate, Emotive, Diverse, and Controllable Dialog System

    Authors: Rhys Compton, Ilya Valmianski, Li Deng, Costa Huang, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: We present MEDCOD, a Medically-Accurate, Emotive, Diverse, and Controllable Dialog system with a unique approach to the natural language generator module. MEDCOD has been developed and evaluated specifically for the history taking task. It integrates the advantage of a traditional modular approach to incorporate (medical) domain knowledge with modern deep learning techniques to generate flexible,… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 9 pages. Accepted at Machine Learning for Health (ML4H) 2021

  37. arXiv:2111.07564  [pdf, other

    cs.CL cs.AI cs.LG

    Adding more data does not always help: A study in medical conversation summarization with PEGASUS

    Authors: Varun Nair, Namit Katariya, Xavier Amatriain, Ilya Valmianski, Anitha Kannan

    Abstract: Medical conversation summarization is integral in capturing information gathered during interactions between patients and physicians. Summarized conversations are used to facilitate patient hand-offs between physicians, and as part of providing care in the future. Summaries, however, can be time-consuming to produce and require domain expertise. Modern pre-trained NLP models such as PEGASUS have e… ▽ More

    Submitted 28 November, 2021; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to Machine Learning for Healthcare Workshop, NeurIPS 2021

  38. arXiv:2110.07356  [pdf, other

    cs.CL cs.AI cs.LG

    Medically Aware GPT-3 as a Data Generator for Medical Dialogue Summarization

    Authors: Bharath Chintagunta, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: In medical dialogue summarization, summaries must be coherent and must capture all the medically relevant information in the dialogue. However, learning effective models for summarization require large amounts of labeled data which is especially hard to obtain. We present an algorithm to create synthetic training data with an explicit focus on capturing medically relevant information. We utilize G… ▽ More

    Submitted 9 September, 2021; originally announced October 2021.

    Comments: Accepted to Machine learning for healthcare 2021

  39. arXiv:2104.12950  [pdf, other

    cs.AI cs.CL

    Document Structure aware Relational Graph Convolutional Networks for Ontology Population

    Authors: Abhay M Shalghar, Ayush Kumar, Balaji Ganesan, Aswin Kannan, Akshay Parekh, Shobha G

    Abstract: Ontologies comprising of concepts, their attributes, and relationships are used in many knowledge based AI systems. While there have been efforts towards populating domain specific ontologies, we examine the role of document structure in learning ontological relationships between concepts in any document corpus. Inspired by ideas from hypernym discovery and explainability, our method performs abou… ▽ More

    Submitted 12 April, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 8 pages single column, 5 figures. DLG4NLP Workshop at ICLR 2022

  40. arXiv:2104.04487  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Language model fusion for streaming end to end speech recognition

    Authors: Rodrigo Cabrera, Xiaofeng Liu, Mohammadreza Ghodsi, Zebulun Matteson, Eugene Weinstein, Anjuli Kannan

    Abstract: Streaming processing of speech audio is required for many contemporary practical speech recognition tasks. Even with the large corpora of manually transcribed speech data available today, it is impossible for such corpora to cover adequately the long tail of linguistic content that's important for tasks such as open-ended dictation and voice search. We seek to address both the streaming and the ta… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 5 pages

  41. arXiv:2102.10878  [pdf, other

    cs.GT math.PR

    Stability theory of game-theoretic group feature explanations for machine learning models

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Khashayar Filom, Arjun Ravi Kannan

    Abstract: In this article, we study feature attributions of Machine Learning (ML) models originating from linear game values and coalitional values defined as operators on appropriate functional spaces. The main focus is on random games based on the conditional and marginal expectations. The first part of our work formulates a stability theory for these explanation operators by establishing certain bounds f… ▽ More

    Submitted 10 August, 2024; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: 82 pages, 43 figures. Typos fixed. Some technical results have been improved

    MSC Class: 91A06; 91A12; 91A80; 46N30; 46N99; 68T01

  42. arXiv:2011.06874  [pdf, other

    cs.CL cs.LG

    Medical symptom recognition from patient text: An active learning approach for long-tailed multilabel distributions

    Authors: Ali Mottaghi, Prathusha K Sarma, Xavier Amatriain, Serena Yeung, Anitha Kannan

    Abstract: We study the problem of medical symptoms recognition from patient text, for the purposes of gathering pertinent information from the patient (known as history-taking). A typical patient text is often descriptive of the symptoms the patient is experiencing and a single instance of such a text can be "labeled" with multiple symptoms. This makes learning a medical symptoms recognizer challenging on a… ▽ More

    Submitted 28 March, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  43. Wasserstein-based fairness interpretability framework for machine learning models

    Authors: Alexey Miroshnikov, Konstandinos Kotsiopoulos, Ryan Franks, Arjun Ravi Kannan

    Abstract: The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the fa… ▽ More

    Submitted 8 March, 2022; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: 39 pages. (submitted for publication)

    MSC Class: 49Q22; 91A12; 68T01; 90C08

    Journal ref: Machine Learning Journal (2022), Springer

  44. arXiv:2009.08666  [pdf, other

    cs.CL cs.AI cs.LG

    Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local Structures

    Authors: Anirudh Joshi, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: Understanding a medical conversation between a patient and a physician poses a unique natural language understanding challenge since it combines elements of standard open ended conversation with very domain specific elements that require expertise and medical knowledge. Summarization of medical conversations is a particularly important aspect of medical conversation understanding since it addresse… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted for publication in Findings of EMNLP at EMNLP 2020

  45. arXiv:2008.13546  [pdf, other

    cs.IR cs.CL cs.LG

    Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs

    Authors: Clara H. McCreery, Namit Katariya, Anitha Kannan, Manish Chablani, Xavier Amatriain

    Abstract: People increasingly search online for answers to their medical questions but the rate at which medical questions are asked online significantly exceeds the capacity of qualified people to answer them. This leaves many questions unanswered or inadequately answered. Many of these questions are not unique, and reliable identification of similar questions would enable more efficient and effective ques… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.04192

  46. arXiv:2008.03323  [pdf, other

    cs.AI cs.LG

    COVID-19 in differential diagnosis of online symptom assessments

    Authors: Anitha Kannan, Richard Chen, Vignesh Venkataraman, Geoffrey J. Tso, Xavier Amatriain

    Abstract: The COVID-19 pandemic has magnified an already existing trend of people looking for healthcare solutions online. One class of solutions are symptom checkers, which have become very popular in the context of COVID-19. Traditional symptom checkers, however, are based on manually curated expert systems that are inflexible and hard to modify, especially in a quickly changing situation like the one we… ▽ More

    Submitted 30 November, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: Accepted at the Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  47. arXiv:2004.09571  [pdf, other

    eess.AS cs.SD stat.ML

    Language-agnostic Multilingual Modeling

    Authors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark

    Abstract: Multilingual Automated Speech Recognition (ASR) systems allow for the joint training of data-rich and data-scarce languages in a single model. This enables data and parameter sharing across languages, which is especially beneficial for the data-scarce languages. However, most state-of-the-art multilingual models require the encoding of language information and therefore are not as flexible or scal… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  48. arXiv:2003.12710  [pdf, other

    cs.CL cs.LG cs.SD

    A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

    Authors: Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman , et al. (4 additional authors not shown)

    Abstract: Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i.e., word error rate (WER), and latency, i.e., the time the hypothesis is finalized after the user stops speaking. In this paper, we develop a first-pass Recurrent Neural Network Transducer (RNN-T) model and a second-pass Listen, Attend, Spell (LAS) rescorer that… ▽ More

    Submitted 1 May, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

    Comments: In Proceedings of IEEE ICASSP 2020

  49. arXiv:1912.08041  [pdf, other

    cs.LG stat.ML

    The accuracy vs. coverage trade-off in patient-facing diagnosis models

    Authors: Anitha Kannan, Jason Alan Fries, Eric Kramer, Jen Jen Chen, Nigam Shah, Xavier Amatriain

    Abstract: A third of adults in America use the Internet to diagnose medical concerns, and online symptom checkers are increasingly part of this process. These tools are powered by diagnosis models similar to clinical decision support systems, with the primary difference being the coverage of symptoms and diagnoses. To be useful to patients and physicians, these models must have high accuracy while covering… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

  50. arXiv:1911.08554  [pdf, other

    cs.CL cs.AI cs.LG

    Classification as Decoder: Trading Flexibility for Control in Medical Dialogue

    Authors: Sam Shleifer, Manish Chablani, Anitha Kannan, Namit Katariya, Xavier Amatriain

    Abstract: Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deeper understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control, a concerning tradeoff in doctor/patient interactions. Inaccuracies, typos, or unde… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract. arXiv admin note: substantial text overlap with arXiv:1910.03476