这是indexloc提供的服务,不要输入任何密码
Skip to main content

Showing 1–11 of 11 results for author: Peter, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3284 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 22 July, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2505.06265  [pdf, ps, other

    cs.LG

    ONERA's CRM WBPN database for machine learning activities, related regression challenge and first results

    Authors: Jacques Peter, Quentin Bennehard, Sébastien Heib, Jean-Luc Hantrais-Gervois, Frédéric Moëns

    Abstract: This paper presents a new Computational Fluid Dynamics database, developed at ONERA, to support the advancement of machine learning techniques for aerodynamic field prediction. It contains 468 Reynolds-Averaged Navier-Stokes simulations using the Spalart-Allmaras turbulence model, performed on the NASA/Boeing Common Research Model wing-body-pylon-nacelle configuration. The database spans a wide ra… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 16 pages, 9 figures

  3. arXiv:2503.24013  [pdf, other

    cs.CL

    You Cannot Feed Two Birds with One Score: the Accuracy-Naturalness Tradeoff in Translation

    Authors: Gergely Flamich, David Vilar, Jan-Thorsten Peter, Markus Freitag

    Abstract: The goal of translation, be it by human or by machine, is, given some text in a source language, to produce text in a target language that simultaneously 1) preserves the meaning of the source text and 2) achieves natural expression in the target language. However, researchers in the machine translation community usually assess translations using a single score intended to capture semantic accurac… ▽ More

    Submitted 1 April, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: Corrected a typo in Eq (3)

  4. arXiv:2503.19786  [pdf, other

    cs.CL cs.AI

    Gemma 3 Technical Report

    Authors: Gemma Team, Aishwarya Kamath, Johan Ferret, Shreya Pathak, Nino Vieillard, Ramona Merhej, Sarah Perrin, Tatiana Matejovicova, Alexandre Ramé, Morgane Rivière, Louis Rouillard, Thomas Mesnard, Geoffrey Cideron, Jean-bastien Grill, Sabela Ramos, Edouard Yvinec, Michelle Casbon, Etienne Pot, Ivo Penchev, Gaël Liu, Francesco Visin, Kathleen Kenealy, Lucas Beyer, Xiaohai Zhai, Anton Tsitsulin , et al. (191 additional authors not shown)

    Abstract: We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context - at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achie… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  5. arXiv:2503.14240  [pdf, other

    cs.LG

    Persistent Homology-induced Graph Ensembles for Time Series Regressions

    Authors: Viet The Nguyen, Duy Anh Pham, An Thai Le, Jans Peter, Gunther Gust

    Abstract: The effectiveness of Spatio-temporal Graph Neural Networks (STGNNs) in time-series applications is often limited by their dependence on fixed, hand-crafted input graph structures. Motivated by insights from the Topological Data Analysis (TDA) paradigm, of which real-world data exhibits multi-scale patterns, we construct several graphs using Persistent Homology Filtration -- a mathematical framewor… ▽ More

    Submitted 19 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  6. arXiv:2503.09238  [pdf, other

    cs.SE

    Smart Feeding Station: Non-Invasive, Automated IoT Monitoring of Goodman's Mouse Lemurs in a Semi-Natural Rainforest Habitat

    Authors: Jonas Peter, Victor Luder, Leyla Rivero Davis, Lukas Schulthess, Michele Magno

    Abstract: In recent years, zoological institutions have made significant strides to reimagine ex situ animal habitats, moving away from traditional single-species enclosures towards expansive multi-species environments, more closely resembling semi-natural ecosystems. This paradigm shift, driven by a commitment to animal welfare, encourages a broader range of natural behaviors through abiotic and biotic int… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted to IEEE International Instrumentation and Measurement Technology Conference(I2MTC) (6 pages)

  7. arXiv:2311.05350  [pdf, other

    cs.CL

    There's no Data Like Better Data: Using QE Metrics for MT Data Filtering

    Authors: Jan-Thorsten Peter, David Vilar, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Markus Freitag

    Abstract: Quality Estimation (QE), the evaluation of machine translation output without the need of explicit references, has seen big improvements in the last years with the use of neural metrics. In this paper we analyze the viability of using QE metrics for filtering out bad quality sentence pairs in the training data of neural machine translation systems~(NMT). While most corpus filtering methods are foc… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: to be published at WMT23

  8. arXiv:1909.08185  [pdf, ps, other

    cs.IT cs.LG eess.SP

    Learned-SBL: A Deep Learning Architecture for Sparse Signal Recovery

    Authors: Rubin Jose Peter, Chandra R. Murthy

    Abstract: In this paper, we present a computationally efficient sparse signal recovery scheme using Deep Neural Networks (DNN). The architecture of the introduced neural network is inspired from sparse Bayesian learning (SBL) and named as Learned-SBL (L-SBL). We design a common architecture to recover sparse as well as block sparse vectors from single measurement vector (SMV) or multiple measurement vectors… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: 13 pages, 22 figures

  9. Local System Voting Feature for Machine Translation System Combination

    Authors: Markus Freitag, Jan-Thorsten Peter, Stephan Peitz, Minwei Feng, Hermann Ney

    Abstract: In this paper, we enhance the traditional confusion network system combination approach with an additional model trained by a neural network. This work is motivated by the fact that the commonly used binary system voting models only assign each input system a global weight which is responsible for the global impact of each input system on all translations. This prevents individual systems with low… ▽ More

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: published WMT 2015

    Journal ref: Proceedings of the Tenth Workshop on Statistical Machine Translation (WMT), 2015

  10. arXiv:1607.01628  [pdf, other

    cs.CL cs.NE

    Guided Alignment Training for Topic-Aware Neural Machine Translation

    Authors: Wenhu Chen, Evgeny Matusov, Shahram Khadivi, Jan-Thorsten Peter

    Abstract: In this paper, we propose an effective way for biasing the attention mechanism of a sequence-to-sequence neural machine translation (NMT) model towards the well-studied statistical word alignment models. We show that our novel guided alignment training approach improves translation quality on real-life e-commerce texts consisting of product titles and descriptions, overcoming the problems posed by… ▽ More

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: 11 pages

  11. arXiv:1005.4585  [pdf

    cs.OH

    A Novel Algorithm for Informative Meta Similarity Clusters Using Minimum Spanning Tree

    Authors: S. John Peter, S. P. Victor

    Abstract: The minimum spanning tree clustering algorithm is capable of detecting clusters with irregular boundaries. In this paper we propose two minimum spanning trees based clustering algorithm. The first algorithm produces k clusters with center and guaranteed intra-cluster similarity. The radius and diameter of k clusters are computed to find the tightness of k clusters. The variance of the k clusters a… ▽ More

    Submitted 6 May, 2010; originally announced May 2010.

    Comments: IEEE Publication format, International Journal of Computer Science and Information Security, IJCSIS, Vol. 8 No. 1, April 2010, USA. ISSN 1947 5500, http://sites.google.com/site/ijcsis/