akontra

akontra

30 followers · 327 following

Starred repositories

Pleias / toxic-commons

The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.

Python 18 2 Updated Nov 10, 2024

Nicozwy / AIGTD-Survey

The official GitHub page for the survey paper of AIGTD entitled "The Imitation Game Revisited: A Comprehensive Survey on Recent Advances in AI-generated Text Detection."

39 Updated Mar 1, 2025

desklib / ai-text-detector

Desklib's AI Text Detector

20 Updated Feb 17, 2025

liamdugan / raid

RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)

Python 74 25 Updated Jul 3, 2025

Pleias / OCRoscope

Small python package to measure OCR quality and other related metrics.

Python 24 3 Updated Feb 19, 2024

openSUSE / cavil

The legal review and SBOM system used by SUSE and openSUSE

Perl 59 7 Updated Jul 10, 2025

michelle123lam / lloom

Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.

Python 118 20 Updated Jun 4, 2025

chtmp223 / suri

Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)

Python 24 1 Updated Nov 10, 2024

chtmp223 / Frankentext

Frankentext: Stitching random text fragments into long-form narratives

5 Updated Jun 2, 2025

SakanaAI / text-to-lora

Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input

Python 800 47 Updated Jun 8, 2025

microsoft / presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python 5,002 687 Updated Jul 13, 2025

quotient-ai / judges

A small library of LLM judges

Python 229 27 Updated Jun 25, 2025

QwenLM / AutoIF

Python 294 26 Updated Jul 25, 2024

tomaarsen / attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 701 41 Updated Apr 10, 2024

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!

Python 1,366 92 Updated Jul 13, 2025

jkvt2 / hierloss

Hierarchical Loss function

Python 14 2 Updated May 6, 2019

jsvine / waybackpack

Download the entire Wayback Machine archive for a given URL.

Python 3,049 204 Updated Apr 21, 2025

stephanlensky / zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

Python 600 46 Updated Jul 12, 2025

ma787639046 / bowdpr

[SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Python 17 1 Updated Feb 29, 2024

ultrafunkamsterdam / nodriver

Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, webscraping, bots and any other creative ideas which are normally hindered by annoying anti bot systems …

Python 2,691 269 Updated Jul 6, 2025

plageon / HtmlRAG

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)

Python 430 34 Updated Jun 11, 2025

explosion / sense2vec

🦆 Contextually-keyed word vectors

Python 1,655 241 Updated Apr 23, 2025

hotwired / stimulus

A modest JavaScript framework for the HTML you already have

TypeScript 12,921 432 Updated Jun 24, 2025

GAIR-NLP / DeepResearcher

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 504 41 Updated Apr 13, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,883 1,796 Updated Jul 13, 2025

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 980 89 Updated May 13, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

PowerShell 19,295 1,099 Updated Jul 11, 2025

overlookmotel / yauzl-mac

Unzipping with yauzl with added support for Mac OS Archive Utility ZIP files

JavaScript 1 1 Updated Aug 3, 2023

denorg / scrypt

🔑 Deno library for hashing passwords using scrypt

JavaScript 21 5 Updated Apr 22, 2025

fpgaminer / joytag

The JoyTag Image Tagging Model

Python 510 32 Updated May 18, 2024

akontra

Starred repositories

golang

Swift

Fortran

Publishing

PHP

Natural language processing

Machine learning

Font

Deep learning

Data visualization