Stars
This project builds a production-ready Click-Through Rate (CTR) prediction pipeline using real-world advertising data.
Convert Qwen3-Embedding-0.6B to ONNX format for Text Embeddings Inference (TEI)
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Zero-human, cold-start construction of long-chain agents in professional domains
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Open-source implementation of AlphaEvolve
A flexible, adaptive classification system for dynamic text classification
Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs
DSPy: The framework for programming—not prompting—language models
Latest AI jailbreak payloads & exploit techniques for GPT, Qwen, and other LLMs
DeepTeam is a framework to red team LLMs and LLM systems.
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLMs
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
[ArXiv 2025] Imperceptible Jailbreaking against Large Language Models
A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug b…
Two text jailbreak attacks against commercial black-box LLMs and a malicious content detection method, the latter of which is applied to red team dataset cleaning and jailbreak response detection.
Open-source implementation of an agent-centric red-teaming scanner inspired by Cloud Security Alliance guidance. Automated security testing for AI agents and LLM applications.
A sophisticated red-teaming agent built with LangGraph and Ollama to probe OpenAI's GPT-OSS-20B model for vulnerabilities and harmful behaviors. (Specifically built for the OpenAI Open Model Hackat…
A comprehensive dataset for Large Language Model (LLM) security evaluation, featuring three categories: Benign, Borderline, and Malicious. This repository provides critical data support for AI safe…
A.I.G (AI-Infra-Guard) is a comprehensive, intelligent, and easy-to-use AI Red Teaming platform developed by Tencent Zhuque Lab.
Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction (ACL2025)
The official implementation of our ICLR 2025 paper "One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMs".